Skip to main content

Showing 1–14 of 14 results for author: Kankanhalli, M S

.
  1. arXiv:2305.05962  [pdf, other

    cs.CY

    A Comprehensive Picture of Factors Affecting User Willingness to Use Mobile Health Applications

    Authors: Shao**g Fan, Ramesh C. Jain, Mohan S. Kankanhalli

    Abstract: Mobile health (mHealth) applications have become increasingly valuable in preventive healthcare and in reducing the burden on healthcare organizations. The aim of this paper is to investigate the factors that influence user acceptance of mHealth apps and identify the underlying structure that shapes users' behavioral intention. An online study that employed factorial survey design with vignettes w… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  2. arXiv:2201.09193  [pdf, other

    cs.CV cs.LG

    Learning to Minimize the Remainder in Supervised Learning

    Authors: Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

    Abstract: The learning process of deep learning methods usually updates the model's parameters in multiple iterations. Each iteration can be viewed as the first-order approximation of Taylor's series expansion. The remainder, which consists of higher-order terms, is usually ignored in the learning process for simplicity. This learning scheme empowers various multimedia based applications, such as image retr… ▽ More

    Submitted 6 March, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE TMM

  3. arXiv:2110.00529  [pdf, other

    cs.CV cs.LG

    Unsupervised Motion Representation Learning with Capsule Autoencoders

    Authors: Ziwei Xu, Xudong Shen, Yongkang Wong, Mohan S Kankanhalli

    Abstract: We propose the Motion Capsule Autoencoder (MCAE), which addresses a key challenge in the unsupervised learning of motion representations: transformation invariance. MCAE models motion in a two-level hierarchy. In the lower level, a spatio-temporal motion signal is divided into short, local, and semantic-agnostic snippets. In the higher level, the snippets are aggregated to form full-length semanti… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: Accepted by NeurIPS 2021

  4. arXiv:2110.00054  [pdf, other

    cs.CV

    Learning to Predict Trustworthiness with Steep Slope Loss

    Authors: Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

    Abstract: Understanding the trustworthiness of a prediction yielded by a classifier is critical for the safe and effective use of AI models. Prior efforts have been proven to be reliable on small-scale datasets. In this work, we study the problem of predicting trustworthiness on real-world large-scale datasets, where the task is more challenging due to high-dimensional features, diverse visual concepts, and… ▽ More

    Submitted 27 October, 2021; v1 submitted 30 September, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  5. arXiv:2007.05104  [pdf, other

    cs.CV cs.LG

    $n$-Reference Transfer Learning for Saliency Prediction

    Authors: Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

    Abstract: Benefiting from deep learning research and large-scale datasets, saliency prediction has achieved significant success in the past decade. However, it still remains challenging to predict saliency maps on images in new domains that lack sufficient data for data-hungry models. To solve this problem, we propose a few-shot transfer learning paradigm for saliency prediction, which enables efficient tra… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  6. arXiv:1912.08136  [pdf, other

    cs.LG cs.CV stat.ML

    Direction Concentration Learning: Enhancing Congruency in Machine Learning

    Authors: Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

    Abstract: One of the well-known challenges in computer vision tasks is the visual diversity of images, which could result in an agreement or disagreement between the learned knowledge and the visual content exhibited by the current observation. In this work, we first define such an agreement in a concepts learning process as congruency. Formally, given a particular task and sufficiently large dataset, the c… ▽ More

    Submitted 1 January, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: This is a preprint and the formal version has been published in TPAMI

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019

  7. arXiv:1909.01161  [pdf, other

    cs.AI cs.CV cs.MM

    Embedding Symbolic Knowledge into Deep Networks

    Authors: Yaqi Xie, Ziwei Xu, Mohan S. Kankanhalli, Kuldeep S. Meel, Harold Soh

    Abstract: In this work, we aim to leverage prior symbolic knowledge to improve the performance of deep models. We propose a graph embedding network that projects propositional formulae (and assignments) onto a manifold via an augmented Graph Convolutional Network (GCN). To generate semantically-faithful embeddings, we develop techniques to recognize node heterogeneity, and semantic regularization that incor… ▽ More

    Submitted 29 October, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: *Equal contribution; Accepted at conference Neural Information Processing Systems (NeurIPS), 2019

  8. arXiv:1812.05917  [pdf, other

    cs.CV

    Visual Social Relationship Recognition

    Authors: Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

    Abstract: Social relationships form the basis of social structure of humans. Develo** computational models to understand social relationships from visual data is essential for building intelligent machines that can better interact with humans in a social environment. In this work, we study the problem of visual social relationship recognition in images. We propose a Dual-Glance model for social relationsh… ▽ More

    Submitted 13 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: text overlap with arXiv:1708.00634

  9. arXiv:1809.01844  [pdf, other

    cs.CV

    Unsupervised Learning of View-invariant Action Representations

    Authors: Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

    Abstract: The recent success in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires significant amount of manually labeled data to achieve good performance. However, label collection is an expensive and time-consuming process. In this work, we propose an unsupervised learning framework, which exploits unlabeled data to learn video representations… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: NIPS 2018

  10. arXiv:1808.09796  [pdf, other

    cs.CV

    Interact as You Intend: Intention-Driven Human-Object Interaction Detection

    Authors: Bingjie Xu, Junnan Li, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

    Abstract: The recent advances in instance-level detection tasks lay strong foundation for genuine comprehension of the visual scenes. However, the ability to fully comprehend a social scene is still in its preliminary stage. In this work, we focus on detecting human-object interactions (HOIs) in social scene images, which is demanding in terms of research and increasingly useful for practical applications.… ▽ More

    Submitted 22 September, 2019; v1 submitted 29 August, 2018; originally announced August 2018.

  11. Video Storytelling: Textual Summaries for Events

    Authors: Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

    Abstract: Bridging vision and natural language is a longstanding goal in computer vision and multimedia research. While earlier works focus on generating a single-sentence description for visual content, recent works have studied paragraph generation. In this work, we introduce the problem of video storytelling, which aims at generating coherent and succinct stories for long videos. Video storytelling intro… ▽ More

    Submitted 14 May, 2020; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: Published in IEEE Transactions on Multimedia

    Journal ref: J. Li, Y. Wong, Q. Zhao and M. S. Kankanhalli, "Video Storytelling: Textual Summaries for Events," in IEEE Transactions on Multimedia, 2019

  12. arXiv:1708.00634  [pdf, other

    cs.CV

    Dual-Glance Model for Deciphering Social Relationships

    Authors: Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

    Abstract: Since the beginning of early civilizations, social relationships derived from each individual fundamentally form the basis of social structure in our daily life. In the computer vision literature, much progress has been made in scene understanding, such as object detection and scene parsing. Recent research focuses on the relationship between objects based on its functionality and geometrical rela… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

    Comments: IEEE International Conference on Computer Vision (ICCV), 2017

  13. arXiv:1501.00825  [pdf, ps, other

    cs.CV

    Group $K$-Means

    Authors: Jianfeng Wang, Shuicheng Yan, Yi Yang, Mohan S Kankanhalli, Shipeng Li, **gdong Wang

    Abstract: We study how to learn multiple dictionaries from a dataset, and approximate any data point by the sum of the codewords each chosen from the corresponding dictionary. Although theoretically low approximation errors can be achieved by the global solution, an effective solution has not been well studied in practice. To solve the problem, we propose a simple yet effective algorithm \textit{Group $K$-M… ▽ More

    Submitted 5 January, 2015; originally announced January 2015.

    Comments: The developed algorithm is similar with "Christopher F. Barnes, A new multiple path search technique for residual vector quantizers, 1994", but we conduct the research independently and apply it in data/feature compression and image retrieval

  14. arXiv:1307.4980  [pdf, other

    cs.GT cs.IR

    Multi-keyword multi-click advertisement option contracts for sponsored search

    Authors: Bowei Chen, Jun Wang, Ingemar J. Cox, Mohan S. Kankanhalli

    Abstract: In sponsored search, advertisement (abbreviated ad) slots are usually sold by a search engine to an advertiser through an auction mechanism in which advertisers bid on keywords. In theory, auction mechanisms have many desirable economic properties. However, keyword auctions have a number of limitations including: the uncertainty in payment prices for advertisers; the volatility in the search engin… ▽ More

    Submitted 9 December, 2015; v1 submitted 18 July, 2013; originally announced July 2013.

    Comments: Chen, Bowei and Wang, Jun and Cox, Ingemar J. and Kankanhalli, Mohan S. (2015) Multi-keyword multi-click advertisement option contracts for sponsored search. ACM Transactions on Intelligent Systems and Technology, 7 (1). pp. 1-29. ISSN: 2157-6904