Showing 1–42 of 42 results for author: Ai, W

  1. arXiv:2407.00119  [pdf, other]

    cs.LG cs.AI cs.CL

    Efficient Long-distance Latent Relation-aware Graph Neural Network for Multi-modal Emotion Recognition in Conversations

    Authors: Yuntao Shou, Wei Ai, Jiayi Du, Tao Meng, Haiyan Liu

    Abstract: The task of multi-modal emotion recognition in conversation (MERC) aims to analyze the genuine emotional state of each utterance based on the multi-modal information in the conversation, which is crucial for conversation understanding. Existing methods focus on using graph neural networks (GNN) to model conversational relationships and capture contextual latent semantic relationships. However, due…

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: 11 pages, 3 tables

  2. arXiv:2406.13114  [pdf, other]

    cs.CL cs.AI

    Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

    Authors: Yuhang Zhou, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, Furong Huang

    Abstract: Large language models (LLMs) have significantly advanced various natural language processing tasks, but deploying them remains computationally expensive. Knowledge distillation (KD) is a promising solution, enabling the transfer of capabilities from larger teacher LLMs to more compact student models. Particularly, sequence-level KD, which distills rationale-based reasoning processes instead of mer…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: preprint

  3. arXiv:2406.05322  [pdf, other]

    cs.CL cs.AI

    Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios

    Authors: Yuhang Zhou, Wei Ai

    Abstract: There is increasing interest in distilling task-specific knowledge from large language models (LLM) to smaller student models. Nonetheless, LLM distillation presents a dual challenge: 1) there is a high cost associated with querying the teacher LLM, such as GPT-4, for gathering an ample number of demonstrations; 2) the teacher LLM might provide imperfect outputs with a negative impact on the stude…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 Findings

  4. arXiv:2405.09546  [pdf, other]

    cs.CV

    BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

    Authors: Yunhao Ge, Yihe Tang, Jiashu Xu, Cem Gokmen, Chengshu Li, Wensi Ai, Benjamin Jose Martinez, Arman Aydin, Mona Anvari, Ayush K Chakravarthy, Hong-Xing Yu, Josiah Wong, Sanjana Srivastava, Sharon Lee, Shengxin Zha, Laurent Itti, Yunzhu Li, Roberto Martín-Martín, Miao Liu, Pengchuan Zhang, Ruohan Zhang, Li Fei-Fei, Jiajun Wu

    Abstract: The systematic evaluation and understanding of computer vision models under varying conditions require large amounts of data with comprehensive and customized labels, which real-world vision datasets rarely satisfy. While current synthetic data generators offer a promising alternative, particularly for embodied AI tasks, they often fall short for computer vision tasks due to low asset and renderin…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: CVPR 2024 (Highlight). Project website: https://behavior-vision-suite.github.io/

  5. arXiv:2404.17862  [pdf, other]

    cs.CL

    Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum

    Authors: Tao Meng, Fuchen Zhang, Yuntao Shou, Wei Ai, Nan Yin, Keqin Li

    Abstract: Efficiently capturing consistent and complementary semantic features in a multimodal conversation context is crucial for Multimodal Emotion Recognition in Conversation (MERC). Existing methods mainly use graph structures to model dialogue context semantic dependencies and employ Graph Neural Networks (GNN) to capture multimodal semantic features for emotion recognition. However, these methods are…

    Submitted 2 May, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures

  6. arXiv:2404.02444  [pdf, other]

    cs.CL cs.AI

    The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education

    Authors: Paiheng Xu, Jing Liu, Nathan Jones, Julie Cohen, Wei Ai

    Abstract: Assessing instruction quality is a fundamental component of any improvement efforts in the education system. However, traditional manual assessments are expensive, subjective, and heavily dependent on observers' expertise and idiosyncratic factors, preventing teachers from getting timely and frequent feedback. Different from prior research that mostly focuses on low-inference instructional practic…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  7. arXiv:2403.13409  [pdf, ps, other]

    physics.chem-ph cond-mat.mtrl-sci cs.CE physics.app-ph

    Influence of concentration-dependent material properties on the fracture and debonding of electrode particles with core-shell structure

    Authors: Y. Tu, B. Wu, W. Ai, E. Martínez-Pañeda

    Abstract: Core-shell electrode particle designs offer a route to improved lithium-ion battery performance. However, they are susceptible to mechanical damage such as fracture and debonding, which can significantly reduce their lifetime. Using a coupled finite element model, we explore the impacts of diffusion-induced stresses on the failure mechanisms of an exemplar system with an NMC811 core and an NMC111…

    Submitted 20 March, 2024; originally announced March 2024.

  8. arXiv:2403.09606  [pdf, ps, other]

    cs.CL cs.AI

    Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey

    Authors: Xiaoyu Liu, Paiheng Xu, Junda Wu, Jiaxin Yuan, Yifan Yang, Yuhang Zhou, Fuxiao Liu, Tianrui Guan, Haoliang Wang, Tong Yu, Julian McAuley, Wei Ai, Furong Huang

    Abstract: Causal inference has shown potential in enhancing the predictive accuracy, fairness, robustness, and explainability of Natural Language Processing (NLP) models by capturing causal relationships among variables. The emergence of generative Large Language Models (LLMs) has significantly impacted various NLP domains, particularly through their advanced reasoning capabilities. This survey focuses on e…

    Submitted 14 March, 2024; originally announced March 2024.

  9. arXiv:2403.09227  [pdf, other]

    cs.RO cs.AI

    BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation

    Authors: Chengshu Li, Ruohan Zhang, Josiah Wong, Cem Gokmen, Sanjana Srivastava, Roberto Martín-Martín, Chen Wang, Gabrael Levine, Wensi Ai, Benjamin Martinez, Hang Yin, Michael Lingelbach, Minjune Hwang, Ayano Hiranaka, Sujay Garlanka, Arman Aydin, Sharon Lee, Jiankai Sun, Mona Anvari, Manasi Sharma, Dhruva Bansal, Samuel Hunter, Kyu-Young Kim, Alan Lou, Caleb R Matthews, et al. (10 additional authors not shown)

    Abstract: We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an extensive survey on "what do you want robots to do for you?". The first is the definition of 1,000 everyday activities, grounded in 50 scenes (houses, gardens, restaurants, offices, etc.) with more than 9,000 objects annotated with…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: A preliminary version was published at 6th Conference on Robot Learning (CoRL 2022)

  10. arXiv:2403.07869  [pdf, other]

    cs.RO cs.AI cs.LG

    TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation

    Authors: Shivin Dass, Wensi Ai, Yuqian Jiang, Samik Singh, Jiaheng Hu, Ruohan Zhang, Peter Stone, Ben Abbatematteo, Roberto Martín-Martín

    Abstract: A critical bottleneck limiting imitation learning in robotics is the lack of data. This problem is more severe in mobile manipulation, where collecting demonstrations is harder than in stationary manipulation due to the lack of available and easy-to-use teleoperation interfaces. In this work, we demonstrate TeleMoMa, a general and modular interface for whole-body teleoperation of mobile manipulato…

    Submitted 21 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  11. arXiv:2402.14187  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC

    From Adoption to Adaption: Tracing the Diffusion of New Emojis on Twitter

    Authors: Yuhang Zhou, Xuan Lu, Wei Ai

    Abstract: In the rapidly evolving landscape of social media, the introduction of new emojis in Unicode release versions presents a structured opportunity to explore digital language evolution. Analyzing a large dataset of sampled English tweets, we examine how newly released emojis gain traction and evolve in meaning. We find that community size of early adopters and emoji semantics are crucial in determini…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 13 pages, 3 page appendix

  12. arXiv:2402.01681  [pdf, other]

    cs.CL cs.AI

    Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications

    Authors: Yuhang Zhou, Paiheng Xu, Xiyao Wang, Xuan Lu, Ge Gao, Wei Ai

    Abstract: Emojis, which encapsulate semantics beyond mere words or phrases, have become prevalent in social network communications. This has spurred increasing scholarly interest in exploring their attributes and functionalities. However, emoji-related research and application face two primary challenges. First, researchers typically rely on crowd-sourcing to annotate emojis in order to understand their sen…

    Submitted 16 February, 2024; v1 submitted 22 January, 2024; originally announced February 2024.

    Comments: 12 pages, 2 page appendix

  13. arXiv:2401.10642  [pdf, other]

    cs.SI cs.AI

    Fast Butterfly-Core Community Search For Large Labeled Graphs

    Authors: JiaYi Du, Yinghao Wu, Wei Ai, Tao Meng, CanHao Xie, KeQin Li

    Abstract: Community Search (CS) aims to identify densely interconnected subgraphs corresponding to query vertices within a graph. However, existing heterogeneous graph-based community search methods need help identifying cross-group communities and suffer from efficiency issues, making them unsuitable for large graphs. This paper presents a fast community search model based on the Butterfly-Core Community (…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 8 pages, 8 figures

  14. arXiv:2401.10641  [pdf, other]

    cs.SI cs.AI

    An Effective Index for Truss-based Community Search on Large Directed Graphs

    Authors: Wei Ai, CanHao Xie, Tao Meng, Yinghao Wu, KeQin Li

    Abstract: Community search is a derivative of community detection that enables online and personalized discovery of communities and has found extensive applications in massive real-world networks. Recently, there needs to be more focus on the community search issue within directed graphs, even though substantial research has been carried out on undirected graphs. The recently proposed D-truss model has achi…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 8 pages, 8 figures

  15. arXiv:2401.01495  [pdf, other]

    cs.CL

    A Two-Stage Multimodal Emotion Recognition Model Based on Graph Contrastive Learning

    Authors: Wei Ai, FuChen Zhang, Tao Meng, YunTao Shou, HongEn Shao, Keqin Li

    Abstract: In terms of human-computer interaction, it is becoming more and more important to correctly understand the user's emotional state in a conversation, so the task of multimodal emotion recognition (MER) started to receive more attention. However, existing emotion classification methods usually perform classification only once. Sentences are likely to be misclassified in a single round of classificat…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 9 pages, 3 figures

  16. arXiv:2312.16778  [pdf, other]

    cs.CL

    Adversarial Representation with Intra-Modal and Inter-Modal Graph Contrastive Learning for Multimodal Emotion Recognition

    Authors: Yuntao Shou, Tao Meng, Wei Ai, Keqin Li

    Abstract: With the release of increasing open-source emotion recognition datasets on social media platforms and the rapid development of computing resources, multimodal emotion recognition tasks (MER) have begun to receive widespread research attention. The MER task extracts and fuses complementary semantic information from different modalities, which can classify the speaker's emotions. However, the existi…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 14 pages, 6 figures

  17. arXiv:2312.10579  [pdf, other]

    cs.CL cs.AI

    DER-GCN: Dialogue and Event Relation-Aware Graph Convolutional Neural Network for Multimodal Dialogue Emotion Recognition

    Authors: Wei Ai, Yuntao Shou, Tao Meng, Keqin Li

    Abstract: With the continuous development of deep learning (DL), the task of multimodal dialogue emotion recognition (MDER) has recently received extensive research attention, which is also an essential branch of DL. The MDER aims to identify the emotional information contained in different modalities, e.g., text, video, and audio, in different dialogue scenes. However, existing research has focused on mode…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures

  18. arXiv:2312.06337  [pdf, other]

    cs.SD cs.CL eess.AS

    Deep Imbalanced Learning for Multimodal Emotion Recognition in Conversations

    Authors: Tao Meng, Yuntao Shou, Wei Ai, Nan Yin, Keqin Li

    Abstract: The main task of Multimodal Emotion Recognition in Conversations (MERC) is to identify the emotions in modalities, e.g., text, audio, image and video, which is a significant development direction for realizing machine intelligence. However, many data in MERC naturally exhibit an imbalanced distribution of emotion categories, and researchers ignore the negative impact of imbalanced data on emotion…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 16 pages, 9 figures

  19. arXiv:2312.05735  [pdf, other]

    cs.AI

    A Comprehensive Survey on Multi-modal Conversational Emotion Recognition with Deep Learning

    Authors: Yuntao Shou, Tao Meng, Wei Ai, Nan Yin, Keqin Li

    Abstract: Multi-modal conversation emotion recognition (MCER) aims to recognize and track the speaker's emotional state using text, speech, and visual information in the conversation scene. Analyzing and studying MCER issues is significant to affective computing, intelligent recommendations, and human-computer interaction fields. Unlike the traditional single-utterance multi-modal emotion recognition or sin…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 36 pages, 10 figures

  20. arXiv:2312.02545  [pdf, other]

    cs.CV cs.AI

    Graph Information Bottleneck for Remote Sensing Segmentation

    Authors: Yuntao Shou, Wei Ai, Tao Meng

    Abstract: Remote sensing segmentation has a wide range of applications in environmental protection, and urban change detection, etc. Despite the success of deep learning-based remote sensing segmentation methods (e.g., CNN and Transformer), they are not flexible enough to model irregular objects. In addition, existing graph contrastive learning methods usually adopt the way of maximizing mutual information…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 13 pages, 6 figures

  21. arXiv:2312.01758  [pdf, other]

    cs.CV cs.AI

    CILF-CIAE: CLIP-driven Image-Language Fusion for Correcting Inverse Age Estimation

    Authors: Yuntao Shou, Wei Ai, Tao Meng, Keqin Li

    Abstract: The age estimation task aims to predict the age of an individual by analyzing facial features in an image. The development of age estimation can improve the efficiency and accuracy of various applications (e.g., age verification and secure access control, etc.). In recent years, contrastive language-image pre-training (CLIP) has been widely used in various multimodal tasks and has made some progre…

    Submitted 1 July, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 14 pages, 14 figures, 3 tables

  22. arXiv:2311.08648  [pdf, other]

    cs.CL cs.AI

    Explore Spurious Correlations at the Concept Level in Language Models for Text Classification

    Authors: Yuhang Zhou, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, Furong Huang

    Abstract: Language models (LMs) have achieved notable success in numerous NLP tasks, employing both fine-tuning and in-context learning (ICL) methods. While language models demonstrate exceptional performance, they face robustness challenges due to spurious correlations arising from imbalanced label distributions in training data or ICL exemplars. Previous research has primarily concentrated on word, phrase…

    Submitted 15 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 14 pages, 4 page appendix, Accepted by ACL 2024 Main

  23. arXiv:2311.01454  [pdf, other]

    cs.RO cs.AI

    NOIR: Neural Signal Operated Intelligent Robots for Everyday Activities

    Authors: Ruohan Zhang, Sharon Lee, Minjune Hwang, Ayano Hiranaka, Chen Wang, Wensi Ai, Jin Jie Ryan Tan, Shreya Gupta, Yilun Hao, Gabrael Levine, Ruohan Gao, Anthony Norcia, Li Fei-Fei, Jiajun Wu

    Abstract: We present Neural Signal Operated Intelligent Robots (NOIR), a general-purpose, intelligent brain-robot interface system that enables humans to command robots to perform everyday activities through brain signals. Through this interface, humans communicate their intended objects of interest and actions to the robots using electroencephalography (EEG). Our novel system demonstrates success in an exp…

    Submitted 2 November, 2023; originally announced November 2023.

  24. arXiv:2309.07927  [pdf, ps, other]

    eess.AS cs.CL cs.SD

    Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

    Authors: Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

    Abstract: Recent advancements in Automatic Speech Recognition (ASR) systems, exemplified by Whisper, have demonstrated the potential of these systems to approach human-level performance given sufficient data. However, this progress doesn't readily extend to ASR for children due to the limited availability of suitable child-specific databases and the distinct characteristics of children's speech. A recent st…

    Submitted 15 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  25. arXiv:2308.16360  [pdf, other]

    cs.CY cs.HC cs.LG

    Emoji Promotes Developer Participation and Issue Resolution on GitHub

    Authors: Yuhang Zhou, Xuan Lu, Ge Gao, Qiaozhu Mei, Wei Ai

    Abstract: Although remote working is increasingly adopted during the pandemic, many are concerned by the low-efficiency in the remote working. Missing in text-based communication are non-verbal cues such as facial expressions and body language, which hinders the effective communication and negatively impacts the work outcomes. Prevalent on social media platforms, emojis, as alternative non-verbal cues, are…

    Submitted 16 April, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted by the 18th International AAAI Conference on Web and Social Media (ICWSM 2024)

  26. arXiv:2306.00899  [pdf, other]

    cs.LG cs.IR cs.SI

    Pitfalls in Link Prediction with Graph Neural Networks: Understanding the Impact of Target-link Inclusion & Better Practices

    Authors: Jing Zhu, Yuhang Zhou, Vassilis N. Ioannidis, Shengyi Qian, Wei Ai, Xiang Song, Danai Koutra

    Abstract: While Graph Neural Networks (GNNs) are remarkably successful in a variety of high-impact applications, we demonstrate that, in link prediction, the common practices of including the edges being predicted in the graph at training and/or test have outsized impact on the performance of low-degree nodes. We theoretically and empirically investigate how these practices impact node-level performance acr…

    Submitted 17 December, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Extended Version of our WSDM'24 paper. 8 pages, 2 page appendix

  27. arXiv:2305.15622  [pdf, other]

    cs.LG cs.CY cs.SI

    GFairHint: Improving Individual Fairness for Graph Neural Networks via Fairness Hint

    Authors: Paiheng Xu, Yuhang Zhou, Bang An, Wei Ai, Furong Huang

    Abstract: Given the growing concerns about fairness in machine learning and the impressive performance of Graph Neural Networks (GNNs) on graph data learning, algorithmic fairness in GNNs has attracted significant attention. While many existing studies improve fairness at the group level, only a few works promote individual fairness, which renders similar outcomes for similar individuals. A desirable framew…

    Submitted 24 May, 2023; originally announced May 2023.

  28. arXiv:2304.04321  [pdf, other]

    cs.AI cs.CL cs.CV cs.RO

    ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes

    Authors: Ran Gong, Jiangyong Huang, Yizhou Zhao, Haoran Geng, Xiaofeng Gao, Qingyang Wu, Wensi Ai, Ziheng Zhou, Demetri Terzopoulos, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

    Abstract: Understanding the continuous states of objects is essential for task learning and planning in the real world. However, most existing task learning benchmarks assume discrete (e.g., binary) object goal states, which poses challenges for the learning of complex tasks and transferring learned policy from simulated environments to the real world. Furthermore, state discretization limits a robot's abil…

    Submitted 11 September, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: The first two authors contributed equally; 20 pages; 17 figures; project available: https://arnold-benchmark.github.io/ ICCV 2023

  29. arXiv:2301.12326  [pdf, other]

    cs.LG cs.CY

    Team Resilience under Shock: An Empirical Analysis of GitHub Repositories during Early COVID-19 Pandemic

    Authors: Xuan Lu, Wei Ai, Yixin Wang, Qiaozhu Mei

    Abstract: While many organizations have shifted to working remotely during the COVID-19 pandemic, how the remote workforce and the remote teams are influenced by and would respond to this and future shocks remain largely unknown. Software developers have relied on remote collaborations long before the pandemic, working in virtual teams (GitHub repositories). The dynamics of these repositories through the pa…

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: 12 pages, 4 figures. To be published in the 17th International AAAI Conference on Web and Social Media (ICWSM)

  30. arXiv:2206.12727  [pdf, other]

    physics.chem-ph cond-mat.mtrl-sci cs.CE physics.app-ph

    A coupled phase field formulation for modelling fatigue cracking in lithium-ion battery electrode particles

    Authors: W. Ai, B. Wu, E. Martínez-Pañeda

    Abstract: Electrode particle cracking is one of the main phenomena driving battery capacity degradation. Recent phase field fracture studies have investigated particle cracking behaviour. However, only the beginning of life has been considered and effects such as damage accumulation have been neglected. Here, a multi-physics phase field fatigue model has been developed to study crack propagation in battery…

    Submitted 25 June, 2022; originally announced June 2022.

  31. arXiv:2206.11887  [pdf, other]

    cs.CG cs.AI

    VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in Omniverse

    Authors: Yizhou Zhao, Steven Gong, Xiaofeng Gao, Wensi Ai, Song-Chun Zhu

    Abstract: With the recent progress of simulations by 3D modeling software and game engines, many researchers have focused on Embodied AI tasks in the virtual environment. However, the research community lacks a platform that can easily serve both indoor scene synthesis and model benchmarking with various algorithms. Meanwhile, computer graphics-related tasks need a toolkit for implementing advanced synthesi…

    Submitted 23 June, 2022; originally announced June 2022.

  32. arXiv:2203.04930  [pdf, other]

    cs.GR cs.CV

    Triangular Character Animation Sampling with Motion, Emotion, and Relation

    Authors: Yizhou Zhao, Liang Qiu, Wensi Ai, Pan Lu, Song-Chun Zhu

    Abstract: Dramatic progress has been made in animating individual characters. However, we still lack automatic control over activities between characters, especially those involving interactions. In this paper, we present a novel energy-based framework to sample and synthesize animations by associating the characters' body motions, facial expressions, and social relations. We propose a Spatial-Temporal And-…

    Submitted 9 March, 2022; originally announced March 2022.

  33. arXiv:2112.06060  [pdf, other]

    cs.GR

    GenMotion: Data-driven Motion Generators for Real-time Animation Synthesis

    Authors: Yizhou Zhao, Wensi Ai, Liang Qiu, Pan Lu, Feng Shi, Tian Han, Song-Chun Zhu

    Abstract: With the recent success of deep learning algorithms, many researchers have focused on generative models for human motion animation. However, the research community lacks a platform for training and benchmarking various algorithms, and the animation industry needs a toolkit for implementing advanced motion synthesizing techniques. To facilitate the study of deep motion synthesis methods for skeleto…

    Submitted 11 December, 2021; originally announced December 2021.

  34. arXiv:2105.14678  [pdf, other]

    cs.CV eess.IV

    Image-to-Video Generation via 3D Facial Dynamics

    Authors: Xiaoguang Tu, Yingtian Zou, Jian Zhao, Wenjie Ai, Jian Dong, Yuan Yao, Zhikang Wang, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

    Abstract: We present a versatile model, FaceAnime, for various video generation tasks from still images. Video generation from a single face image is an interesting problem and usually tackled by utilizing Generative Adversarial Networks (GANs) to integrate information from the input face image and a sequence of sparse facial landmarks. However, the generated face images usually suffer from quality loss, im…

    Submitted 30 May, 2021; originally announced May 2021.

  35. Joint Face Image Restoration and Frontalization for Recognition

    Authors: Xiaoguang Tu, Jian Zhao, Qiankun Liu, Wenjie Ai, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

    Abstract: In real-world scenarios, many factors may harm face recognition performance, e.g., large pose, bad illumination, low resolution, blur and noise. To address these challenges, previous efforts usually first restore the low-quality faces to high-quality ones and then perform face recognition. However, most of these methods are stage-wise, which is sub-optimal and deviates from the reality. In this pap…

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: 14 pages, 9 figures

  36. Emojis predict dropouts of remote workers: An empirical study of emoji usage on GitHub

    Authors: Xuan Lu, Wei Ai, Zhenpeng Chen, Yanbin Cao, Qiaozhu Mei

    Abstract: Emotions at work have long been identified as critical signals of work motivations, status, and attitudes, and as predictors of various work-related outcomes. When more and more employees work remotely, these emotional signals of workers become harder to observe through daily, face-to-face communications. The use of online platforms to communicate and collaborate at work provides an alternative…

    Submitted 27 January, 2022; v1 submitted 10 February, 2021; originally announced February 2021.

    Journal ref: PLOS ONE 17(2022):1-21

  37. arXiv:2011.09078  [pdf, other]

    cs.SD cs.MM eess.AS

    Vertical-Horizontal Structured Attention for Generating Music with Chords

    Authors: Yizhou Zhao, Liang Qiu, Wensi Ai, Feng Shi, Song-Chun Zhu

    Abstract: In this paper, we propose a lightweight music-generating model based on variational autoencoder (VAE) with structured attention. Generating music is different from generating text because the melodies with chords give listeners distinguished polyphonic feelings. In a piece of music, a chord consisting of multiple notes comes from either the mixture of multiple instruments or the combination of mul…

    Submitted 17 November, 2020; originally announced November 2020.

  38. arXiv:2008.07364  [pdf, other]

    cs.CY cs.LG cs.SI stat.ML

    Predicting Individual Treatment Effects of Large-scale Team Competitions in a Ride-sharing Economy

    Authors: Teng Ye, Wei Ai, Lingyu Zhang, Ning Luo, Lulu Zhang, Jieping Ye, Qiaozhu Mei

    Abstract: Millions of drivers worldwide have enjoyed financial benefits and work schedule flexibility through a ride-sharing economy, but meanwhile they have suffered from the lack of a sense of identity and career achievement. Equipped with social identity and contest theories, financially incentivized team competitions have been an effective instrument to increase drivers' productivity, job satisfaction,…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: Accepted to KDD 2020

  39. arXiv:2005.10455  [pdf, other]

    eess.IV cs.CV

    Single Image Super-Resolution via Residual Neuron Attention Networks

    Authors: Wenjie Ai, Xiaoguang Tu, Shilei Cheng, Mei Xie

    Abstract: Deep Convolutional Neural Networks (DCNNs) have achieved impressive performance in Single Image Super-Resolution (SISR). To further improve the performance, existing CNN-based methods generally focus on designing deeper architecture of the network. However, we argue blindly increasing network's depth is not the most sensible way. In this paper, we propose a novel end-to-end Residual Neuron Attenti…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 6 pages, 4 figures, Accepted by IEEE ICIP 2020

  40. arXiv:2001.04665  [pdf]

    cs.CV

    Face Attribute Invertion

    Authors: X G Tu, Y Luo, H S Zhang, W J Ai, Z Ma, M Xie

    Abstract: Manipulating human facial images between two domains is an important and interesting problem. Most of the existing methods address this issue by applying two generators or one generator with extra conditional inputs. In this paper, we proposed a novel self-perception method based on GANs for automatical face attribute inverse. The proposed method takes face images as inputs and employs only one si…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 8 pages, 3 figures

  41. Through a Gender Lens: Learning Usage Patterns of Emojis from Large-Scale Android Users

    Authors: Zhenpeng Chen, Xuan Lu, Wei Ai, Huoran Li, Qiaozhu Mei, Xuanzhe Liu

    Abstract: Based on a large data set of emoji using behavior collected from smartphone users over the world, this paper investigates gender-specific usage of emojis. We present various interesting findings that evidence a considerable difference in emoji usage by female and male users. Such a difference is significant not just in a statistical sense; it is sufficient for a machine learning algorithm to accur…

    Submitted 25 April, 2018; v1 submitted 16 May, 2017; originally announced May 2017.

    Comments: The Web Conference 2018 (WWW 2018)

  42. arXiv:1504.00981   

    cs.LG math.OC

    ELM-Based Distributed Cooperative Learning Over Networks

    Authors: Wu Ai, Weisheng Chen

    Abstract: This paper investigates distributed cooperative learning algorithms for data processing in a network setting. Specifically, the extreme learning machine (ELM) is introduced to train a set of data distributed across several components, and each component runs a program on a subset of the entire data. In this scheme, there is no requirement for a fusion center in the network due to e.g., practical l…

    Submitted 30 November, 2015; v1 submitted 4 April, 2015; originally announced April 2015.

    Comments: This paper has been withdrawn by the authors due to the incorrect proof of Theorem 2