Skip to main content

Showing 1–50 of 51 results for author: Liao, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19922  [pdf, other

    cs.CV

    Parallax-tolerant Image Stitching via Segmentation-guided Multi-homography War**

    Authors: Tianli Liao, Ce Wang, Lei Li, Guangen Liu, Nan Li

    Abstract: Large parallax between images is an intractable issue in image stitching. Various war**-based methods are proposed to address it, yet the results are unsatisfactory. In this paper, we propose a novel image stitching method using multi-homography war** guided by image segmentation. Specifically, we leverage the Segment Anything Model to segment the target image into numerous contents and partit… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 11 pages, 9 figures

  2. arXiv:2406.07814  [pdf, other

    cs.AI cs.CL cs.HC

    Collective Constitutional AI: Aligning a Language Model with Public Input

    Authors: Saffron Huang, Divya Siddarth, Liane Lovitt, Thomas I. Liao, Esin Durmus, Alex Tamkin, Deep Ganguli

    Abstract: There is growing consensus that language model (LM) developers should not be the sole deciders of LM behavior, creating a need for methods that enable the broader public to collectively shape the behavior of LM systems that affect them. To address this need, we present Collective Constitutional AI (CCAI): a multi-stage process for sourcing and integrating public input into LMs-from identifying a t… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; K.4.2

    Journal ref: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency. 1395-1417

  3. arXiv:2406.05773  [pdf, other

    cs.CV

    CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder

    Authors: Tangfei Liao, Xiaoqin Zhang, Guobao Xiao, Min Li, Tao Wang, Mang Ye

    Abstract: Pre-training has emerged as a simple yet powerful methodology for representation learning across various domains. However, due to the expensive training cost and limited data, pre-training has not yet been extensively studied in correspondence pruning. To tackle these challenges, we propose a pre-training method to acquire a generic inliers-consistent representation by reconstructing masked corres… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  4. arXiv:2405.20334  [pdf, other

    cs.CV cs.GR

    VividDream: Generating 3D Scene with Ambient Dynamics

    Authors: Yao-Chih Lee, Yi-Ting Chen, Andrew Wang, Ting-Hsuan Liao, Brandon Y. Feng, Jia-Bin Huang

    Abstract: We introduce VividDream, a method for generating explorable 4D scenes with ambient dynamics from a single input image or text prompt. VividDream first expands an input image into a static 3D point cloud through iterative inpainting and geometry merging. An ensemble of animated videos is then generated using video diffusion models with quality refinement techniques and conditioned on renderings of… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Project page: https://vivid-dream-4d.github.io

  5. arXiv:2405.09839  [pdf, other

    cs.LG

    Advances in Robust Federated Learning: Heterogeneity Considerations

    Authors: Chuan Chen, Tianchi Liao, Xiaojun Deng, Zihou Wu, Sheng Huang, Zibin Zheng

    Abstract: In the field of heterogeneous federated learning (FL), the key challenge is to efficiently and collaboratively train models across multiple clients with different data distributions, model structures, task objectives, computational capabilities, and communication resources. This diversity leads to significant heterogeneity, which increases the complexity of model training. In this paper, we first… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  6. arXiv:2405.09024  [pdf, other

    cs.CV

    Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels

    Authors: Guozhang Liu, Ting Liu, Mengke Yuan, Tao Pang, Guangxing Yang, Hao Fu, Tao Wang, Tongkui Liao

    Abstract: The ambiguous appearance, tiny scale, and fine-grained classes of objects in remote sensing imagery inevitably lead to the noisy annotations in category labels of detection dataset. However, the effects and treatments of the label noises are underexplored in modern oriented remote sensing object detectors. To address this issue, we propose a robust oriented remote sensing object detection method t… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2403.12502  [pdf, other

    cs.RO

    Under-actuated Robotic Gripper with Multiple Gras** Modes Inspired by Human Finger

    Authors: Jihao Li, Tingbo Liao, Hassen Nigatu, Haotian Guo, Guodong Lu, Huixu Dong

    Abstract: Under-actuated robot grippers as a pervasive tool of robots have become a considerable research focus. Despite their simplicity of mechanical design and control strategy, they suffer from poor versatility and weak adaptability, making widespread applications limited. To better relieve relevant research gaps, we present a novel 3-finger linkage-based gripper that realizes retractable and reconfigur… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 8 pages

  8. arXiv:2402.18013  [pdf, other

    cs.CL cs.AI

    A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems

    Authors: Zihao Yi, Jiarui Ouyang, Yuwen Liu, Tianhao Liao, Zhe Xu, Ying Shen

    Abstract: This survey provides a comprehensive review of research on multi-turn dialogue systems, with a particular focus on multi-turn dialogue systems based on large language models (LLMs). This paper aims to (a) give a summary of existing LLMs and approaches for adapting LLMs to downstream tasks; (b) elaborate recent advances in multi-turn dialogue systems, covering both LLM-based open-domain dialogue (O… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 35 pages, 10 figures, ACM Computing Surveys

  9. arXiv:2402.17202  [pdf, other

    cs.LG

    FedBRB: An Effective Solution to the Small-to-Large Scenario in Device-Heterogeneity Federated Learning

    Authors: Ziyue Xu, Mingfeng Xu, Tianchi Liao, Zibin Zheng, Chuan Chen

    Abstract: Recently, the success of large models has demonstrated the importance of scaling up model size. This has spurred interest in exploring collaborative training of large-scale models from federated learning perspective. Due to computational constraints, many institutions struggle to train a large-scale model locally. Thus, training a larger global model using only smaller local models has become an i… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  10. arXiv:2312.08774  [pdf, other

    cs.CV

    VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

    Authors: Tangfei Liao, Xiaoqin Zhang, Li Zhao, Tao Wang, Guobao Xiao

    Abstract: Correspondence pruning aims to find correct matches (inliers) from an initial set of putative correspondences, which is a fundamental task for many applications. The process of finding is challenging, given the varying inlier ratios between scenes/image pairs due to significant visual differences. However, the performance of the existing methods is usually limited by the problem of lacking visual… ▽ More

    Submitted 4 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  11. arXiv:2312.00048  [pdf, other

    cs.CR cs.LG

    Tokenized Model: A Blockchain-Empowered Decentralized Model Ownership Verification Platform

    Authors: Yihao Li, Yanyi Lai, Tianchi Liao, Chuan Chen, Zibin Zheng

    Abstract: With the development of practical deep learning models like generative AI, their excellent performance has brought huge economic value. For instance, ChatGPT has attracted more than 100 million users in three months. Since the model training requires a lot of data and computing power, a well-performing deep learning model is behind a huge effort and cost. Facing various model attacks, unauthorized… ▽ More

    Submitted 27 November, 2023; originally announced December 2023.

  12. arXiv:2311.18564  [pdf, other

    cs.CV

    Seam-guided local alignment and stitching for large parallax images

    Authors: Tianli Liao, Chenyang Zhao, Lei Li, Heling Cao

    Abstract: Seam-cutting methods have been proven effective in the composition step of image stitching, especially for images with parallax. However, the effectiveness of seam-cutting usually depends on that images can be roughly aligned such that there exists a local region where a plausible seam can be found. For images with large parallax, current alignment methods often fall short of expectations. In this… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 13 pages, 12 figures, in peer review

  13. arXiv:2310.13798  [pdf, other

    cs.CL cs.AI

    Specific versus General Principles for Constitutional AI

    Authors: Sandipan Kundu, Yuntao Bai, Saurav Kadavath, Amanda Askell, Andrew Callahan, Anna Chen, Anna Goldie, Avital Balwit, Azalia Mirhoseini, Brayden McLean, Catherine Olsson, Cassie Evraets, Eli Tran-Johnson, Esin Durmus, Ethan Perez, Jackson Kernion, Jamie Kerr, Kamal Ndousse, Karina Nguyen, Nelson Elhage, Newton Cheng, Nicholas Schiefer, Nova DasSarma, Oliver Rausch, Robin Larson , et al. (11 additional authors not shown)

    Abstract: Human feedback can prevent overtly harmful utterances in conversational models, but may not automatically mitigate subtle problematic behaviors such as a stated desire for self-preservation or power. Constitutional AI offers an alternative, replacing human feedback with feedback from AI models conditioned only on a list of written principles. We find this approach effectively prevents the expressi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  14. arXiv:2308.10899  [pdf, other

    cs.AI

    TADA! Text to Animatable Digital Avatars

    Authors: Tingting Liao, Hongwei Yi, Yuliang Xiu, Jiaxaing Tang, Yangyi Huang, Justus Thies, Michael J. Black

    Abstract: We introduce TADA, a simple-yet-effective approach that takes textual descriptions and produces expressive 3D avatars with high-quality geometry and lifelike textures, that can be animated and rendered with traditional graphics pipelines. Existing text-based character generation methods are limited in terms of geometry and texture quality, and cannot be realistically animated due to inconsistent a… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  15. arXiv:2308.08545  [pdf, other

    cs.CV cs.AI cs.GR

    TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

    Authors: Yangyi Huang, Hongwei Yi, Yuliang Xiu, Tingting Liao, Jiaxiang Tang, Deng Cai, Justus Thies

    Abstract: Despite recent research advancements in reconstructing clothed humans from a single image, accurately restoring the "unseen regions" with high-level details remains an unsolved challenge that lacks attention. Existing methods often generate overly smooth back-side surfaces with a blurry texture. But how to effectively capture all visual attributes of an individual from a single image, which are su… ▽ More

    Submitted 19 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Project: https://huangyangyi.github.io/TeCH, Code: https://github.com/huangyangyi/TeCH

  16. arXiv:2307.03823  [pdf, other

    cs.CL

    Linguistic representations for fewer-shot relation extraction across domains

    Authors: Sireesh Gururaja, Ritam Dutt, Tinglong Liao, Carolyn Rose

    Abstract: Recent work has demonstrated the positive impact of incorporating linguistic representations as additional context and scaffolding on the in-domain performance of several NLP tasks. We extend this work by exploring the impact of linguistic representations on cross-domain performance in a few-shot transfer setting. An important question is whether linguistic representations enhance generalizability… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: ACL 2023

  17. arXiv:2306.16388  [pdf, other

    cs.CL cs.AI

    Towards Measuring the Representation of Subjective Global Opinions in Language Models

    Authors: Esin Durmus, Karina Nguyen, Thomas I. Liao, Nicholas Schiefer, Amanda Askell, Anton Bakhtin, Carol Chen, Zac Hatfield-Dodds, Danny Hernandez, Nicholas Joseph, Liane Lovitt, Sam McCandlish, Orowa Sikder, Alex Tamkin, Janel Thamkul, Jared Kaplan, Jack Clark, Deep Ganguli

    Abstract: Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across dif… ▽ More

    Submitted 11 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  18. arXiv:2306.04212  [pdf, other

    cs.LG cs.CY

    Migrate Demographic Group For Fair GNNs

    Authors: YanMing Hu, TianChi Liao, JiaLong Chen, **g Bian, ZiBin Zheng, Chuan Chen

    Abstract: Graph Neural networks (GNNs) have been applied in many scenarios due to the superior performance of graph learning. However, fairness is always ignored when designing GNNs. As a consequence, biased information in training data can easily affect vanilla GNNs, causing biased results toward particular demographic groups (divided by sensitive attributes, such as race and age). There have been efforts… ▽ More

    Submitted 23 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  19. arXiv:2306.00419  [pdf, other

    cs.CR cs.AI

    Challenges and Remedies to Privacy and Security in AIGC: Exploring the Potential of Privacy Computing, Blockchain, and Beyond

    Authors: Chuan Chen, Zhenpeng Wu, Yanyi Lai, Wenlin Ou, Tianchi Liao, Zibin Zheng

    Abstract: Artificial Intelligence Generated Content (AIGC) is one of the latest achievements in AI development. The content generated by related applications, such as text, images and audio, has sparked a heated discussion. Various derived AIGC applications are also gradually entering all walks of life, bringing unimaginable impact to people's daily lives. However, the rapid development of such generative t… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 43 pages, 10 figures

  20. Anomaly Detection Using One-Class SVM for Logs of Juniper Router Devices

    Authors: Tat-Bao-Thien Nguyen, Teh-Lu Liao, Tuan-Anh Vu

    Abstract: The article deals with anomaly detection of Juniper router logs. Abnormal Juniper router logs include logs that are usually different from the normal operation, and they often reflect the abnormal operation of router devices. To prevent router devices from being damaged and help administrator to grasp the situation of error quickly, detecting abnormal operation soon is very important. In this work… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Journal ref: In: Duong, T., Vo, NS., Nguyen, L., Vien, QT., Nguyen, VD. (eds) Industrial Networks and Intelligent Systems. INISCOM 2019

  21. arXiv:2304.03903  [pdf, other

    cs.CV cs.AI

    High-Fidelity Clothed Avatar Reconstruction from a Single Image

    Authors: Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei

    Abstract: This paper presents a framework for efficient 3D clothed avatar reconstruction. By combining the advantages of the high accuracy of optimization-based methods and the efficiency of learning-based methods, we propose a coarse-to-fine way to realize a high-fidelity clothed avatar reconstruction (CAR) from a single image. At the first stage, we use an implicit model to learn the general shape in the… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

  22. arXiv:2303.15772  [pdf, other

    cs.LG cs.AI cs.CY

    Ecosystem Graphs: The Social Footprint of Foundation Models

    Authors: Rishi Bommasani, Dilara Soylu, Thomas I. Liao, Kathleen A. Creel, Percy Liang

    Abstract: Foundation models (e.g. ChatGPT, StableDiffusion) pervasively influence society, warranting immediate social attention. While the models themselves garner much attention, to accurately characterize their impact, we must consider the broader sociotechnical ecosystem. We propose Ecosystem Graphs as a documentation framework to transparently centralize knowledge of this ecosystem. Ecosystem Graphs is… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Ecosystem Graphs available at https://crfm.stanford.edu/ecosystem-graphs/

  23. arXiv:2302.08510  [pdf, other

    cs.CV

    Text-driven Visual Synthesis with Latent Diffusion Prior

    Authors: Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang

    Abstract: There has been tremendous progress in large-scale text-to-image synthesis driven by diffusion models enabling versatile downstream applications such as 3D object synthesis from texts, image editing, and customized generation. We present a generic approach using latent diffusion models as powerful image priors for various visual synthesis tasks. Existing methods that utilize such priors fail to use… ▽ More

    Submitted 3 April, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Project website: https://latent-diffusion-prior.github.io/

  24. arXiv:2302.07459  [pdf, other

    cs.CL

    The Capacity for Moral Self-Correction in Large Language Models

    Authors: Deep Ganguli, Amanda Askell, Nicholas Schiefer, Thomas I. Liao, Kamilė Lukošiūtė, Anna Chen, Anna Goldie, Azalia Mirhoseini, Catherine Olsson, Danny Hernandez, Dawn Drain, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jackson Kernion, Jamie Kerr, Jared Mueller, Joshua Landau, Kamal Ndousse, Karina Nguyen, Liane Lovitt, Michael Sellitto, Nelson Elhage, Noemi Mercado, Nova DasSarma , et al. (24 additional authors not shown)

    Abstract: We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability… ▽ More

    Submitted 18 February, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  25. arXiv:2302.05242  [pdf, other

    cs.RO eess.SY

    Hierarchical Motion Planning under Probabilistic Temporal Tasks and Safe-Return Constraints

    Authors: Meng Guo, Tianjun Liao, Junjie Wang, Zhongkui Li

    Abstract: Safety is crucial for robotic missions within an uncertain environment. Common safety requirements such as collision avoidance are only state-dependent, which can be restrictive for complex missions. In this work, we address a more general formulation as safe-return constraints, which require the existence of a return-policy to drive the system back to a set of safe states with high probability. T… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 16 pages, 11 figures

  26. arXiv:2211.08888  [pdf, other

    cs.CV

    ELDA: Using Edges to Have an Edge on Semantic Segmentation Based UDA

    Authors: Ting-Hsuan Liao, Huang-Ru Liao, Shan-Ya Yang, Jie-En Yao, Li-Yuan Tsao, Hsu-Shen Liu, Bo-Wun Cheng, Chen-Hao Chao, Chia-Che Chang, Yi-Chen Lo, Chun-Yi Lee

    Abstract: Many unsupervised domain adaptation (UDA) methods have been proposed to bridge the domain gap by utilizing domain invariant information. Most approaches have chosen depth as such information and achieved remarkable success. Despite their effectiveness, using depth as domain invariant information in UDA tasks may lead to multiple issues, such as excessively high extraction costs and difficulties in… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted by BMVC2022. Ting-Hsuan Liao and Huang-Ru Liao contributed equally to this work

  27. Robust Unstructured Knowledge Access in Conversational Dialogue with ASR Errors

    Authors: Yik-Cheung Tam, Jiacheng Xu, Jiakai Zou, Zecheng Wang, Tinglong Liao, Shuhan Yuan

    Abstract: Performance of spoken language understanding (SLU) can be degraded with automatic speech recognition (ASR) errors. We propose a novel approach to improve SLU robustness by randomly corrupting clean training text with an ASR error simulator, followed by self-correcting the errors and minimizing the target classification loss in a joint manner. In the proposed error simulator, we leverage confusion… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 7 pages, 2 figures. Accepted at ICASSP 2022

    Journal ref: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 6702-6706

  28. arXiv:2210.08993  [pdf, other

    cs.CY

    When Digital Economy Meets Web3.0: Applications and Challenges

    Authors: Chuan Chen, Lei Zhang, Yihao Li, Tianchi Liao, Siran Zhao, Zibin Zheng, Huawei Huang, Jia**g Wu

    Abstract: With the continuous development of web technology, Web3.0 has attracted a considerable amount of attention due to its unique decentralized characteristics. The digital economy is an important driver of high-quality economic development and is currently in a rapid development stage. In the digital economy scenario, the centralized nature of the Internet and other characteristics usually bring about… ▽ More

    Submitted 29 October, 2022; v1 submitted 26 September, 2022; originally announced October 2022.

    Comments: 14 pages, 5 figures

  29. arXiv:2209.07313  [pdf, other

    eess.IV cs.CV

    HarDNet-DFUS: An Enhanced Harmonically-Connected Network for Diabetic Foot Ulcer Image Segmentation and Colonoscopy Polyp Segmentation

    Authors: Ting-Yu Liao, Ching-Hui Yang, Yu-Wen Lo, Kuan-Ying Lai, Po-Huai Shen, Youn-Long Lin

    Abstract: We present a neural network architecture for medical image segmentation of diabetic foot ulcers and colonoscopy polyps. Diabetic foot ulcers are caused by neuropathic and vascular complications of diabetes mellitus. In order to provide a proper diagnosis and treatment, wound care professionals need to extract accurate morphological features from the foot wounds. Using computer-aided systems is a p… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  30. arXiv:2208.08892  [pdf, other

    cs.CV cs.AI

    Pixel-Wise Prediction based Visual Odometry via Uncertainty Estimation

    Authors: Hao-Wei Chen, Ting-Hsuan Liao, Hsuan-Kung Yang, Chun-Yi Lee

    Abstract: This paper introduces pixel-wise prediction based visual odometry (PWVO), which is a dense prediction task that evaluates the values of translation and rotation for every pixel in its input observations. PWVO employs uncertainty estimation to identify the noisy regions in the input observations, and adopts a selection mechanism to integrate pixel-wise predictions based on the estimated uncertainty… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

  31. arXiv:2207.05621  [pdf, other

    cs.CV

    MSP-Former: Multi-Scale Projection Transformer for Single Image Desnowing

    Authors: Sixiang Chen, Tian Ye, Yun Liu, Taodong Liao, **gxia Jiang, Erkang Chen, Peng Chen

    Abstract: Snow removal causes challenges due to its characteristic of complex degradations. To this end, targeted treatment of multi-scale snow degradations is critical for the network to learn effective snow removal. In order to handle the diverse scenes, we propose a multi-scale projection transformer (MSP-Former), which understands and covers a variety of snow degradation features in a multi-path manner,… ▽ More

    Submitted 11 March, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to ICASSP'2023

  32. arXiv:2204.11184  [pdf, other

    cs.CV

    MVP-Human Dataset for 3D Human Avatar Reconstruction from Unconstrained Frames

    Authors: Xiangyu Zhu, Tingting Liao, Jiang**g Lyu, Xiang Yan, Yunfeng Wang, Kan Guo, Qiong Cao, Stan Z. Li, Zhen Lei

    Abstract: In this paper, we consider a novel problem of reconstructing a 3D human avatar from multiple unconstrained frames, independent of assumptions on camera calibration, capture space, and constrained actions. The problem should be addressed by a framework that takes multiple unconstrained images as inputs, and generates a shape-with-skinning avatar in the canonical space, finished in one feed-forward… ▽ More

    Submitted 17 May, 2023; v1 submitted 23 April, 2022; originally announced April 2022.

    Comments: Accepted by IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM)

  33. Deep Transfer Learning with Graph Neural Network for Sensor-Based Human Activity Recognition

    Authors: Yan Yan, Tianzheng Liao, **** Zhao, Jiahong Wang, Liang Ma, Wei Lv, **g Xiong, Lei Wang

    Abstract: The sensor-based human activity recognition (HAR) in mobile application scenarios is often confronted with sensor modalities variation and annotated data deficiency. Given this observation, we devised a graph-inspired deep learning approach toward the sensor-based HAR tasks, which was further used to build a deep transfer learning model toward giving a tentative solution for these two challenging… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  34. Investigation of Factorized Optical Flows as Mid-Level Representations

    Authors: Hsuan-Kung Yang, Tsu-Ching Hsiao, Ting-Hsuan Liao, Hsu-Shen Liu, Li-Yuan Tsao, Tzu-Wen Wang, Shan-Ya Yang, Yu-Wen Chen, Huang-Ru Liao, Chun-Yi Lee

    Abstract: In this paper, we introduce a new concept of incorporating factorized flow maps as mid-level representations, for bridging the perception and the control modules in modular learning based robotic frameworks. To investigate the advantages of factorized flow maps and examine their interplay with the other types of mid-level representations, we further develop a configurable framework, along with fou… ▽ More

    Submitted 10 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Ting-Hsuan Liao, Hsu-Shen Liu, Li-Yuan Tsao, Tzu-Wen Wang, and Shan-Ya Yang contributed equally to this work, names listed in alphabetical order; This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

    Journal ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  35. arXiv:2202.06276  [pdf, other

    cs.CV

    Natural Image Stitching Using Depth Maps

    Authors: Tianli Liao, Nan Li

    Abstract: Natural image stitching (NIS) aims to create one natural-looking mosaic from two overlap** images that capture the same 3D scene from different viewing positions. Challenges inevitably arise when the scene is non-planar and the camera baseline is wide, since parallax becomes not negligible in such cases. In this paper, we propose a novel NIS method using depth maps, which generates natural-looki… ▽ More

    Submitted 22 February, 2023; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: 10 pages, 8 figures, under review

  36. arXiv:2201.02312  [pdf, other

    cs.CL cs.AI

    A Transfer Learning Pipeline for Educational Resource Discovery with Application in Leading Paragraph Generation

    Authors: Irene Li, Thomas George, Alexander Fabbri, Tammy Liao, Benjamin Chen, Rina Kawamura, Richard Zhou, Vanessa Yan, Swapnil Hingmire, Dragomir Radev

    Abstract: Effective human learning depends on a wide selection of educational materials that align with the learner's current understanding of the topic. While the Internet has revolutionized human learning or education, a substantial resource accessibility barrier still exists. Namely, the excess of online information can make it challenging to navigate and discover high-quality learning materials. In this… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  37. arXiv:2112.08578  [pdf, other

    cs.CL

    CLICKER: A Computational LInguistics Classification Scheme for Educational Resources

    Authors: Swapnil Hingmire, Irene Li, Rena Kawamura, Benjamin Chen, Alexander Fabbri, Xiangru Tang, Yixin Liu, Thomas George, Tammy Liao, Wai Pan Wong, Vanessa Yan, Richard Zhou, Girish K. Palshikar, Dragomir Radev

    Abstract: A classification scheme of a scientific subject gives an overview of its body of knowledge. It can also be used to facilitate access to research articles and other materials related to the subject. For example, the ACM Computing Classification System (CCS) is used in the ACM Digital Library search interface and also for indexing computer science papers. We observed that a comprehensive classificat… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 7 pages, 5 figures, 4 tables

  38. arXiv:2109.08628  [pdf, other

    cs.LG cs.CV cs.RO

    Autonomous Vision-based UAV Landing with Collision Avoidance using Deep Learning

    Authors: Tianpei Liao, Amal Haridevan, Yibo Liu, **jun Shan

    Abstract: There is a risk of collision when multiple UAVs land simultaneously without communication on the same platform. This work accomplishes vision-based autonomous landing and uses a deep-learning-based method to realize collision avoidance during the landing process.

    Submitted 17 September, 2021; originally announced September 2021.

  39. arXiv:2012.00944  [pdf, other

    cs.CV cs.LG eess.IV

    Tensor Completion via Convolutional Sparse Coding Regularization

    Authors: Zhebin Wu, Tianchi Liao, Chuan Chen, Cong Liu, Zibin Zheng, Xiongjun Zhang

    Abstract: Tensor data often suffer from missing value problem due to the complex high-dimensional structure while acquiring them. To complete the missing information, lots of Low-Rank Tensor Completion (LRTC) methods have been proposed, most of which depend on the low-rank property of tensor data. In this way, the low-rank component of the original data could be recovered roughly. However, the shortcoming i… ▽ More

    Submitted 6 May, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

  40. arXiv:2010.11658  [pdf, other

    quant-ph cs.CC cs.CR

    On the Compressed-Oracle Technique, and Post-Quantum Security of Proofs of Sequential Work

    Authors: Kai-Min Chung, Serge Fehr, Yu-Hsuan Huang, Tai-Ning Liao

    Abstract: We revisit the so-called compressed oracle technique, introduced by Zhandry for analyzing quantum algorithms in the quantum random oracle model (QROM). To start off with, we offer a concise exposition of the technique, which easily extends to the parallel-query QROM, where in each query-round the considered algorithm may make several queries to the QROM in parallel. This variant of the QROM allows… ▽ More

    Submitted 9 July, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

  41. arXiv:2005.03457  [pdf, other

    cs.CV

    NTIRE 2020 Challenge on NonHomogeneous Dehazing

    Authors: Codruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, **g Liu, Haiyan Wu, Yuan Xie, Yanyun Qu, Lizhuang Ma, Ziling Huang, Qili Deng, Ju-Chin Chao, Tsung-Shan Yang, Peng-Wen Chen, Po-Min Hsu, Tzu-Yi Liao, Chung-En Sun, Pei-Yuan Wu, Jeonghyeok Do, Jongmin Park, Munchurl Kim, Kareem Metwaly, Xuelu Li, Tiantong Guo, Vishal Monga , et al. (27 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 Challenge on NonHomogeneous Dehazing of images (restoration of rich details in hazy image). We focus on the proposed solutions and their results evaluated on NH-Haze, a novel dataset consisting of 55 pairs of real haze free and nonhomogeneous hazy images recorded outdoor. NH-Haze is the first realistic nonhomogeneous haze dataset that provides ground truth images.… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: CVPR Workshops Proceedings 2020

  42. arXiv:1911.09176  [pdf, other

    quant-ph cs.CC cs.CR cs.DS

    Lower Bounds for Function Inversion with Quantum Advice

    Authors: Kai-Min Chung, Tai-Ning Liao, Luowen Qian

    Abstract: Function inversion is the problem that given a random function $f: [M] \to [N]$, we want to find pre-image of any image $f^{-1}(y)$ in time $T$. In this work, we revisit this problem under the preprocessing model where we can compute some auxiliary information or advice of size $S$ that only depends on $f$ but not on $y$. It is a well-studied problem in the classical settings, however, it is not c… ▽ More

    Submitted 8 April, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: ITC full version

  43. arXiv:1905.01334  [pdf, other

    cs.RO cs.AI cs.LG

    Data-efficient Learning of Morphology and Controller for a Microrobot

    Authors: Thomas Liao, Grant Wang, Brian Yang, Rene Lee, Kristofer Pister, Sergey Levine, Roberto Calandra

    Abstract: Robot design is often a slow and difficult process requiring the iterative construction and testing of prototypes, with the goal of sequentially optimizing the design. For most robots, this process is further complicated by the need, when validating the capabilities of the hardware to solve the desired task, to already have an appropriate controller, which is in turn designed and tuned for the spe… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

    Comments: Accepted at ICRA-2019. 6 pages

  44. arXiv:1809.08753  [pdf, other

    cs.MM

    An Iterative Refinement Approach for Social Media Headline Prediction

    Authors: Chih-Chung Hsu, Chia-Yen Lee, Ting-Xuan Liao, Jun-Yi Lee, Tsai-Yne Hou, Ying-Chu Kuo, **g-Wen Lin, Ching-Yi Hsueh, Zhong-Xuan Zhan, Hsiang-Chin Chien

    Abstract: In this study, we propose a novel iterative refinement approach to predict the popularity score of the social media meta-data effectively. With the rapid growth of the social media on the Internet, how to adequately forecast the view count or popularity becomes more important. Conventionally, the ensemble approach such as random forest regression achieves high and stable performance on various pre… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

    Comments: 5 pages, ACM Multimedia Conference 2018

  45. arXiv:1808.01620  [pdf, other

    cs.DB

    Schema Integration on Massive Data Sources

    Authors: Tianbao Lia, Hongzhi Wang, Jianzhong Li, Hong Gao

    Abstract: As the fundamental phrase of collecting and analyzing data, data integration is used in many applications, such as data cleaning, bioinformatics and pattern recognition. In big data era, one of the major problems of data integration is to obtain the global schema of data sources since the global schema could be hardly derived from massive data sources directly. In this paper, we attempt to solve s… ▽ More

    Submitted 5 August, 2018; originally announced August 2018.

  46. arXiv:1807.05119  [pdf, other

    cs.CV

    Learning-based Natural Geometric Matching with Homography Prior

    Authors: Yifang Xu, Tianli Liao, **g Chen

    Abstract: Geometric matching is a key step in computer vision tasks. Previous learning-based methods for geometric matching concentrate more on improving alignment quality, while we argue the importance of naturalness issue simultaneously. To deal with this, firstly, Pearson correlation is applied to handle large intra-class variations of features in feature matching stage. Then, we parametrize homography t… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 13 pages,4 figures

  47. arXiv:1805.09578  [pdf, other

    cs.CV

    Coarse-to-fine Seam Estimation for Image Stitching

    Authors: Tianli Liao, **g Chen, Yifang Xu

    Abstract: Seam-cutting and seam-driven techniques have been proven effective for handling imperfect image series in image stitching. Generally, seam-driven is to utilize seam-cutting to find a best seam from one or finite alignment hypotheses based on a predefined seam quality metric. However, the quality metrics in most methods are defined to measure the average performance of the pixels on the seam withou… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: 5 pages, 4 figures

  48. arXiv:1804.07492  [pdf, other

    cs.CV

    Graph-based Hypothesis Generation for Parallax-tolerant Image Stitching

    Authors: **g Chen, Nan Li, Tianli Liao

    Abstract: The seam-driven approach has been proven fairly effective for parallax-tolerant image stitching, whose strategy is to search for an invisible seam from finite representative hypotheses of local alignment. In this paper, we propose a graph-based hypothesis generation and a seam-guided local alignment for improving the effectiveness and the efficiency of the seam-driven approach. The experiment demo… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: 3 pages, 3 figures, 2 tables

  49. arXiv:1803.06655  [pdf, other

    cs.CV

    Ratio-Preserving Half-Cylindrical Warps for Natural Image Stitching

    Authors: Yifang Xu, **g Chen, Tianli Liao

    Abstract: A novel warp for natural image stitching is proposed that utilizes the property of cylindrical warp and a horizontal pixel selection strategy. The proposed ratio-preserving half-cylindrical warp is a combination of homography and cylindrical warps which guarantees alignment by homography and possesses less projective distortion by cylindrical warp. Unlike previous approaches applying cylindrical w… ▽ More

    Submitted 18 March, 2018; originally announced March 2018.

    Comments: 3 pages, 5 figures

  50. Single-Perspective Warps in Natural Image Stitching

    Authors: Tianli Liao, Nan Li

    Abstract: Results of image stitching can be perceptually divided into single-perspective and multiple-perspective. Compared to the multiple-perspective result, the single-perspective result excels in perspective consistency but suffers from projective distortion. In this paper, we propose two single-perspective warps for natural image stitching. The first one is a parametric warp, which is a combination of… ▽ More

    Submitted 7 March, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: 10 pages, 10 figures