Showing 1–29 of 29 results for author: Qian, B

  1. arXiv:2407.01511  [pdf, other]

    cs.AI

    CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

    Authors: Tianqi Xu, Linyao Chen, Dai-Jie Wu, Yanjun Chen, Zecheng Zhang, Xiang Yao, Zhiqiang Xie, Yongchao Chen, Shilong Liu, Bochen Qian, Philip Torr, Bernard Ghanem, Guohao Li

    Abstract: The development of autonomous agents increasingly relies on Multimodal Language Models (MLMs) to perform tasks described in natural language with GUI environments, such as websites, desktop computers, or mobile phones. Existing benchmarks for MLM agents in interactive environments are limited by their focus on a single environment, lack of detailed and generalized evaluation methods, and the compl…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.16786  [pdf, other]

    cs.CE

    Generalized and high-efficiency arbitrary-positioned buffer for smoothed particle hydrodynamics

    Authors: Shuoguo Zhang, Yu Fan, Yaru Ren, Bin Qian, Xiangyu Hu

    Abstract: This paper develops an arbitrary-positioned buffer for the smoothed particle hydrodynamics (SPH) method, whose generality and high efficiency are achieved through two techniques. First, with the local coordinate system established at each arbitrary-positioned in-/outlet, particle positions in the global coordinate system are transformed into those in it via coordinate transformation. Since one loc…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 34 pages and 17 figures

  3. arXiv:2406.09794  [pdf, other]

    cs.CV

    SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis

    Authors: Teng Hu, Ran Yi, Baihong Qian, Jiangning Zhang, Paul L. Rosin, Yu-Kun Lai

    Abstract: SVG (Scalable Vector Graphics) is a widely used graphics format that possesses excellent scalability and editability. Image vectorization, which aims to convert raster images to SVGs, is an important yet challenging problem in computer vision and graphics. Existing image vectorization methods either suffer from low reconstruction accuracy for complex images or require long computation time. To add…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: CVPR 2024

  4. arXiv:2403.19898  [pdf, other]

    cs.CV

    Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting

    Authors: Haipeng Liu, Yang Wang, Biao Qian, Meng Wang, Yong Rui

    Abstract: Denoising diffusion probabilistic models for image inpainting aim to add the noise to the texture of image during the forward process and recover masked regions with unmasked ones of the texture via the reverse denoising process. Despite the meaningful semantics generation, the existing arts suffer from the semantic discrepancy between masked and unmasked regions, since the semantically dense unma…

    Submitted 31 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 15 pages, 10 figures, to appear CVPR 2024

  5. arXiv:2402.14034  [pdf, other]

    cs.MA cs.AI

    AgentScope: A Flexible yet Robust Multi-Agent Platform

    Authors: Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou

    Abstract: With the rapid advancement of Large Language Models (LLMs), significant progress has been made in multi-agent applications. However, the complexities in coordinating agents' cooperation and LLMs' erratic performance pose notable challenges in developing robust and efficient multi-agent applications. To tackle these challenges, we propose AgentScope, a developer-centric multi-agent platform with me…

    Submitted 20 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: We have released code on https://github.com/modelscope/agentscope

  6. arXiv:2402.11505  [pdf, other]

    cs.CL cs.AI

    Federated Fine-tuning of Large Language Models under Heterogeneous Tasks and Client Resources

    Authors: Jiamu Bai, Daoyuan Chen, Bingchen Qian, Liuyi Yao, Yaliang Li

    Abstract: Federated Learning (FL) has recently been applied to the parameter-efficient fine-tuning of Large Language Models (LLMs). While promising, it raises significant challenges due to the heterogeneous resources and data distributions of clients. This study introduces FlexLoRA, a simple yet effective aggregation scheme for LLM fine-tuning, which mitigates the ``bucket effect'' in traditional FL that re…

    Submitted 30 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 19 pages, 13 tables, 9 figures

  7. arXiv:2401.04143  [pdf, other]

    cs.CV

    RHOBIN Challenge: Reconstruction of Human Object Interaction

    Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

    Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 tables, 7 figures. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

  8. arXiv:2312.06353  [pdf, other]

    cs.LG cs.DC

    Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes

    Authors: Zhen Qin, Daoyuan Chen, Bingchen Qian, Bolin Ding, Yaliang Li, Shuiguang Deng

    Abstract: Pre-trained large language models (LLMs) need fine-tuning to improve their responsiveness to natural language instructions. Federated learning offers a way to fine-tune LLMs using the abundant data on end devices without compromising data privacy. Most existing federated fine-tuning methods for LLMs rely on parameter-efficient fine-tuning techniques, which may not reach the performance height poss…

    Submitted 27 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to ICML 2024. 25 pages, 14 figures, 7 tables. Codes are available at https://github.com/alibaba/FederatedScope/tree/FedKSeed

  9. arXiv:2309.00363  [pdf, other]

    cs.LG

    FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning

    Authors: Weirui Kuang, Bingchen Qian, Zitao Li, Daoyuan Chen, Dawei Gao, Xuchen Pan, Yuexiang Xie, Yaliang Li, Bolin Ding, Jingren Zhou

    Abstract: LLMs have demonstrated great capabilities in various NLP tasks. Different entities can further improve the performance of those LLMs on their specific downstream tasks by fine-tuning LLMs. When several entities have similar interested tasks, but their data cannot be shared because of privacy concerns regulations, federated learning (FL) is a mainstream solution to leverage the data of different en…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: Source code: https://github.com/alibaba/FederatedScope/tree/llm

  10. arXiv:2308.16458  [pdf, other]

    cs.LG cs.AI cs.CL

    BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models

    Authors: Xiangru Tang, Bill Qian, Rick Gao, Jiakang Chen, Xinyun Chen, Mark Gerstein

    Abstract: Pre-trained large language models (LLMs) have significantly improved code generation. As these models scale up, there is an increasing need for the output to handle more intricate tasks and to be appropriately specialized to particular domains. Here, we target bioinformatics due to the amount of domain knowledge, algorithms, and data operations this discipline requires. We present BioCoder, a benc…

    Submitted 20 May, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

  11. arXiv:2307.16789  [pdf, other]

    cs.AI cs.CL cs.LG

    ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

    Authors: Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Lauren Hong, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun

    Abstract: Despite the advancements of open-source large language models (LLMs), e.g., LLaMA, they remain significantly limited in tool-use capabilities, i.e., using external tools (APIs) to fulfill human instructions. The reason is that current instruction tuning largely focuses on basic language tasks but ignores the tool-use domain. This is in contrast to the excellent tool-use capabilities of state-of-th…

    Submitted 3 October, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

  12. arXiv:2304.02389  [pdf, other]

    eess.IV cs.CV cs.LG

    DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images

    Authors: Bo Qian, Hao Chen, Xiangning Wang, Haoxuan Che, Gitaek Kwon, Jaeyoung Kim, Sungjin Choi, Seoyoung Shin, Felix Krause, Markus Unterdechler, Junlin Hou, Rui Feng, Yihao Li, Mostafa El Habib Daho, Qiang Wu, Ping Zhang, Xiaokang Yang, Yiyu Cai, Weiping Jia, Huating Li, Bin Sheng

    Abstract: Computer-assisted automatic analysis of diabetic retinopathy (DR) is of great importance in reducing the risks of vision loss and even blindness. Ultra-wide optical coherence tomography angiography (UW-OCTA) is a non-invasive and safe imaging modality in DR diagnosis system, but there is a lack of publicly available benchmarks for model development and evaluation. To promote further research and s…

    Submitted 5 April, 2023; originally announced April 2023.

  13. arXiv:2303.06869  [pdf, other]

    cs.CV

    Adaptive Data-Free Quantization

    Authors: Biao Qian, Yang Wang, Richang Hong, Meng Wang

    Abstract: Data-free quantization (DFQ) recovers the performance of quantized network (Q) without the original data, but generates the fake sample via a generator (G) by learning from full-precision network (P), which, however, is totally independent of Q, overlooking the adaptability of the knowledge from generated samples, i.e., informative or not to the learning process of Q, resulting into the overflow o…

    Submitted 20 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 9 pages, 6 figures, Refined camera ready version for CVPR 2023

  14. arXiv:2302.09572  [pdf, other]

    cs.CV

    Rethinking Data-Free Quantization as a Zero-Sum Game

    Authors: Biao Qian, Yang Wang, Richang Hong, Meng Wang

    Abstract: Data-free quantization (DFQ) recovers the performance of quantized network (Q) without accessing the real data, but generates the fake sample via a generator (G) by learning from full-precision network (P) instead. However, such sample generation process is totally independent of Q, specialized as failing to consider the adaptability of the generated samples, i.e., beneficial or adversarial, over…

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 9 pages, 7 figures, accepted by AAAI 2023

  15. arXiv:2302.08706  [pdf, ps, other]

    cs.CV

    Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis

    Authors: Haoran Sun, Yang Wang, Haipeng Liu, Biao Qian

    Abstract: Text-to-image synthesis refers to generating visual-realistic and semantically consistent images from given textual descriptions. Previous approaches generate an initial low-resolution image and then refine it to be high-resolution. Despite the remarkable progress, these methods are limited in fully utilizing the given texts and could generate text-mismatched images, especially when the text descr…

    Submitted 20 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 13 pages, 8 figures, accepted by Chinese Journal of Electronics

  16. arXiv:2209.04996  [pdf, other]

    cs.CV

    Switchable Online Knowledge Distillation

    Authors: Biao Qian, Yang Wang, Hongzhi Yin, Richang Hong, Meng Wang

    Abstract: Online Knowledge Distillation (OKD) improves the involved models by reciprocally exploiting the difference between teacher and student. Several crucial bottlenecks over the gap between them -- e.g., Why and when does a large gap harm the performance, especially for student? How to quantify the gap between teacher and student? -- have received limited formal study. In this paper, we propose Switcha…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 16 pages, 7 figures, accepted by ECCV 2022

  17. arXiv:2207.09277  [pdf, ps, other]

    cs.IT

    Covering Grassmannian Codes: Bounds and Constructions

    Authors: Bingchen Qian, Xin Wang, Chengfei Xie, Gennian Ge

    Abstract: Grassmannian $\mathcal{G}_q(n,k)$ is the set of all $k$-dimensional subspaces of the vector space $\mathbb{F}_q^n.$ Recently, Etzion and Zhang introduced a new notion called covering Grassmannian code which can be used in network coding solutions for generalized combination networks. An $α$-$(n,k,δ)_q^c$ covering Grassmannian code $\mathcal{C}$ is a subset of $\mathcal{G}_q(n,k)$ such that every s…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: 17 pages

  18. arXiv:2108.07987   

    cs.IT math.CO

    Improved Lower Bounds for Secure Codes and Related Structures

    Authors: Bingchen Qian, Xin Wang, Gennian Ge

    Abstract: Secure codes are widely-studied combinatorial structures which were introduced for traitor tracing in broadcast encryption. To determine the maximum size of such structures is the main research objective. In this paper, we investigate the lower bounds for secure codes and their related structures. First, we give some improved lower bounds for the rates of $2$-frameproof codes and $\overline{2}$-se…

    Submitted 21 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: There are errors in Sections II and III. The proofs of Theorems II.6 and III.3 are wrong

  19. arXiv:2107.08715  [pdf, other]

    cs.CV

    RECIST-Net: Lesion detection via grouping keypoints on RECIST-based annotation

    Authors: Cong Xie, Shilei Cao, Dong Wei, Hongyu Zhou, Kai Ma, Xianli Zhang, Buyue Qian, Liansheng Wang, Yefeng Zheng

    Abstract: Universal lesion detection in computed tomography (CT) images is an important yet challenging task due to the large variations in lesion type, size, shape, and appearance. Considering that data in clinical routine (such as the DeepLesion dataset) are usually annotated with a long and a short diameter according to the standard of Response Evaluation Criteria in Solid Tumors (RECIST) diameters, we p…

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: 5 pages, 3 figures, IEEE ISBI 2021

  20. arXiv:2009.02625  [pdf, other]

    cs.LG cs.AI cs.IR

    Online Disease Self-diagnosis with Inductive Heterogeneous Graph Convolutional Networks

    Authors: Zifeng Wang, Rui Wen, Xi Chen, Shilei Cao, Shao-Lun Huang, Buyue Qian, Yefeng Zheng

    Abstract: We propose a Healthcare Graph Convolutional Network (HealGCN) to offer disease self-diagnosis service for online users based on Electronic Healthcare Records (EHRs). Two main challenges are focused in this paper for online disease diagnosis: (1) serving cold-start users via graph convolutional networks and (2) handling scarce clinical description via a symptom retrieval system. To this end, we fir…

    Submitted 12 February, 2021; v1 submitted 5 September, 2020; originally announced September 2020.

  21. arXiv:2007.05887  [pdf]

    cs.CV

    Train Your Data Processor: Distribution-Aware and Error-Compensation Coordinate Decoding for Human Pose Estimation

    Authors: Feiyu Yang, Zhan Song, Zhenzhong Xiao, Yu Chen, Zhe Pan, Min Zhang, Min Xue, Yaoyang Mo, Yao Zhang, Guoxiong Guan, Beibei Qian

    Abstract: Recently, the leading performance of human pose estimation is dominated by heatmap based methods. While being a fundamental component of heatmap processing, heatmap decoding (i.e. transforming heatmaps to coordinates) receives only limited investigations, to our best knowledge. This work fills the gap by studying the heatmap decoding processing with a particular focus on the errors introduced thro…

    Submitted 17 July, 2020; v1 submitted 11 July, 2020; originally announced July 2020.

    Comments: Improve the state-of-the-art of COCO keypoint detection challenge by 1-2 AP. Project page: https://github.com/fyang235/DAEC

  22. arXiv:1912.00418  [pdf, other]

    cs.CV

    Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition

    Authors: Biao Qian, Yang Wang, Zhao Zhang, Richang Hong, Meng Wang, Ling Shao

    Abstract: Deep convolutional neural networks have largely benefited computer vision tasks. However, the high computational complexity limits their real-world applications. To this end, many methods have been proposed for efficient network learning, and applications in portable mobile devices. In this paper, we propose a novel \underline{M}oving-\underline{M}obile-\underline{Net}work, named M$^2$Net, for lan…

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 9 pages, 11 figures

  23. arXiv:1910.05433  [pdf, other]

    cs.DC cs.LG cs.NI

    Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey

    Authors: Bin Qian, Jie Su, Zhenyu Wen, Devki Nandan Jha, Yinhao Li, Yu Guan, Deepak Puthal, Philip James, Renyu Yang, Albert Y. Zomaya, Omer Rana, Lizhe Wang, Maciej Koutny, Rajiv Ranjan

    Abstract: Machine Learning (ML) and Internet of Things (IoT) are complementary advances: ML techniques unlock complete potentials of IoT with intelligence, and IoT applications increasingly feed data collected by sensors into ML models, thereby employing results to improve their business processes and services. Hence, orchestrating ML pipelines that encompasses model training and implication involved in hol…

    Submitted 29 May, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: 50 pages, Accepted by ACM Computing Surveys (CSUR)

  24. arXiv:1907.05271  [pdf]

    cs.CV

    A Targeted Acceleration and Compression Framework for Low bit Neural Networks

    Authors: Biao Qian, Yang Wang

    Abstract: 1 bit deep neural networks (DNNs), of which both the activations and weights are binarized, are attracting more and more attention due to their high computational efficiency and low memory requirement. However, the drawback of large accuracy dropping also restricts its application. In this paper, we propose a novel Targeted Acceleration and Compression (TAC) framework to improve the performance…

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 10 pages

  25. arXiv:1906.03074  [pdf, other]

    cs.CY cs.AI cs.LG

    Modeling e-Learners' Cognitive and Metacognitive Strategy in Comparative Question Solving

    Authors: Feng Tian, Jia Yue, Kuo-ming Chao, Buyue Qian, Nazaraf Shah, Longzhuang Li, Haiping Zhu, Yan Chen, Bin Zeng, Qinghua Zheng

    Abstract: Cognitive and metacognitive strategy had demonstrated a significant role in self-regulated learning (SRL), and an appropriate use of strategies is beneficial to effective learning or question-solving tasks during a human-computer interaction process. This paper proposes a novel method combining Knowledge Map (KM) based data mining technique with Thinking Map (TM) to detect learner's cognitive and…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: 12 pages, 15 figures

  26. arXiv:1902.03376  [pdf, other]

    stat.ML cs.AI cs.LG

    Measuring Patient Similarities via a Deep Architecture with Medical Concept Embedding

    Authors: Zihao Zhu, Changchang Yin, Buyue Qian, Yu Cheng, Jishang Wei, Fei Wang

    Abstract: Evaluating the clinical similarities between pairwise patients is a fundamental problem in healthcare informatics. A proper patient similarity measure enables various downstream applications, such as cohort study and treatment comparative effectiveness research. One major carrier for conducting patient similarity research is Electronic Health Records (EHRs), which are usually heterogeneous, longitu…

    Submitted 9 February, 2019; originally announced February 2019.

    Comments: Published in ICDM 2016, arXiv version. Code link is added

  27. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  28. arXiv:1202.0855  [pdf, ps, other]

    cs.LG stat.ML

    A Reconstruction Error Formulation for Semi-Supervised Multi-task and Multi-view Learning

    Authors: Buyue Qian, Xiang Wang, Ian Davidson

    Abstract: A significant challenge to make learning techniques more suitable for general purpose use is to move beyond i) complete supervision, ii) low dimensional data, iii) a single task and single view per instance. Solving these challenges allows working with "Big Data" problems that are typically high dimensional with multiple (but possibly incomplete) labelings and views. While other work has addressed…

    Submitted 3 February, 2012; originally announced February 2012.

  29. On Constrained Spectral Clustering and Its Applications

    Authors: Xiang Wang, Buyue Qian, Ian Davidson

    Abstract: Constrained clustering has been well-studied for algorithms such as $K$-means and hierarchical clustering. However, how to satisfy many constraints in these algorithmic settings has been shown to be intractable. One alternative to encode many constraints is to use spectral clustering, which remains a developing area. In this paper, we propose a flexible framework for constrained spectral clusterin…

    Submitted 21 September, 2012; v1 submitted 25 January, 2012; originally announced January 2012.

    Comments: Data Mining and Knowledge Discovery, 2012

    ACM Class: H.2.8