Showing 1–50 of 179 results for author: Lan, Z

  1. arXiv:2407.02894  [pdf, other]

    cs.CL cs.AI

    Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation

    Authors: Zhibin Lan, Liqiang Niu, Fandong Meng, Jie Zhou, Min Zhang, Jinsong Su

    Abstract: In-image machine translation (IIMT) aims to translate an image containing texts in source language into an image containing translations in target language. In this regard, conventional cascaded methods suffer from issues such as error propagation, massive parameters, and difficulties in deployment and retaining visual characteristics of the input image. Thus, constructing end-to-end models has be…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024 Findings

  2. arXiv:2407.01894  [pdf, other]

    cs.CV cs.HC

    Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection

    Authors: Zixing Li, Chao Yan, Zhen Lan, Dengqing Tang, Xiaojia Xiang, Han Zhou, Jun Lai

    Abstract: Advanced cognition can be extracted from the human brain using brain-computer interfaces. Integrating these interfaces with computer vision techniques, which possess efficient feature extraction capabilities, can achieve more robust and accurate detection of dim targets in aerial images. However, existing target detection methods primarily concentrate on homogeneous data, lacking efficient and ver…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 18 pages,15 figures

  3. arXiv:2407.01638  [pdf, other]

    cs.SE cs.AI cs.DC cs.PL

    LASSI: An LLM-based Automated Self-Correcting Pipeline for Translating Parallel Scientific Codes

    Authors: Matthew T. Dearing, Yiheng Tao, Xingfu Wu, Zhiling Lan, Valerie Taylor

    Abstract: This paper addresses the problem of providing a novel approach to sourcing significant training data for LLMs focused on science and engineering. In particular, a crucial challenge is sourcing parallel scientific codes in the ranges of millions to billions of codes. To tackle this problem, we propose an automated pipeline framework, called LASSI, designed to translate between parallel programming…

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2406.17287  [pdf, other]

    cs.CL cs.AI

    Predicting the Big Five Personality Traits in Chinese Counselling Dialogues Using Large Language Models

    Authors: Yang Yan, Lizhi Ma, Anqi Li, Jingsong Ma, Zhenzhong Lan

    Abstract: Accurate assessment of personality traits is crucial for effective psycho-counseling, yet traditional methods like self-report questionnaires are time-consuming and biased. This study examines whether Large Language Models (LLMs) can predict the Big Five personality traits directly from counseling dialogues and introduces an innovative framework to perform the task. Our framework applies role-play an…

    Submitted 25 June, 2024; originally announced June 2024.

  5. arXiv:2406.15097  [pdf, other]

    cs.NI

    Modeling and Analysis of Application Interference on Dragonfly+

    Authors: Yao Kang, Xin Wang, Neil McGlohon, Misbah Mubarak, Sudheer Chunduri, Zhiling Lan

    Abstract: Dragonfly-class networks are considered promising interconnects for next-generation supercomputers. While Dragonfly+ networks offer more path diversity than the original Dragonfly design, they are still prone to performance variability due to their hierarchical architecture and resource sharing design. Event-driven network simulators are indispensable tools for navigating complex system desi…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted by SIGSIM PADS 2019

  6. arXiv:2406.15000  [pdf, other]

    cs.CL cs.AI

    Unveiling the Impact of Multi-Modal Interactions on User Engagement: A Comprehensive Evaluation in AI-driven Conversations

    Authors: Lichao Zhang, Jia Yu, Shuai Zhang, Long Li, Yangyang Zhong, Guanbao Liang, Yuming Yan, Qing Ma, Fangsheng Weng, Fayu Pan, Jing Li, Renjun Xu, Zhenzhong Lan

    Abstract: Large Language Models (LLMs) have significantly advanced user-bot interactions, enabling more complex and coherent dialogues. However, the prevalent text-only modality might not fully exploit the potential for effective user engagement. This paper explores the impact of multi-modal interactions, which incorporate images and audio alongside text, on user engagement in chatbot conversations. We cond…

    Submitted 21 June, 2024; originally announced June 2024.

  7. arXiv:2406.02341  [pdf, ps, other]

    physics.chem-ph

    QCDGE database, Quantum Chemistry Database with Ground- and Excited-state Properties of 450 Kilo Molecules

    Authors: Yifei Zhu, Mengge Li, Chao Xu, Zhenggang Lan

    Abstract: Due to rapid advancements in deep learning techniques, the demand for large-volume high-quality databases grows significantly in chemical research. We developed a quantum-chemistry database that includes 443,106 small organic molecules with sizes up to 10 heavy atoms including carbon (C), nitrogen (N), oxygen (O), and fluorine (F). Ground-state geometry optimizations and frequency calculations of…

    Submitted 4 June, 2024; originally announced June 2024.

  8. arXiv:2405.19942  [pdf, ps, other]

    quant-ph

    Coherent Control of Spontaneous Emission for a giant driven $Λ$-type three-level atom

    Authors: Yang ya, Sun ge, Li jing, Lu jing, Zhou lan

    Abstract: Quantum optics with giant atoms provides a new approach for implementing optical memory devices at the atomic scale. Here, we theoretically study the relaxation dynamics of a single driven three-level atom interacting with a one-dimensional waveguide via two coupling points. Under certain conditions, after the long-time dynamics, we found that the population of the giant atom can either maintain stab…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 12 pages, 5 figures

  9. arXiv:2405.18706  [pdf, other]

    cs.CV

    FocSAM: Delving Deeply into Focused Objects in Segmenting Anything

    Authors: You Huang, Zongyu Lan, Liujuan Cao, Xianming Lin, Shengchuan Zhang, Guannan Jiang, Rongrong Ji

    Abstract: The Segment Anything Model (SAM) marks a notable milestone in segmentation models, highlighted by its robust zero-shot capabilities and ability to handle diverse prompts. SAM follows a pipeline that separates interactive segmentation into image preprocessing through a large encoder and interactive inference via a lightweight decoder, ensuring efficient real-time performance. However, SAM faces sta…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  10. arXiv:2405.12669  [pdf, other]

    cs.CL

    A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

    Authors: Huangjun Shen, Liangying Shao, Wenbo Li, Zhibin Lan, Zhanyu Liu, Jinsong Su

    Abstract: In recent years, multi-modal machine translation has attracted significant interest in both academia and industry due to its superior performance. It takes both textual and visual modalities as inputs, leveraging visual context to tackle the ambiguities in source texts. In this paper, we begin by offering an exhaustive overview of 99 prior works, comprehensively summarizing representative studies…

    Submitted 22 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  11. arXiv:2405.04909  [pdf, other]

    cs.CV cs.AI

    Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models

    Authors: Zhengxing Lan, Hongbo Li, Lingshan Liu, Bo Fan, Yisheng Lv, Yilong Ren, Zhiyong Cui

    Abstract: Predicting the future trajectories of dynamic traffic actors is a cornerstone task in autonomous driving. Though existing notable efforts have resulted in impressive performance improvements, a gap persists in scene cognition and understanding of the complex traffic semantics. This paper proposes Traj-LLM, the first to investigate the potential of using Large Language Models (LLMs) without explici…

    Submitted 8 May, 2024; originally announced May 2024.

  12. arXiv:2404.14757  [pdf, other]

    cs.LG cs.AI

    Integrating Mamba and Transformer for Long-Short Range Time Series Forecasting

    Authors: Xiongxiao Xu, Yueqing Liang, Baixiang Huang, Zhiling Lan, Kai Shu

    Abstract: Time series forecasting is an important problem and plays a key role in a variety of applications including weather forecasting, stock market, and scientific simulations. Although transformers have proven to be effective in capturing dependency, the quadratic complexity of their attention mechanism prevents their further adoption in long-range time series forecasting, thus limiting them to attend to short-ra…

    Submitted 23 April, 2024; originally announced April 2024.

  13. arXiv:2404.14070  [pdf]

    cs.HC cs.CY

    No General Code of Ethics for All: Ethical Considerations in Human-bot Psycho-counseling

    Authors: Lizhi Ma, Tong Zhao, Huachuan Qiu, Zhenzhong Lan

    Abstract: The pervasive use of AI applications is increasingly influencing our everyday decisions. However, the ethical challenges associated with AI transcend conventional ethics and single-discipline approaches. In this paper, we propose aspirational ethical principles specifically tailored for human-bot psycho-counseling during an era when AI-powered mental health services are continually emerging. We ex…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 54 pages,11 tables, APA style, the tables are presented following Reference

  14. arXiv:2404.13584  [pdf, other]

    cs.CV cs.LG

    Rethink Arbitrary Style Transfer with Transformer and Contrastive Learning

    Authors: Zhanjie Zhang, Jiakai Sun, Guangyuan Li, Lei Zhao, Quanwei Zhang, Zehua Lan, Haolin Yin, Wei Xing, Huaizhong Lin, Zhiwen Zuo

    Abstract: Arbitrary style transfer holds widespread attention in research and boasts numerous practical applications. The existing methods, which either employ cross-attention to incorporate deep style attributes into content attributes or use adaptive normalization to adjust content features, fail to generate high-quality stylized images. In this paper, we introduce an innovative technique to improve the q…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted by CVIU

  15. arXiv:2404.07534  [pdf, other]

    physics.plasm-ph

    Realizing Laser-driven Deuteron Acceleration with Low Energy Spread via In-situ D$_2$O-deposited Target

    Authors: Tianyun Wei, Yasunobu Arikawa, Seyed Reza Mirfayzi, Yanjun Gu, Takehito Hayakawa, Alessio Morace, Kunioki Mima, Zechen Lan, Ryuya Yamada, Kohei Yamanoi, Koichi Honda, Sergei V. Bulanov, Akifumi Yogo

    Abstract: Generation of quasi-monoenergetic ion pulse by laser-driven acceleration is one of the hot topics in laser plasma physics. In this study, we present a new method for the in-situ deposition of an ultra-thin D$_2$O layer on the surface of an aluminum foil target utilizing a spherical D$_2$O capsule. Employing a 10$^{19}$ W/cm$^2$ laser, we achieve the acceleration of 10.8 MeV deuterons with…

    Submitted 1 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  16. Union: An Automatic Workload Manager for Accelerating Network Simulation

    Authors: Xin Wang, Misbah Mubarak, Yao Kang, Robert B. Ross, Zhiling Lan

    Abstract: With the rapid growth of machine learning applications, the workloads of future HPC systems are anticipated to be a mix of scientific simulation, big data analytics, and machine learning applications. Simulation is a great research vehicle to understand the performance implications of co-running scientific applications with big data and machine learning workloads on large-scale systems. In thi…

    Submitted 3 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  17. Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly Network

    Authors: Yao Kang, Xin Wang, Zhiling Lan

    Abstract: High-radix interconnects such as Dragonfly and its variants rely on adaptive routing to balance network traffic for optimum performance. Ideally, adaptive routing attempts to forward packets between minimal and non-minimal paths with the least congestion. In practice, current adaptive routing algorithms estimate routing path congestion based on local information such as output queue occupancy. Usi…

    Submitted 3 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  18. MRSch: Multi-Resource Scheduling for HPC

    Authors: Boyang Li, Yuping Fan, Matthew Dearing, Zhiling Lan, Paul Rich, William Allcock, Michael Papka

    Abstract: Emerging workloads in high-performance computing (HPC) are embracing significant changes, such as having diverse resource requirements instead of being CPU-centric. This advancement forces cluster schedulers to consider multiple schedulable resources during decision-making. Existing scheduling studies rely on heuristic or optimization methods, which are limited by an inability to adapt to new scen…

    Submitted 3 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  19. Interpretable Modeling of Deep Reinforcement Learning Driven Scheduling

    Authors: Boyang Li, Zhiling Lan, Michael E. Papka

    Abstract: In the field of high-performance computing (HPC), there has been recent exploration into the use of deep reinforcement learning for cluster scheduling (DRL scheduling), which has demonstrated promising outcomes. However, a significant challenge arises from the lack of interpretability in deep neural networks (DNN), rendering them as black-box models to system managers. This lack of model interpret…

    Submitted 24 March, 2024; originally announced March 2024.

  20. Study of Workload Interference with Intelligent Routing on Dragonfly

    Authors: Yao Kang, Xin Wang, Zhiling Lan

    Abstract: Dragonfly interconnect is a crucial network technology for supercomputers. To support exascale systems, network resources are shared such that links and routers are not dedicated to any node pair. While link utilization is increased, workload performance is often offset by network contention. Recently, intelligent routing built on reinforcement learning demonstrates higher network throughput with…

    Submitted 3 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  21. arXiv:2403.13250  [pdf, other]

    cs.CL

    Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models

    Authors: Huachuan Qiu, Shuai Zhang, Hongliang He, Anqi Li, Zhenzhong Lan

    Abstract: Pornographic content occurring in human-machine interaction dialogues can cause severe side effects for users in open-domain dialogue systems. However, detecting pornographic language within human-machine interaction dialogues is an important subject that is rarely studied. To advance in this direction, we introduce CensorChat, a dialogue monitoring dataset aimed at detecting whether t…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted to CSCWD 2024 (27th International Conference on Computer Supported Cooperative Work in Design). arXiv admin note: text overlap with arXiv:2309.09749

  22. arXiv:2403.11528  [pdf, other]

    physics.ins-det hep-ex nucl-ex physics.plasm-ph

    Development of neutron beamline for laser-driven neutron resonance spectroscopy

    Authors: Zechen Lan, Yasunobu Arikawa, Alessio Morace, Yuki Abe, S. Reza Mirfayzi, Tianyun Wei, Takehito Hayakawa, Akifumi Yogo

    Abstract: Recent progress of laser science provides laser-driven neutron source (LDNS), which has remarkable features such as the short pulse width. One of the key techniques to be developed for more efficient use of the LDNS is neutron collimation tubes to increase the number of neutrons arriving at a detector in the time-of-flight method. However, when a tube with a thick wall is used as a collimator the…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 10 pages, 6 figures, submitted to The European Physical Journal Plus

  23. arXiv:2403.01504  [pdf, ps, other]

    physics.optics math-ph physics.app-ph

    Spin and Orbital Angular Momenta of Electromagnetic Waves: From Classical to Quantum Forms

    Authors: Wei E. I. Sha, Zhihao Lan, Menglin L. N. Chen, Yongpin P. Chen, Sheng Sun

    Abstract: Angular momenta of electromagnetic waves are important both in concepts and applications. In this work, we systematically discuss two types of angular momenta, i.e., spin angular momentum and orbital angular momentum in various cases, e.g., with source and without source, in classical and quantum forms. Numerical results demonstrating how to extract the topological charge of a classical vortex bea…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures

    Journal ref: IEEE Journal on Multiscale and Multiphysics Computational Techniques, 2024

  24. arXiv:2402.11958  [pdf, other]

    cs.CL

    Automatic Evaluation for Mental Health Counseling using LLMs

    Authors: Anqi Li, Yu Lu, Nirui Song, Shuai Zhang, Lizhi Ma, Zhenzhong Lan

    Abstract: High-quality psychological counseling is crucial for mental health worldwide, and timely evaluation is vital for ensuring its effectiveness. However, obtaining professional evaluation for each counseling session is expensive and challenging. Existing methods that rely on self or third-party manual reports to assess the quality of counseling suffer from subjective biases and limitations of time-con…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 21 pages, 4 figures

  25. arXiv:2402.11522  [pdf, other]

    cs.CL

    Unveiling the Secrets of Engaging Conversations: Factors that Keep Users Hooked on Role-Playing Dialog Agents

    Authors: Shuai Zhang, Yu Lu, Junwen Liu, Jia Yu, Huachuan Qiu, Yuming Yan, Zhenzhong Lan

    Abstract: With the growing humanlike nature of dialog agents, people are now engaging in extended conversations that can stretch from brief moments to substantial periods of time. Understanding the factors that contribute to sustaining these interactions is crucial, yet existing studies primarily focus on short-term simulations that rarely explore such prolonged and real conversations. In this paper, w…

    Submitted 12 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  26. arXiv:2402.08900  [pdf, other]

    physics.chem-ph

    The photodissociation dynamics and ultrafast electron diffraction image of cyclobutanone from the surface hopping dynamics simulation

    Authors: Jiawei Peng, Hong Liu, Zhenggang Lan

    Abstract: The comprehension of nonadiabatic dynamics in polyatomic systems relies heavily on the simultaneous advancements in theoretical and experimental domains. The gas-phase electron diffraction (GUED) technique has attracted widespread attention as a promising tool for observing the photochemical and photophysical features at all-atomic level with high temporal and spatial resolutions. In this work, th…

    Submitted 13 February, 2024; originally announced February 2024.

  27. arXiv:2402.06772  [pdf, other]

    q-bio.QM cs.AI cs.CE cs.LG

    Retrosynthesis Prediction via Search in (Hyper) Graph

    Authors: Zixun Lan, Binjie Hong, Jiajun Zhu, Zuo Zeng, Zhenfu Liu, Limin Yu, Fei Ma

    Abstract: Predicting reactants from a specified core product stands as a fundamental challenge within organic synthesis, termed retrosynthesis prediction. Recently, semi-template-based methods and graph-edits-based methods have achieved good performance in terms of both interpretability and accuracy. However, due to their mechanisms these methods cannot predict complex reactions, e.g., reactions with multip…

    Submitted 9 February, 2024; originally announced February 2024.

  28. arXiv:2401.13919  [pdf, other]

    cs.CL cs.AI

    WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

    Authors: Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu

    Abstract: The rapid advancement of large language models (LLMs) has led to a new era marked by the development of autonomous applications in real-world scenarios, which drives innovation in creating advanced web agents. Existing web agents typically only handle one input modality and are evaluated only in simplified web simulators or static web snapshots, greatly limiting their applicability in real-world s…

    Submitted 6 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 (main). Code and data are released at https://github.com/MinorJerry/WebVoyager

  29. arXiv:2401.13178  [pdf, other]

    cs.CL cs.AI cs.LG

    AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents

    Authors: Chang Ma, Junlei Zhang, Zhihao Zhu, Cheng Yang, Yujiu Yang, Yaohui Jin, Zhenzhong Lan, Lingpeng Kong, Junxian He

    Abstract: Evaluating large language models (LLMs) as general-purpose agents is essential for understanding their capabilities and facilitating their integration into practical applications. However, the evaluation process presents substantial challenges. A primary obstacle is the benchmarking of agent performance across diverse scenarios within a unified framework, especially in maintaining partially-observ…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Preprint

  30. arXiv:2401.08259  [pdf]

    physics.chem-ph

    Ultrafast Excited-State Energy Transfer in Phenylene Ethynylene Dendrimer: Quantum Dynamics with Tensor Network Method

    Authors: Sisi Liu, Jiawei Peng, Peng Bao, Qiang Shi, Zhenggang Lan

    Abstract: Photo-induced excited-state energy transfer (EET) processes play an important role in solar energy conversion. The phenylene ethynylene (PE) dendrimers display great potential in improving the efficiency of solar cells, because of their excellent photo-harvesting and exciton-transport properties. In this work, we investigated the intramolecular EET dynamics in a dendrimer composed of two line…

    Submitted 16 January, 2024; originally announced January 2024.

  31. arXiv:2312.15730  [pdf, other]

    q-fin.TR

    Deep Reinforcement Learning for Quantitative Trading

    Authors: Maochun Xu, Zixun Lan, Zheng Tao, Jiawei Du, Zongao Ye

    Abstract: Artificial Intelligence (AI) and Machine Learning (ML) are transforming the domain of Quantitative Trading (QT) through the deployment of advanced algorithms capable of sifting through extensive financial datasets to pinpoint lucrative investment openings. AI-driven models, particularly those employing ML techniques such as deep learning and reinforcement learning, have shown great prowess in pred…

    Submitted 25 December, 2023; originally announced December 2023.

  32. arXiv:2312.07626  [pdf, other]

    physics.optics

    Isotropic gap formation, localization, and waveguiding in mesoscale Yukawa-potential amorphous structures

    Authors: Murat Can Sarihan, Alperen Govdeli, Zhihao Lan, Yildirim Batuhan Yilmaz, Mertcan Erdil, Yupei Wang, Mehmet Sirin Aras, Cenk Yanik, Nicolae Coriolan Panoiu, Chee Wei Wong, Serdar Kocaman

    Abstract: Amorphous photonic structures are mesoscopic optical structures described by electrical permittivity distributions with underlying spatial randomness. They offer a unique platform for studying a broad set of electromagnetic phenomena, including transverse Anderson localization, enhanced wave transport, and suppressed diffusion in random media. Despite this, at a more practical level, there is insu…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 9 pages, 4 figures

  33. arXiv:2312.06135  [pdf, other]

    cs.CV

    ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank

    Authors: Zhanjie Zhang, Quanwei Zhang, Guangyuan Li, Wei Xing, Lei Zhao, Jiakai Sun, Zehua Lan, Junsheng Luan, Yiling Huang, Huaizhong Lin

    Abstract: Artistic style transfer aims to repaint the content image with the learned artistic style. Existing artistic style transfer methods can be divided into two categories: small model-based approaches and pre-trained large-scale model-based approaches. Small model-based approaches can preserve the content structure, but fail to produce highly realistic stylized images and introduce artifacts and dish…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  34. arXiv:2312.04262  [pdf, other]

    cs.CL cs.HC

    PsyChat: A Client-Centric Dialogue System for Mental Health Support

    Authors: Huachuan Qiu, Anqi Li, Lizhi Ma, Zhenzhong Lan

    Abstract: Dialogue systems are increasingly integrated into mental health support to help clients facilitate exploration, gain insight, take action, and ultimately heal themselves. A practical and user-friendly dialogue system should be client-centric, focusing on the client's behaviors. However, existing dialogue systems publicly available for mental health support often concentrate solely on the counselor…

    Submitted 19 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted to CSCWD 2024 (27th International Conference on Computer Supported Cooperative Work in Design)

  35. arXiv:2311.12067  [pdf, other]

    cs.CV

    Quality and Quantity: Unveiling a Million High-Quality Images for Text-to-Image Synthesis in Fashion Design

    Authors: Jia Yu, Lichao Zhang, Zijie Chen, Fayu Pan, MiaoMiao Wen, Yuming Yan, Fangsheng Weng, Shuai Zhang, Lili Pan, Zhenzhong Lan

    Abstract: The fusion of AI and fashion design has emerged as a promising research area. However, the lack of extensive, interrelated data on clothing and try-on stages has hindered the full potential of AI in this domain. Addressing this, we present the Fashion-Diffusion dataset, a product of multiple years' rigorous effort. This dataset, the first of its kind, comprises over a million high-quality fashion…

    Submitted 18 March, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  36. arXiv:2311.09861  [pdf, other]

    cs.CL cs.AI

    ConceptPsy: A Benchmark Suite with Conceptual Comprehensiveness in Psychology

    Authors: Junlei Zhang, Hongliang He, Nirui Song, Zhanchao Zhou, Shuyuan He, Shuai Zhang, Huachuan Qiu, Anqi Li, Yong Dai, Lizhi Ma, Zhenzhong Lan

    Abstract: The critical field of psychology necessitates a comprehensive benchmark to enhance the evaluation and development of domain-specific Large Language Models (LLMs). Existing MMLU-type benchmarks, such as C-EVAL and CMMLU, include psychology-related subjects, but their limited number of questions and lack of systematic concept sampling strategies mean they cannot cover the concepts required in psycho…

    Submitted 16 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Under Review

  37. arXiv:2311.08733  [pdf, other]

    cond-mat.mes-hall physics.optics physics.plasm-ph quant-ph

    Topological States Decorated by Twig Boundary in Plasma Photonic Crystals

    Authors: Jianfei Li, Jingfeng Yao, Ying Wang, Zhongxiang Zhou, Zhihao Lan, Chengxun Yuan

    Abstract: The twig edge states in graphene-like structures are viewed as the fourth states complementary to their zigzag, bearded, and armchair counterparts. In this work, we study a rod-in-plasma system in honeycomb lattice with twig edge truncation under external magnetic fields and lattice scaling and show that twig edge states can exist in different phases of the system, such as quantum Hall phase, quan…

    Submitted 21 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  38. arXiv:2310.19651  [pdf, other]

    cs.CL

    Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace

    Authors: Chiyu Song, Zhanchao Zhou, Jianhao Yan, Yuejiao Fei, Zhenzhong Lan, Yue Zhang

    Abstract: Instruction tuning is a burgeoning method to elicit the general intelligence of Large Language Models (LLMs). However, the creation of instruction data is still largely heuristic, leading to significant variation in quantity and quality across existing datasets. While some research advocates for expanding the number of instructions, others suggest that a small set of well-chosen examples is adequa…

    Submitted 22 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  39. arXiv:2310.15204  [pdf]

    cs.LG

    Mid-Long Term Daily Electricity Consumption Forecasting Based on Piecewise Linear Regression and Dilated Causal CNN

    Authors: Zhou Lan, Ben Liu, Yi Feng, Danhuang Dong, Peng Zhang

    Abstract: Daily electricity consumption forecasting is a classical problem. Existing forecasting algorithms tend to have decreased accuracy on special dates like holidays. This study decomposes the daily electricity consumption series into three components: trend, seasonal, and residual, and constructs a two-stage prediction method using piecewise linear regression as a filter and Dilated Causal CNN as a pr…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Key words: Daily electricity consumption forecasting; time series decomposition; piecewise linear regression; Dilated Causal CNN

  40. arXiv:2310.08129  [pdf, other]

    cs.CV

    Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting

    Authors: Zijie Chen, Lichao Zhang, Fangsheng Weng, Lili Pan, Zhenzhong Lan

    Abstract: Despite significant progress in the field, it is still challenging to create personalized visual representations that align closely with the desires and preferences of individual users. This process requires users to articulate their ideas in words that are both comprehensible to the models and accurately capture their vision, posing difficulties for many users. In this paper, we tackle this chall…

    Submitted 6 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at CVPR 2024

  41. arXiv:2310.00929  [pdf, other]

    physics.ins-det physics.app-ph

    Single-Shot Laser-Driven Neutron Resonance Spectroscopy for Temperature Profiling

    Authors: Zechen Lan, Yasunobu Arikawa, S. Reza Mirfayzi, Alessio Morace, Takehito Hayakawa, Hirotaka Sato, Takashi Kamiyama, Tianyun Wei, Yuta Tatsumi, Mitsuo Koizumi, Yuki Abe, Shinsuke Fujioka, Kunioki Mima, Ryosuke Kodama, Akifumi Yogo

    Abstract: The temperature measurement of material inside of an object is one of the key technologies for control of dynamical processes. For this purpose, various techniques such as laser-based thermography and phase-contrast imaging thermography have been studied. However, it is, in principle, impossible to measure the temperature of an element inside of an object using these techniques. One of the possibl…

    Submitted 3 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  42. arXiv:2309.15289  [pdf, other]

    cs.CV cs.LG

    SEPT: Towards Efficient Scene Representation Learning for Motion Prediction

    Authors: Zhiqian Lan, Yuxuan Jiang, Yao Mu, Chen Chen, Shengbo Eben Li

    Abstract: Motion prediction is crucial for autonomous vehicles to operate safely in complex traffic environments. Extracting effective spatiotemporal relationships among traffic elements is key to accurate forecasting. Inspired by the successful practice of pretrained large language models, this paper presents SEPT, a modeling framework that leverages self-supervised learning to develop powerful spatiotempo…

    Submitted 19 December, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

  43. arXiv:2309.09749   

    cs.CL

    Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation

    Authors: Huachuan Qiu, Shuai Zhang, Hongliang He, Anqi Li, Zhenzhong Lan

    Abstract: NSFW (Not Safe for Work) content, in the context of a dialogue, can have severe side effects on users in open-domain dialogue systems. However, research on detecting NSFW language, especially sexually explicit content, within a dialogue context has significantly lagged behind. To address this issue, we introduce CensorChat, a dialogue monitoring dataset aimed at NSFW dialogue detection. Leveraging…

    Submitted 20 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: As we have submitted a final version arXiv:2403.13250, we decide to withdraw it

  44. arXiv:2309.06221  [pdf, other]

    cs.CV

    Use neural networks to recognize students' handwritten letters and incorrect symbols

    Authors: JiaJun Zhu, Zichuan Yang, Binjie Hong, Jiacheng Song, Jiwei Wang, Tianhao Chen, Shuilan Yang, Zixun Lan, Fei Ma

    Abstract: Correcting students' multiple-choice answers is a repetitive and mechanical task that can be considered an image multi-classification task. Assuming possible options are 'abcd' and the correct option is one of the four, some students may write incorrect symbols or options that do not exist. In this paper, five classifications were set up - four for possible correct options and one for other incorr…

    Submitted 12 September, 2023; originally announced September 2023.

  45. arXiv:2309.02054  [pdf, other]

    cs.CV

    An Adaptive Spatial-Temporal Local Feature Difference Method for Infrared Small-moving Target Detection

    Authors: Yongkang Zhao, Chuang Zhu, Yuan Li, Shuaishuai Wang, Zihan Lan, Yuanyuan Qiao

    Abstract: Detecting small moving targets accurately in infrared (IR) image sequences is a significant challenge. To address this problem, we propose a novel method called spatial-temporal local feature difference (STLFD) with adaptive background suppression (ABS). Our approach utilizes filters in the spatial and temporal domains and performs pixel-level ABS on the output to enhance the contrast between the…

    Submitted 5 September, 2023; originally announced September 2023.

  46. arXiv:2309.01473  [pdf, ps, other]

    math.AG math-ph

    Twisted Equivariant Gromov-Witten Theory of the Classifying Space of a Finite Group

    Authors: Zhuoming Lan, Zhengyu Zong

    Abstract: For any finite group $G$, the equivariant Gromov-Witten invariants of $[\mathbb{C}^r/G]$ can be viewed as a certain twisted Gromov-Witten invariants of the classifying stack $\mathcal{B} G$. In this paper, we use Tseng's orbifold quantum Riemann-Roch theorem to express the equivariant Gromov-Witten invariants of $[\mathbb{C}^r/G]$ as a sum over Feynman graphs, where the weight of each graph is exp…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: This paper is a non-abelian generalization of arXiv:1310.4812

  47. arXiv:2308.16392  [pdf, other]

    physics.chem-ph quant-ph

    Studies of Nonadiabatic Dynamics in the Singlet Fission Processes of Pentacene Dimer via Tensor Train Decomposition Method

    Authors: Jiawei Peng, Deping Hu, Hong Liu, Qiang Shi, Peng Bao, Zhenggang Lan

    Abstract: Singlet fission (SF) is a very significant photophysical phenomenon and possesses potential applications. In this work, we try to give the rather detailed theoretical investigation of the SF process in the stacked polyacene dimer by combining the high-level quantum chemistry calculations, and the quantum dynamics simulations based on the tensor train decomposition method. Starting from the constru…

    Submitted 30 August, 2023; originally announced August 2023.

  48. arXiv:2307.16457  [pdf, other]

    cs.CL

    A Benchmark for Understanding Dialogue Safety in Mental Health Support

    Authors: Huachuan Qiu, Tong Zhao, Anqi Li, Shuai Zhang, Hongliang He, Zhenzhong Lan

    Abstract: Dialogue safety remains a pervasive challenge in open-domain human-machine interaction. Existing approaches propose distinctive dialogue safety taxonomies and datasets for detecting explicitly harmful responses. However, these taxonomies may not be suitable for analyzing response safety in mental health support. In real-world interactions, a model response deemed acceptable in casual conversations…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: accepted to The 12th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC2023)

  49. arXiv:2307.15020  [pdf, other]

    cs.CL cs.AI

    SuperCLUE: A Comprehensive Chinese Large Language Model Benchmark

    Authors: Liang Xu, Anqi Li, Lei Zhu, Hang Xue, Changtai Zhu, Kangkang Zhao, Haonan He, Xuanwei Zhang, Qiyue Kang, Zhenzhong Lan

    Abstract: Large language models (LLMs) have shown the potential to be integrated into human daily lives. Therefore, user preference is the most critical criterion for assessing LLMs' performance in real-world scenarios. However, existing benchmarks mainly focus on measuring models' accuracy using multi-choice questions, which limits the understanding of their capabilities in real applications. We fill this…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 13 pages, 12 figures, 5 tables

  50. arXiv:2307.08487  [pdf, other]

    cs.CL

    Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models

    Authors: Huachuan Qiu, Shuai Zhang, Anqi Li, Hongliang He, Zhenzhong Lan

    Abstract: Considerable research efforts have been devoted to ensuring that large language models (LLMs) align with human values and generate safe text. However, an excessive focus on sensitivity to certain topics can compromise the model's robustness in following instructions, thereby impacting its overall performance in completing tasks. Previous benchmarks for jailbreaking LLMs have primarily focused on e…

    Submitted 28 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Code and data are available at https://github.com/qiuhuachuan/latent-jailbreak