Skip to main content

Showing 1–50 of 924 results for author: Jimmy

.
  1. arXiv:2407.08402  [pdf, ps, other

    cond-mat.mtrl-sci

    Selective area epitaxy of in-plane HgTe nanostrcutures on CdTe(001) substrate

    Authors: Nicolas Chaize, Xavier Baudry, Pierre-Henri Jouneau, Eric Gautier, Jean-Luc Rouvière, Yves Deblock, Jimmy Xu, Maxime Berthe, Clément Barbot, Bruno Grandidier, Ludovic Desplanque, Hermann Sellier, Philippe Ballet

    Abstract: Semiconductor nanowires are believed to play a crucial role for future applications in electronics, spintronics and quantum technologies. A potential candidate is HgTe but its sensitivity to nanofabrication processes restrain its development. A way to circumvent this obstacle is the selective area growth technique. Here, in-plane HgTe nanostructures are grown thanks to selective area molecular bea… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 18 pages and 8 figures. Submitted to Nanotechnology

  2. Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR

    Authors: Nandan Thakur, Luiz Bonifacio, Maik Fröbe, Alexander Bondarenko, Ehsan Kamalloo, Martin Potthast, Matthias Hagen, Jimmy Lin

    Abstract: The zero-shot effectiveness of neural retrieval models is often evaluated on the BEIR benchmark -- a combination of different IR evaluation datasets. Interestingly, previous studies found that particularly on the BEIR subset Touché 2020, an argument retrieval task, neural retrieval models are considerably less effective than BM25. Still, so far, no further investigation has been conducted on what… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: SIGIR 2024 (Resource & Reproducibility Track)

  3. arXiv:2407.07279  [pdf, other

    cs.LG stat.ML

    Towards a theory of learning dynamics in deep state space models

    Authors: Jakub Smékal, Jimmy T. H. Smith, Michael Kleinman, Dan Biderman, Scott W. Linderman

    Abstract: State space models (SSMs) have shown remarkable empirical performance on many long sequence modeling tasks, but a theoretical understanding of these models is still lacking. In this work, we study the learning dynamics of linear SSMs to understand how covariance structure in data, latent state size, and initialization affect the evolution of parameters throughout learning with gradient descent. We… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  4. arXiv:2407.04684  [pdf, other

    astro-ph.HE

    Investigating the Mass of the Black Hole and Possible Wind Outflow of the Accretion Disk in the Tidal Disruption Event AT2021ehb

    Authors: Xin Xiang, Jon M. Miller, Abderahmen Zoghbi, Mark T. Reynolds, David Bogensberger, Lixin Dai, Paul A. Draghis, Jeremy J. Drake, Olivier Godet, Jimmy A. Irwin, Michael C. Miller, Brenna E. Mockler, Richard Saxton, Natalie Webb

    Abstract: Tidal disruption events (TDEs) can potentially probe low-mass black holes in host galaxies that might not adhere to bulge or stellar-dispersion relationships. At least initially, TDEs can also reveal super-Eddington accretion. X-ray spectroscopy can potentially constrain black hole masses, and reveal ionized outflows associated with super-Eddington accretion. Our analysis of XMM-Newton X-ray obser… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 19 pages, 4 figures

  5. arXiv:2407.04069  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

    Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang

    Abstract: Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them in real-world applications to ensure they produce reliable performance. Despite the well-established importance of evaluating LLMs in the community, the comple… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  6. arXiv:2406.18762  [pdf, other

    cs.CL

    Categorical Syllogisms Revisited: A Review of the Logical Reasoning Abilities of LLMs for Analyzing Categorical Syllogism

    Authors: Shi Zong, Jimmy Lin

    Abstract: There have been a huge number of benchmarks proposed to evaluate how large language models (LLMs) behave for logic inference tasks. However, it remains an open question how to properly evaluate this ability. In this paper, we provide a systematic overview of prior works on the logical reasoning ability of LLMs for analyzing categorical syllogisms. We first investigate all the possible variations f… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  7. arXiv:2406.17216  [pdf, other

    cs.LG cs.AI cs.CR cs.CY

    Machine Unlearning Fails to Remove Data Poisoning Attacks

    Authors: Martin Pawelczyk, Jimmy Z. Di, Yiwei Lu, Gautam Kamath, Ayush Sekhari, Seth Neel

    Abstract: We revisit the efficacy of several practical methods for approximate machine unlearning developed for large-scale deep learning. In addition to complying with data deletion requests, one often-cited potential application for unlearning methods is to remove the effects of training on poisoned data. We experimentally demonstrate that, while existing unlearning methods have been demonstrated to be ef… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  8. arXiv:2406.16828  [pdf, other

    cs.IR cs.AI cs.CL

    Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

    Authors: Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Did you try out the new Bing Search? Or maybe you fiddled around with Google AI~Overviews? These might sound familiar because the modern-day search stack has recently evolved to include retrieval-augmented generation (RAG) systems. They allow searching and incorporating real-time data into large language models (LLMs) to provide a well-informed, attributed, concise summary in contrast to the tradi… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  9. arXiv:2406.16671  [pdf, other

    cs.RO

    STAR: Swarm Technology for Aerial Robotics Research

    Authors: Jimmy Chiun, Yan Rui Tan, Yuhong Cao, John Tan, Guillaume Sartoretti

    Abstract: In recent years, the field of aerial robotics has witnessed significant progress, finding applications in diverse domains, including post-disaster search and rescue operations. Despite these strides, the prohibitive acquisition costs associated with deploying physical multi-UAV systems have posed challenges, impeding their widespread utilization in research endeavors. To overcome these challenges,… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  10. arXiv:2406.11704  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.11251  [pdf, other

    cs.IR

    Unifying Multimodal Retrieval via Document Screenshot Embedding

    Authors: Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin

    Abstract: In the real world, documents are organized in different formats and varied modalities. Traditional retrieval pipelines require tailored document parsing techniques and content extraction modules to prepare input for indexing. This process is tedious, prone to errors, and has information loss. To this end, we propose Document Screenshot Embedding} (DSE), a novel retrieval paradigm that regards docu… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  12. arXiv:2406.10393  [pdf, other

    cs.CL

    EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems

    Authors: Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh

    Abstract: The emerging citation-based QA systems are gaining more attention especially in generative AI search applications. The importance of extracted knowledge provided to these systems is vital from both accuracy (completeness of information) and efficiency (extracting the information in a timely manner). In this regard, citation-based QA systems are suffering from two shortcomings. First, they usually… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  13. arXiv:2406.09355  [pdf, other

    cs.IR

    Can't Hide Behind the API: Stealing Black-Box Commercial Embedding Models

    Authors: Manveer Singh Tamber, Jasper Xian, Jimmy Lin

    Abstract: Embedding models that generate representation vectors from natural language text are widely used, reflect substantial investments, and carry significant commercial value. Companies such as OpenAI and Cohere have developed competing embedding models accessed through APIs that require users to pay for usage. In this architecture, the models are "hidden" behind APIs, but this does not mean that they… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  14. arXiv:2406.08673  [pdf, ps, other

    cs.CL cs.AI cs.LG

    HelpSteer2: Open-source dataset for training top-performing reward models

    Authors: Zhilin Wang, Yi Dong, Olivier Delalleau, Jiaqi Zeng, Gerald Shen, Daniel Egert, Jimmy J. Zhang, Makesh Narsimhan Sreedhar, Oleksii Kuchaiev

    Abstract: High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences. As LLMs become stronger and better aligned, permissively licensed preference datasets, such as Open Assistant, HH-RLHF, and HelpSteer need to be updated to remain effective for reward modeling. Methods… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  15. arXiv:2406.08482  [pdf, other

    cs.CV cs.CL

    Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation

    Authors: Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture

    Abstract: Diffusion models are the state of the art in text-to-image generation, but their perceptual variability remains understudied. In this paper, we examine how prompts affect image variability in black-box diffusion-based models. We propose W1KP, a human-calibrated measure of variability in a set of images, bootstrapped from existing image-pair perceptual distances. Current datasets do not cover recen… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 13 pages, 11 figures

  16. arXiv:2406.06519  [pdf, other

    cs.IR

    UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor

    Authors: Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Nick Craswell, Jimmy Lin

    Abstract: Copious amounts of relevance judgments are necessary for the effective training and accurate evaluation of retrieval systems. Conventionally, these judgments are made by human assessors, rendering this process expensive and laborious. A recent study by Thomas et al. from Microsoft Bing suggested that large language models (LLMs) can accurately perform the relevance assessment task and provide huma… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures

  17. arXiv:2406.05364  [pdf, other

    cs.CR cs.AI

    Is On-Device AI Broken and Exploitable? Assessing the Trust and Ethics in Small Language Models

    Authors: Kalyan Nakka, Jimmy Dani, Nitesh Saxena

    Abstract: In this paper, we present a very first study to investigate trust and ethical implications of on-device artificial intelligence (AI), focusing on ''small'' language models (SLMs) amenable for personal devices like smartphones. While on-device SLMs promise enhanced privacy, reduced latency, and improved user experience compared to cloud-based services, we posit that they might also introduce signif… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 26 pages, 31 figures and 5 tables

  18. arXiv:2406.00594  [pdf

    cs.IT

    Artificial General Intelligence (AGI) for the oil and gas industry: a review

    Authors: Jimmy Xuekai Li, Tiancheng Zhang, Yiran Zhu, Zhongwei Chen

    Abstract: Artificial General Intelligence (AGI) is set to profoundly impact the oil and gas industry by introducing unprecedented efficiencies and innovations. This paper explores AGI's foundational principles and its transformative applications, particularly focusing on the advancements brought about by large language models (LLMs) and extensive computer vision systems in the upstream sectors of the indust… ▽ More

    Submitted 11 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: 20 Pages, Review paper, 15 Figures

  19. arXiv:2405.19683  [pdf, other

    cs.CR cs.LG

    Breaking Indistinguishability with Transfer Learning: A First Look at SPECK32/64 Lightweight Block Ciphers

    Authors: Jimmy Dani, Kalyan Nakka, Nitesh Saxena

    Abstract: In this research, we introduce MIND-Crypt, a novel attack framework that uses deep learning (DL) and transfer learning (TL) to challenge the indistinguishability of block ciphers, specifically SPECK32/64 encryption algorithm in CBC mode (Cipher Block Chaining) against Known Plaintext Attacks (KPA). Our methodology includes training a DL model with ciphertexts of two messages encrypted using the sa… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  20. arXiv:2405.19325  [pdf, other

    cs.CL

    Nearest Neighbor Speculative Decoding for LLM Generation and Attribution

    Authors: Minghan Li, Xilun Chen, Ari Holtzman, Beidi Chen, Jimmy Lin, Wen-tau Yih, Xi Victoria Lin

    Abstract: Large language models (LLMs) often hallucinate and lack the ability to provide attribution for their generations. Semi-parametric LMs, such as kNN-LM, approach these limitations by refining the output of an LM for a given prompt using its nearest neighbor matches in a non-parametric data store. However, these models often exhibit slow inference speeds and produce non-fluent texts. In this paper, w… ▽ More

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  21. arXiv:2405.18685  [pdf, other

    astro-ph.GA astro-ph.CO

    Low-Mass Galaxy Interactions Trigger Black Hole Activity

    Authors: Marko Mićić, Jimmy A. Irwin, Preethi Nair, Brenna N. Wells, Olivia J. Holmes, Jackson T. Eames

    Abstract: The existence of high-$z$ over-massive supermassive black holes represents a major conundrum in our understanding of black hole evolution. In this paper, we probe from the observational point of view how early Universe environmental conditions could have acted as an evolutionary mechanism for the accelerated growth of the first black holes. Under the assumption that the early Universe is dominated… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 19 pages, 5 figures, 4 tables. Accepted for publication in the Astrophysical Journal Letters

  22. arXiv:2405.17387  [pdf, other

    eess.SP

    Batteryless BLE and Light-based IoT Sensor Nodes for Reliable Environmental Sensing

    Authors: Jimmy Fernandez Landivar, Khojiakbar Botirov, Hazem Sallouha, Marcos Katz, Sofie Pollin

    Abstract: The sustainable design of Internet of Things (IoT) networks encompasses considerations related to energy efficiency and autonomy as well as considerations related to reliable communications, ensuring no energy is wasted on undelivered data. Under these considerations, this work proposes the design and implementation of energy-efficient Bluetooth Low Energy (BLE) and Light-based IoT (LIoT) batteryl… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 6 pages, 9 figures, accepted for publication in the IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC 2024), Valencia, Spain

    MSC Class: 94C30 ACM Class: I.2.9

  23. arXiv:2405.16759  [pdf, other

    cs.CV cs.LG

    Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

    Authors: Cristina N. Vasconcelos, Abdullah Rashwan, Austin Waters, Trevor Walker, Keyang Xu, Jimmy Yan, Rui Qian, Shixin Luo, Zarana Parekh, Andrew Bunner, Hongliang Fei, Roopal Garg, Mandy Guo, Ivana Kajic, Yeqing Li, Henna Nandwani, Jordi Pont-Tuset, Yasumasa Onoe, Sarah Rosston, Su Wang, Wenlei Zhou, Kevin Swersky, David J. Fleet, Jason M. Baldridge, Oliver Wang

    Abstract: We address the long-standing problem of how to learn effective pixel-based image diffusion models at scale, introducing a remarkably simple greedy growing method for stable training of large-scale, high-resolution models. without the needs for cascaded super-resolution components. The key insight stems from careful pre-training of core components, namely, those responsible for text-to-image alignm… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  24. arXiv:2405.10961  [pdf, other

    cond-mat.soft cs.RO

    Simplified discrete model for axisymmetric dielectric elastomer membranes with robotic applications

    Authors: Zhaowei Liu, Mingchao Liu, K. Jimmy Hsia, Xiaonan Huang, Weicheng Huang

    Abstract: Soft robots utilizing inflatable dielectric membranes can realize intricate functionalities through the application of non-mechanical fields. However, given the current limitations in simulations, including low computational efficiency and difficulty in dealing with complex external interactions, the design and control of such soft robots often require trial and error. Thus, a novel one-dimensiona… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: 27 pages, 8 figures

  25. arXiv:2405.10311  [pdf, other

    cs.IR

    UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models

    Authors: Sahel Sharifymoghaddam, Shivani Upadhyay, Wenhu Chen, Jimmy Lin

    Abstract: Recently, Multi-Modal(MM) Large Language Models(LLMs) have unlocked many complex use-cases that require MM understanding (e.g., image captioning or visual question answering) and MM generation (e.g., text-guided image generation or editing) capabilities. To further improve the output fidelity of MM-LLMs we introduce the model-agnostic UniRAG technique that adds relevant retrieved information to pr… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures

  26. arXiv:2405.07503  [pdf, other

    cs.RO cs.AI

    Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation

    Authors: Aaditya Prasad, Kevin Lin, Jimmy Wu, Linqi Zhou, Jeannette Bohg

    Abstract: Many robotic systems, such as mobile manipulators or quadrotors, cannot be equipped with high-end GPUs due to space, weight, and power constraints. These constraints prevent these systems from leveraging recent developments in visuomotor policy architectures that require high-end GPUs to achieve fast policy inference. In this paper, we propose Consistency Policy, a faster and similarly powerful al… ▽ More

    Submitted 28 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: https://consistency-policy.github.io/

  27. arXiv:2405.06147  [pdf, other

    cs.LG eess.SY

    State-Free Inference of State-Space Models: The Transfer Function Approach

    Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

    Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of… ▽ More

    Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF

  28. arXiv:2405.05562  [pdf, other

    cs.IR

    Review-based Recommender Systems: A Survey of Approaches, Challenges and Future Perspectives

    Authors: Emrul Hasan, Mizanur Rahman, Chen Ding, Jimmy Xiangji Huang, Shaina Raza

    Abstract: Recommender systems play a pivotal role in hel** users navigate an overwhelming selection of products and services. On online platforms, users have the opportunity to share feedback in various modes, including numerical ratings, textual reviews, and likes/dislikes. Traditional recommendation systems rely on users explicit ratings or implicit interactions (e.g. likes, clicks, shares, saves) to le… ▽ More

    Submitted 11 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: The first two authors contributed equally

  29. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhi**g Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Hai** Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  30. arXiv:2405.04727  [pdf, other

    cs.IR

    LLMs Can Patch Up Missing Relevance Judgments in Evaluation

    Authors: Shivani Upadhyay, Ehsan Kamalloo, Jimmy Lin

    Abstract: Unjudged documents or holes in information retrieval benchmarks are considered non-relevant in evaluation, yielding no gains in measuring effectiveness. However, these missing judgments may inadvertently introduce biases into the evaluation as their prevalence for a retrieval model is heavily contingent on the pooling process. Thus, filling holes becomes crucial in ensuring reliable and accurate e… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures

  31. arXiv:2405.01525  [pdf, other

    cs.CL cs.AI

    FLAME: Factuality-Aware Alignment for Large Language Models

    Authors: Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen

    Abstract: Alignment is a standard procedure to fine-tune pre-trained large language models (LLMs) to follow natural language instructions and serve as helpful AI assistants. We have observed, however, that the conventional alignment process fails to enhance the factual accuracy of LLMs, and often leads to the generation of more false facts (i.e. hallucination). In this paper, we study how to make the LLM al… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  32. arXiv:2405.01481  [pdf, other

    cs.CL cs.AI cs.LG

    NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

    Authors: Gerald Shen, Zhilin Wang, Olivier Delalleau, Jiaqi Zeng, Yi Dong, Daniel Egert, Shengyang Sun, Jimmy Zhang, Sahil Jain, Ali Taghibakhshi, Markel Sanz Ausin, Ashwath Aithal, Oleksii Kuchaiev

    Abstract: Aligning Large Language Models (LLMs) with human values and preferences is essential for making them helpful and safe. However, building efficient tools to perform alignment can be challenging, especially for the largest and most competent LLMs which often contain tens or hundreds of billions of parameters. We create NeMo-Aligner, a toolkit for model alignment that can efficiently scale to using h… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures

  33. arXiv:2404.19321  [pdf, other

    cond-mat.soft physics.app-ph

    Observation of strain-rate softening behavior in jammed granular media

    Authors: Mingchao Liu, Weining Mao, Yiqiu Zhao, Qin Xu, Yixiang Gan, Yifan Wang, K Jimmy Hsia

    Abstract: The strain-rate sensitivity of confined granular materials has been widely explored, with most findings exhibiting rate-strengthening behaviors. This study, however, reveals a distinct rate-softening behavior across a certain strain rate range based on triaxial tests on particle clusters of various materials with different surface properties, particle sizes, shapes, and stiffness. This softening e… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 16 pages, 12 figures

  34. arXiv:2404.19165  [pdf, other

    cs.NE cs.ET cs.LG

    DelGrad: Exact gradients in spiking networks for learning transmission delays and weights

    Authors: Julian Göltz, Jimmy Weber, Laura Kriener, Peter Lake, Melika Payvand, Mihai A. Petrovici

    Abstract: Spiking neural networks (SNNs) inherently rely on the timing of signals for representing and processing information. Transmission delays play an important role in sha** these temporal characteristics. Recent work has demonstrated the substantial advantages of learning these delays along with synaptic weights, both in terms of accuracy and memory efficiency. However, these approaches suffer from… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 15 pages, 7 figures

  35. arXiv:2404.18424  [pdf, other

    cs.IR

    PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval

    Authors: Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon

    Abstract: Utilizing large language models (LLMs) for zero-shot document ranking is done in one of two ways: 1) prompt-based re-ranking methods, which require no further training but are only feasible for re-ranking a handful of candidate documents due to computational costs; and 2) unsupervised contrastive trained dense retrieval methods, which can retrieve relevant documents from the entire corpus but requ… ▽ More

    Submitted 16 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  36. arXiv:2404.15807  [pdf, other

    cs.CL

    One Subgraph for All: Efficient Reasoning on Opening Subgraphs for Inductive Knowledge Graph Completion

    Authors: Zhiwen Xie, Yi Zhang, Guangyou Zhou, ** Liu, Xinhui Tu, Jimmy Xiangji Huang

    Abstract: Knowledge Graph Completion (KGC) has garnered massive research interest recently, and most existing methods are designed following a transductive setting where all entities are observed during training. Despite the great progress on the transductive KGC, these methods struggle to conduct reasoning on emerging KGs involving unseen entities. Thus, inductive KGC, which aims to deduce missing links am… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  37. arXiv:2404.15279  [pdf, other

    eess.SP cs.AI

    Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification

    Authors: Jimmy Lin, Junkai Li, Jiasi Gao, Weizhi Ma, Yang Liu

    Abstract: Tactile signals collected by wearable electronics are essential in modeling and understanding human behavior. One of the main applications of tactile signals is action classification, especially in healthcare and robotics. However, existing tactile classification methods fail to capture the spatial and temporal features of tactile signals simultaneously, which results in sub-optimal performances.… ▽ More

    Submitted 20 January, 2024; originally announced April 2024.

    Comments: Accepted by AAAI 2024

  38. arXiv:2404.10981  [pdf, other

    cs.IR cs.AI cs.CL

    A Survey on Retrieval-Augmented Text Generation for Large Language Models

    Authors: Yizheng Huang, Jimmy Huang

    Abstract: Retrieval-Augmented Generation (RAG) merges retrieval methods with deep learning advancements to address the static limitations of large language models (LLMs) by enabling the dynamic integration of up-to-date external information. This methodology, focusing primarily on the text domain, provides a cost-effective solution to the generation of plausible but incorrect responses by LLMs, thereby enha… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Ongoing work

  39. arXiv:2404.05386  [pdf, other

    cs.IR

    MealRec$^+$: A Meal Recommendation Dataset with Meal-Course Affiliation for Personalization and Healthiness

    Authors: Ming Li, Lin Li, Xiaohui Tao, Jimmy Xiangji Huang

    Abstract: Meal recommendation, as a typical health-related recommendation task, contains complex relationships between users, courses, and meals. Among them, meal-course affiliation associates user-meal and user-course interactions. However, an extensive literature review demonstrates that there is a lack of publicly available meal recommendation datasets including meal-course affiliation. Meal recommendati… ▽ More

    Submitted 27 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  40. arXiv:2403.18639  [pdf, other

    cs.DC cs.LG

    Dependency Aware Incident Linking in Large Cloud Systems

    Authors: Supriyo Ghosh, Karish Grover, Jimmy Wong, Chetan Bansal, Rakesh Namineni, Mohit Verma, Saravan Rajmohan

    Abstract: Despite significant reliability efforts, large-scale cloud services inevitably experience production incidents that can significantly impact service availability and customer's satisfaction. Worse, in many cases one incident can lead to multiple downstream failures due to cascading effects that creates several related incidents across different dependent services. Often time On-call Engineers (OCE… ▽ More

    Submitted 5 February, 2024; originally announced March 2024.

  41. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important step** stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  42. arXiv:2403.11407  [pdf, other

    stat.ML cs.LG

    Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors

    Authors: Yazid Janati, Alain Durmus, Eric Moulines, Jimmy Olsson

    Abstract: Interest in the use of Denoising Diffusion Models (DDM) as priors for solving inverse Bayesian problems has recently increased significantly. However, sampling from the resulting posterior distribution poses a challenge. To solve this problem, previous works have proposed approximations to bias the drift term of the diffusion. In this work, we take a different approach and utilize the specific str… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: preprint

  43. arXiv:2403.09969  [pdf, other

    cs.LG

    Prediction of Vessel Arrival Time to Pilotage Area Using Multi-Data Fusion and Deep Learning

    Authors: Xiaocai Zhang, Xiuju Fu, Zhe Xiao, Haiyan Xu, Xiaoyang Wei, Jimmy Koh, Daichi Ogawa, Zheng Qin

    Abstract: This paper investigates the prediction of vessels' arrival time to the pilotage area using multi-data fusion and deep learning approaches. Firstly, the vessel arrival contour is extracted based on Multivariate Kernel Density Estimation (MKDE) and clustering. Secondly, multiple data sources, including Automatic Identification System (AIS), pilotage booking information, and meteorological data, are… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: The 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  44. arXiv:2403.08146  [pdf, ps, other

    math.AP math.DG

    Nodal solutions to Paneitz-type equations

    Authors: Jurgen Julio-Batalla, Jimmy Petean

    Abstract: On a closed Riemannian manifold $(M^n ,g)$ with a proper isoparametric function $f$ we consider the equation $Δ^2 u -αΔu +βu = u^q$, where $α$ and $β$ are positive constants satisfying that $α^2 \geq 4 β$. We let ${\bf m}$ be the minimum of the dimensions of the focal varieties of $f$ and $q_f = \frac{n-{\bf m}+4}{n-{\bf m}-4}$, $q_f = \infty$ if $n\leq {\bf m}+4$. We prove the existence of infini… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  45. arXiv:2403.07495  [pdf, other

    stat.CO stat.ME stat.ML

    Tuning diagonal scale matrices for HMC

    Authors: Jimmy Huy Tran, Tore Selland Kleppe

    Abstract: Three approaches for adaptively tuning diagonal scale matrices for HMC are discussed and compared. The common practice of scaling according to estimated marginal standard deviations is taken as a benchmark. Scaling according to the mean log-target gradient (ISG), and a scaling method targeting that the frequency of when the underlying Hamiltonian dynamics crosses the respective medians should be u… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  46. arXiv:2403.03218  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer , et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in develo** biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are develo** evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  47. arXiv:2403.00784  [pdf, other

    cs.IR cs.AI cs.CL

    Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges

    Authors: Jiajia Wang, Jimmy X. Huang, Xinhui Tu, Junmei Wang, Angela J. Huang, Md Tahmid Rahman Laskar, Amran Bhuiyan

    Abstract: Recent years have witnessed a substantial increase in the use of deep learning to solve various natural language processing (NLP) problems. Early deep learning models were constrained by their sequential or unidirectional nature, such that they struggled to capture the contextual relationships across text inputs. The introduction of bidirectional encoder representations from transformers (BERT) le… ▽ More

    Submitted 18 February, 2024; originally announced March 2024.

  48. arXiv:2402.18545  [pdf, other

    cs.CY

    Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset

    Authors: Abbi Ward, Jimmy Li, Julie Wang, Sriram Lakshminarasimhan, Ashley Carrick, Bilson Campana, Jay Hartford, Pradeep Kumar S, Tiya Tiyasirichokchai, Sunny Virmani, Renee Wong, Yossi Matias, Greg S. Corrado, Dale R. Webster, Dawn Siegel, Steven Lin, Justin Ko, Alan Karthikesalingam, Christopher Semturs, Pooja Rao

    Abstract: Background: Health datasets from clinical sources do not reflect the breadth and diversity of disease in the real world, impacting research, medical education, and artificial intelligence (AI) tool development. Dermatology is a suitable area to develop and test a new and scalable method to create representative health datasets. Methods: We used Google Search advertisements to invite contribution… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  49. arXiv:2402.11203  [pdf, ps, other

    cs.IR cs.AI cs.CL cs.LG

    Exploring ChatGPT for Next-generation Information Retrieval: Opportunities and Challenges

    Authors: Yizheng Huang, Jimmy Huang

    Abstract: The rapid advancement of artificial intelligence (AI) has highlighted ChatGPT as a pivotal technology in the field of information retrieval (IR). Distinguished from its predecessors, ChatGPT offers significant benefits that have attracted the attention of both the industry and academic communities. While some view ChatGPT as a groundbreaking innovation, others attribute its success to the effectiv… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: Survey Paper

    Journal ref: Web Intelligence, vol. 22, no. 1, pp. 31-44, 2024

  50. arXiv:2402.10519  [pdf

    cond-mat.mtrl-sci

    A Simple Modeling for Gas Release During Annealing of Irradiated Nuclear Fuel

    Authors: Jimmy Losfeld, Lionel Desgranges, Yves Pontillon, Gianguido Baldinozzi

    Abstract: We have developed a gas flow model in the spent nuclear fuel during the annealing. It postulates that the gas release during an isothermal plateau at 1200{\textdegree}C corresponds to the equilibrium between overpressure gas reservoirs in the fuel sample connected to the free surface at atmospheric pressure.

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: Transactions of the American Nuclear Society, 2023, 128 (1), pp.404-407