Skip to main content

Showing 1–50 of 238 results for author: Rae, J

.
  1. arXiv:2407.04925  [pdf, other

    cs.IR cs.AI cs.HC

    RAMO: Retrieval-Augmented Generation for Enhancing MOOCs Recommendations

    Authors: Jiarui Rao, Jionghao Lin

    Abstract: Massive Open Online Courses (MOOCs) have significantly enhanced educational accessibility by offering a wide variety of courses and breaking down traditional barriers related to geography, finance, and time. However, students often face difficulties navigating the vast selection of courses, especially when exploring new fields of study. Driven by this challenge, researchers have been exploring cou… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 7 pages, this paper underwent a rigorous review process and was officially accepted on May 31, 2024, for presentation at the Educational Data Mining 2024 Workshop: Leveraging Large Language Models for Next Generation Educational Technologies

  2. arXiv:2407.03449  [pdf, other

    eess.SP

    A Tutorial on Fluid Antenna System for 6G Networks: Encompassing Communication Theory, Optimization Methods and Hardware Designs

    Authors: Wee Kiat New, Kai-Kit Wong, Hao Xu, Chao Wang, Farshad Rostami Ghadi, Jichen Zhang, Junhui Rao, Ross Murch, Pablo Ramírez-Espinosa, David Morales-Jimenez, Chan-Byoung Chae, Kin-Fai Tong

    Abstract: The advent of the sixth-generation (6G) networks presents another round of revolution for the mobile communication landscape, promising an immersive experience, robust reliability, minimal latency, extreme connectivity, ubiquitous coverage, and capabilities beyond communication, including intelligence and sensing. To achieve these ambitious goals, it is apparent that 6G networks need to incorporat… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 50 pages, 45 figures, 5 tables. Submitted for potential publication

  3. arXiv:2407.00137  [pdf, other

    physics.space-ph physics.ao-ph

    Understanding and Modeling the Dynamics of Storm-time Atmospheric Neutral Density using Random Forests

    Authors: Kyle R. Murphy, Alexa J. Halford, Vivian Liu, Jeffery Klenzing, Jonathon Smith, Katherine Garcia-Sage, Joshua Pettit, I. Jonathan Rae

    Abstract: Atmospheric neutral density is a crucial component to accurately predict and track the motion of satellites. During periods of elevated solar and geomagnetic activity atmospheric neutral density becomes highly variable and dynamic. This variability and enhanced dynamics make it difficult to accurately model neutral density leading to increased errors which propagate from neutral density models thr… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Submitted for publication to Space Weather

  4. arXiv:2406.18530  [pdf, other

    cs.CV

    MatchTime: Towards Automatic Soccer Game Commentary Generation

    Authors: Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie

    Abstract: Soccer is a globally popular sport with a vast audience, in this paper, we consider constructing an automatic soccer game commentary model to improve the audiences' viewing experience. In general, we make the following contributions: First, observing the prevalent video-text misalignment in existing datasets, we manually annotate timestamps for 49 matches, establishing a more robust benchmark for… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Technical Report; Project Page: https://haoningwu3639.github.io/MatchTime/

  5. arXiv:2406.16227  [pdf, other

    stat.ML cs.LG stat.ME

    VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data

    Authors: Paul D. W. Kirk, Jackie Rao

    Abstract: Effective clustering of biomedical data is crucial in precision medicine, enabling accurate stratifiction of patients or samples. However, the growth in availability of high-dimensional categorical data, including `omics data, necessitates computationally efficient clustering algorithms. We present VICatMix, a variational Bayesian finite mixture model designed for the clustering of categorical dat… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  6. arXiv:2406.05499  [pdf, other

    eess.SP

    A Pixel-based Reconfigurable Antenna Design for Fluid Antenna Systems

    Authors: Jichen Zhang, Junhui Rao, Zhaoyang Ming, Zan Li, Chi-Yuk Chiu, Kai-Kit Wong, Kin-Fai Tong, Ross Murch

    Abstract: Fluid Antenna Systems (FASs) have recently been proposed for enhancing the performance of wireless communication. Previous antenna designs to meet the requirements of FAS have been based on mechanically movable or liquid antennas and therefore have limited reconfiguration speeds. In this paper, we propose a design for a pixel-based reconfigurable antenna (PRA) that meets the requirements of FAS an… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: 13 pages, 16 figures, Submitted to IEEE Transations on Antennas and Propagation

  7. arXiv:2406.02975  [pdf, other

    eess.SP

    A Shared-Aperture Dual-Band sub-6 GHz and mmWave Reconfigurable Intelligent Surface With Independent Operation

    Authors: Junhui Rao, Yujie Zhang, Shiwen Tang, Zan Li, Zhaoyang Ming, Jichen Zhang, Chi Yuk Chiu, Ross Murch

    Abstract: A novel dual-band reconfigurable intelligent surface (DBI-RIS) design that combines the functionalities of millimeter-wave (mmWave) and sub-6 GHz bands within a single aperture is proposed. This design aims to bridge the gap between current single-band reconfigurable intelligent surfaces (RISs) and wireless systems utilizing sub-6 GHz and mmWave bands that require RIS with independently reconfigur… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  8. arXiv:2405.13314  [pdf, other

    astro-ph.HE

    Simulation Study on Constraining GW Propagation Speed by GW and GRB Joint Observation on Binary Neutron Star Mergers

    Authors: **-Hui Rao, Shu-Xu Yi, Lian Tao, Qing-Wen Tang

    Abstract: Theories of modified gravity suggest that the propagation speed of gravitational wave (GW) $v_g$ may deviate from the speed of light $c$. A constraint can be placed on the difference between $c$ and $v_g$ with a simple method that uses the arrival time delay between GW and electromagnetic (EM) wave simultaneously emitted from a burst event. We simulated the joint observation of GW and short Gamma-… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  9. arXiv:2405.13021  [pdf, other

    cs.CL cs.AI cs.IR

    IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

    Authors: Diji Yang, **meng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Jie Yang, Yi Zhang

    Abstract: Although the Retrieval-Augmented Generation (RAG) paradigms can use external knowledge to enhance and ground the outputs of Large Language Models (LLMs) to mitigate generative hallucinations and static knowledge base problems, they still suffer from limited flexibility in adopting Information Retrieval (IR) systems with varying capabilities, constrained interpretability during the multi-round retr… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Proceedings of the 47th International ACM SIGIR 2024

  10. arXiv:2404.18413  [pdf, other

    cs.CV cs.AI

    3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset

    Authors: Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao, Min Zhang

    Abstract: Multimodal machine translation (MMT) is a challenging task that seeks to improve translation quality by incorporating visual information. However, recent studies have indicated that the visual information provided by existing MMT datasets is insufficient, causing models to disregard it and overestimate their capabilities. This issue presents a significant obstacle to the development of MMT researc… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  11. arXiv:2404.07503  [pdf, other

    cs.CL

    Best Practices and Lessons Learned on Synthetic Data for Language Models

    Authors: Ruibo Liu, Jerry Wei, Fangyu Liu, Chenglei Si, Yanzhe Zhang, **meng Rao, Steven Zheng, Daiyi Peng, Diyi Yang, Denny Zhou, Andrew M. Dai

    Abstract: The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs. Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns. This paper provides an overview of synthetic data research, discussing its applications, challeng… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  12. arXiv:2403.10504  [pdf, other

    cs.DC cs.SE

    ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment

    Authors: Xiaofeng Wu, Jia Rao, Wei Chen

    Abstract: The advent of the Transformer architecture has propelled the growth of natural language processing (NLP) models, leading to remarkable achievements in numerous NLP tasks. Yet, the absence of specialized hardware like expansive GPU memory and high-speed interconnects poses challenges for training large-scale models. This makes it daunting for many users to experiment with pre-training and fine-tuni… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  13. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  14. Simulation Studies for the First Pathfinder of the CATCH Space Mission

    Authors: Yiming Huang, Juan Zhang, Lian Tao, Zhengwei Li, Donghua Zhao, Qian-Qing Yin, Xiangyang Wen, **gyu Xiao, Chen Zhang, Shuang-Nan Zhang, Shaolin Xiong, Qingcui Bu, Jirong Cang, Dezhi Cao, Wen Chen, Siran Ding, Min Gao, Yang Gao, Shu** Hou, Li** Jia, Ge **, Dalin Li, **song Li, Pan** Li, Yajun Li , et al. (20 additional authors not shown)

    Abstract: The Chasing All Transients Constellation Hunters (CATCH) space mission is an intelligent constellation consisting of 126 micro-satellites in three types (A, B, and C), designed for X-ray observation with the objective of studying the dynamic universe. Currently, we are actively develo** the first Pathfinder (CATCH-1) for the CATCH mission, specifically for type-A satellites. CATCH-1 is equipped… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  15. arXiv:2402.14538  [pdf, other

    stat.AP econ.EM

    Interference Produces False-Positive Pricing Experiments

    Authors: Lars Roemheld, Justin Rao

    Abstract: It is standard practice in online retail to run pricing experiments by randomizing at the article-level, i.e. by changing prices of different products to identify treatment effects. Due to customers' cross-price substitution behavior, such experiments suffer from interference bias: the observed difference between treatment groups in the experiment is typically significantly larger than the global… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  16. arXiv:2402.08562  [pdf, other

    cs.CL cs.AI

    Higher Layers Need More LoRA Experts

    Authors: Chongyang Gao, Kezhen Chen, **meng Rao, Baochen Sun, Ruibo Liu, Daiyi Peng, Yawen Zhang, Xiaoyuan Guo, Jie Yang, VS Subrahmanian

    Abstract: Parameter-efficient tuning (PEFT) techniques like low-rank adaptation (LoRA) offer training efficiency on Large Language Models, but their impact on model performance remains limited. Recent efforts integrate LoRA and Mixture-of-Experts (MoE) to improve the performance of PEFT methods. Despite promising results, research on improving the efficiency of LoRA with MoE is still in its early stages. Re… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: The code is available at https://github.com/GCYZSL/MoLA

  17. arXiv:2402.07427  [pdf

    physics.flu-dyn physics.app-ph

    Insights into Spatio-temporal dynamics during shock -- droplet flame interaction

    Authors: Gautham Vadlamudi, Akhil Aravind, Saini Jatin Rao, Saptarshi Basu

    Abstract: The study comprehensively investigates the response of a combusting droplet during its interaction with a high-speed transient flow that is imposed by a coaxially propagating blast wave. The blast wave is generated using a specially designed unique miniature shock generation apparatus that generates blast waves using the wire-explosion technique which facilitates a wide range of shock Mach number… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  18. arXiv:2402.04710  [pdf, other

    cs.LG

    Incorporating Retrieval-based Causal Learning with Information Bottlenecks for Interpretable Graph Neural Networks

    Authors: Jiahua Rao, Jiancong Xie, Han**g Lin, Shuangjia Zheng, Zhen Wang, Yuedong Yang

    Abstract: Graph Neural Networks (GNNs) have gained considerable traction for their capability to effectively process topological data, yet their interpretability remains a critical concern. Current interpretation methods are dominated by post-hoc explanations to provide a transparent and intuitive understanding of GNNs. However, they have limited performance in interpreting complicated subgraphs and can't u… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  19. arXiv:2401.13154  [pdf, other

    cs.OS

    Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration

    Authors: Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang

    Abstract: With the advent of byte-addressable memory devices, such as CXL memory, persistent memory, and storage-class memory, tiered memory systems have become a reality. Page migration is the de facto method within operating systems for managing tiered memory. It aims to bring hot data whenever possible into fast memory to optimize the performance of data accesses while using slow memory to accommodate da… ▽ More

    Submitted 17 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  20. arXiv:2401.12068  [pdf, other

    cs.SD cs.LG eess.AS

    Resource-constrained stereo singing voice cancellation

    Authors: Clara Borrelli, James Rae, Dogac Basaran, Matt McVicar, Mehrez Souden, Matthias Mauch

    Abstract: We study the problem of stereo singing voice cancellation, a subtask of music source separation, whose goal is to estimate an instrumental background from a stereo mix. We explore how to achieve performance similar to large state-of-the-art source separation networks starting from a small, efficient model for real-time speech separation. Such a model is useful when memory and compute are limited a… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  21. arXiv:2401.07462  [pdf, other

    hep-ex physics.ins-det

    Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments

    Authors: S. M. Lee, G. Adhikari, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Fran. a, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim , et al. (37 additional authors not shown)

    Abstract: We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced… ▽ More

    Submitted 10 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures

    Journal ref: Eur. Phys. J. C 84 (2024) 484

  22. arXiv:2401.07032  [pdf, other

    physics.flu-dyn

    Interaction of Vortex Ring with Perforated V-Wall

    Authors: Siddhant Jain, Saini Jatin Rao, Saptarshi Basu

    Abstract: Experiments are performed to investigate the interaction of a vortex ring (Reynolds number based on circulation (Re gamma = 11500) with perforated surface (open area ratio, phi1 = 0.24 and phi2 = 0.44) with different included angles (theta = 60deg - 180deg ). The phenomenon is characterized using techniques like Planer Laser-Induced Fluorescence (PLIF) imaging and Particle Image Velocimetry (PIV).… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  23. arXiv:2401.07027  [pdf, other

    physics.flu-dyn

    Dynamics of Soap Bubble Inflation

    Authors: Saini Jatin Rao, Siddhant Jain, Saptarshi Basu

    Abstract: Bubbles have always captivated our curiosity with their aesthetics and complexities alike. While the act of blowing bubbles is familiar to everyone, the underlying physics of these fleeting spheres often eludes reasoning. In this letter, we discuss the dynamics of inflating a soap bubble using controlled airflow through a film-coated nozzle. We assess and predict the rate of inflation by varying t… ▽ More

    Submitted 8 February, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

  24. arXiv:2312.12130  [pdf, other

    hep-lat nucl-ex nucl-th

    Cumulants and ordering of their ratios in 2D Potts models: Lessons for QCD?

    Authors: Rajiv V. Gavai, Bedangadas Mohanty, Jaydev Singh Rao, Swati Saha

    Abstract: Theoretical considerations suggest an ordering of the ratios of net-baryon number fluctuations in the vicinity of the transition from the low-temperature hadronic phase to the high temperature quark-gluon plasma phase at small values of the baryon chemical potential, $μ_B$, in the QCD phase diagram. The ordering hierarchy is $\frac{χ_6}{χ_2} < \frac{χ_5}{χ_1} < \frac{χ_4}{χ_2} < \frac{χ_3}{χ_1}$,… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  25. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  26. arXiv:2312.05752  [pdf, other

    cs.CV

    Camera-based 3D Semantic Scene Completion with Sparse Guidance Network

    Authors: Jianbiao Mei, Yu Yang, Mengmeng Wang, Junyu Zhu, Xiangrui Zhao, Jongwon Ra, Laijian Li, Yong Liu

    Abstract: Semantic scene completion (SSC) aims to predict the semantic occupancy of each voxel in the entire 3D scene from limited observations, which is an emerging and critical task for autonomous driving. Recently, many studies have turned to camera-based SSC solutions due to the richer visual cues and cost-effectiveness of cameras. However, existing methods usually rely on sophisticated and heavy 3D mod… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  27. arXiv:2312.01151  [pdf

    cs.CY cs.CL cs.SC

    Here Is Not There: Measuring Entailment-Based Trajectory Similarity for Location-Privacy Protection and Beyond

    Authors: Zilong Liu, Krzysztof Janowicz, Kitty Currier, Meilin Shi, **meng Rao, Song Gao, Ling Cai, Anita Graser

    Abstract: While the paths humans take play out in social as well as physical space, measures to describe and compare their trajectories are carried out in abstract, typically Euclidean, space. When these measures are applied to trajectories of actual individuals in an application area, alterations that are inconsequential in abstract space may suddenly become problematic once overlaid with geographic realit… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  28. arXiv:2311.05010  [pdf, other

    astro-ph.IM physics.ins-det

    Alpha backgrounds in NaI(Tl) crystals of COSINE-100

    Authors: G. Adhikari, N. Carlin, D. F. F. S. Cavalcante, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim , et al. (38 additional authors not shown)

    Abstract: COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Ca… ▽ More

    Submitted 30 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  29. arXiv:2310.13248  [pdf, other

    cs.LG cs.AI cs.CY cs.SI

    FLEE-GNN: A Federated Learning System for Edge-Enhanced Graph Neural Network in Analyzing Geospatial Resilience of Multicommodity Food Flows

    Authors: Yuxiao Qu, **meng Rao, Song Gao, Qianheng Zhang, Wei-Lun Chao, Yu Su, Michelle Miller, Alfonso Morales, Patrick Huber

    Abstract: Understanding and measuring the resilience of food supply networks is a global imperative to tackle increasing food insecurity. However, the complexity of these networks, with their multidimensional interactions and decisions, presents significant challenges. This paper proposes FLEE-GNN, a novel Federated Learning System for Edge-Enhanced Graph Neural Network, designed to overcome these challenge… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 10 pages, 5 figures

    ACM Class: I.2

    Journal ref: ACM SIGSPATIAL GeoAI 2023

  30. arXiv:2310.05286  [pdf, other

    cs.LG cs.AI cs.HC

    Generalizable Error Modeling for Search Relevance Data Annotation Tasks

    Authors: Heinrich Peters, Alireza Hashemi, James Rae

    Abstract: Human data annotation is critical in sha** the quality of machine learning (ML) and artificial intelligence (AI) systems. One significant challenge in this context is posed by annotation errors, as their effects can degrade the performance of ML models. This paper presents a predictive error model trained to detect potential errors in search relevance annotation tasks for three industry-scale ML… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  31. arXiv:2310.00413  [pdf, other

    cs.CV cs.LG eess.IV

    SSIF: Learning Continuous Image Representation for Spatial-Spectral Super-Resolution

    Authors: Gengchen Mai, Ni Lao, Weiwei Sun, Yuchi Ma, Jiaming Song, Chenlin Meng, Hongxu Ma, **meng Rao, Ziyuan Li, Stefano Ermon

    Abstract: Existing digital sensors capture images at fixed spatial and spectral resolutions (e.g., RGB, multispectral, and hyperspectral images), and each combination requires bespoke machine learning models. Neural Implicit Functions partially overcome the spatial resolution challenge by representing an image in a resolution-independent way. However, they still operate at fixed, pre-defined spectral resolu… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    MSC Class: 68T07; 68T45 ACM Class: I.4.10; I.2.10; I.4.6

  32. Building Privacy-Preserving and Secure Geospatial Artificial Intelligence Foundation Models

    Authors: **meng Rao, Song Gao, Gengchen Mai, Krzysztof Janowicz

    Abstract: In recent years we have seen substantial advances in foundation models for artificial intelligence, including language, vision, and multimodal models. Recent studies have highlighted the potential of using foundation models in geospatial artificial intelligence, known as GeoAI Foundation Models, for geographic question answering, remote sensing image understanding, map generation, and location-bas… ▽ More

    Submitted 12 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 1 figure

    ACM Class: I.2.0

    Journal ref: ACM SIGSPATIAL 2023

  33. arXiv:2309.12916  [pdf, other

    cond-mat.mtrl-sci math.NA

    Meso-scale size effects of material heterogeneities on crack propagation in brittle solids: Perspectives from phase-field simulations

    Authors: Liuchi Li, Jack Rao, Todd Hufnagel, KT Ramesh

    Abstract: Brittle solids are often toughened by adding a second-phase material. This practice often results in composites with material heterogeneities on the meso scale: large compared to the scale of the process zone but small compared to that of the application. The specific configuration (both geometrical and mechanical) of this mesoscale heterogeneity is generally recognized as important in determining… ▽ More

    Submitted 19 February, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  34. arXiv:2309.11587  [pdf, other

    cs.LG cs.AI cs.CR

    CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches

    Authors: **meng Rao, Song Gao, Sijia Zhu

    Abstract: The prevalence of ubiquitous location-aware devices and mobile Internet enables us to collect massive individual-level trajectory dataset from users. Such trajectory big data bring new opportunities to human mobility research but also raise public concerns with regard to location privacy. In this work, we present the Conditional Adversarial Trajectory Synthesis (CATS), a deep-learning-based GeoAI… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 9 figures, 4 figures

    ACM Class: I.2

    Journal ref: International Journal of Geographical Information Science; 2023

  35. arXiv:2309.04041  [pdf, other

    cs.CV cs.CL

    Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models

    Authors: Jiaying Lu, **meng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang

    Abstract: Large Vision-Language Models (LVLMs) offer remarkable benefits for a variety of vision-language tasks. However, a challenge hindering their application in real-world scenarios, particularly regarding safety, robustness, and reliability, is their constrained semantic grounding ability, which pertains to connecting language to the physical-world entities or concepts referenced in images. Therefore,… ▽ More

    Submitted 12 January, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted to the AAAI'24 Workshop on Responsible Language Models (ReLM 2024)

  36. arXiv:2308.12898  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

    Authors: Fei Wang, Liang Ding, Jun Rao, Ye Liu, Li Shen, Changxing Ding

    Abstract: The multimedia community has shown a significant interest in perceiving and representing the physical world with multimodal pretrained neural network models, and among them, the visual-language pertaining (VLP) is, currently, the most captivating topic. However, there have been few endeavors dedicated to the exploration of 1) whether essential linguistic knowledge (e.g., semantics and syntax) can… ▽ More

    Submitted 25 August, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: [TL;DR] we design and release the SNARE, the first large-scale multimodal alignment probing benchmark for current vision-language pretrained models

  37. arXiv:2308.09970  [pdf, other

    cs.CL cs.AI cs.LG

    Tackling Vision Language Tasks Through Learning Inner Monologues

    Authors: Diji Yang, Kezhen Chen, **meng Rao, Xiaoyuan Guo, Yawen Zhang, Jie Yang, Yi Zhang

    Abstract: Visual language tasks require AI models to comprehend and reason with both visual and textual content. Driven by the power of Large Language Models (LLMs), two prominent methods have emerged: (1) the hybrid integration between LLMs and Vision-Language Models (VLMs), where visual inputs are firstly converted into language descriptions by VLMs, serving as inputs for LLMs to generate final answer(s);… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

  38. arXiv:2307.10678  [pdf, other

    physics.flu-dyn eess.IV

    Depth from Defocus Technique: A Simple Calibration-Free Approach for Dispersion Size Measurement

    Authors: Saini Jatin Rao, Shubham Sharma, Saptarshi Basu, Cameron Tropea

    Abstract: Particle size measurement is crucial in various applications, be it sizing droplets in inkjet printing or respiratory events, tracking particulate ejection in hypersonic impacts, or detecting floating target markers in free surface flows. Such systems are characterised by extracting quantitative information like size, position, velocity and number density of the dispersed particles, which is typic… ▽ More

    Submitted 3 October, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  39. arXiv:2307.09814  [pdf, other

    hep-ex physics.ins-det

    Search for inelastic WIMP-iodine scattering with COSINE-100

    Authors: G. Adhikari, N. Carlin, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, J. H. Jo, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, Y. J. Ko, D. H. Lee , et al. (34 additional authors not shown)

    Abstract: We report the results of a search for inelastic scattering of weakly interacting massive particles (WIMPs) off $^{127}$I nuclei using NaI(Tl) crystals with a data exposure of 97.7 kg$\cdot$years from the COSINE-100 experiment. The signature of inelastic WIMP-$^{127}$I scattering is a nuclear recoil accompanied by a 57.6 keV $γ$-ray from the prompt deexcitation, producing a more energetic signal co… ▽ More

    Submitted 30 October, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 8 pages, 5 figures. arXiv admin note: text overlap with arXiv:2104.03537

    Journal ref: Phys. Rev. D 108, 092006 (2023)

  40. arXiv:2306.14657  [pdf, other

    cs.RO eess.SY

    A Diversity Analysis of Safety Metrics Comparing Vehicle Performance in the Lead-Vehicle Interaction Regime

    Authors: Harnarayan Singh, Bowen Weng, Sughosh J. Rao, Devin Elsasser

    Abstract: Vehicle performance metrics analyze data sets consisting of subject vehicle's interactions with other road users in a nominal driving environment and provide certain performance measures as outputs. To the best of the authors' knowledge, the vehicle safety performance metrics research dates back to at least 1967. To date, there still does not exist a community-wide accepted metric or a set of metr… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: A modified manuscript of this preprint has been accepted to be published as a regular paper at IEEE Transactions on Intelligent Transportation Systems

  41. arXiv:2306.04907  [pdf, other

    stat.ME stat.AP

    Estimation of Poverty Measures for Small Areas Under a Two-Fold Nested Error Linear Regression Model: Comparison of Two Methods

    Authors: Maryam Sohrabi, J. N. K. Rao

    Abstract: Demand for reliable statistics at a local area (small area) level has greatly increased in recent years. Traditional area-specific estimators based on probability samples are not adequate because of small sample size or even zero sample size in a local area. As a result, methods based on models linking the areas are widely used. World Bank focused on estimating poverty measures, in particular pove… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  42. arXiv:2306.04348  [pdf, other

    cond-mat.mes-hall

    Non-Hermitian Topological Magnonics

    Authors: Tao Yu, Ji Zou, Bowen Zeng, J. W. Rao, Ke Xia

    Abstract: Dissipation in mechanics, optics, acoustics, and electronic circuits is nowadays recognized to be not always detrimental but can be exploited to achieve non-Hermitian topological phases or properties with functionalities for potential device applications. As elementary excitations of ordered magnetic moments that exist in various magnetic materials, magnons are the information carriers in magnonic… ▽ More

    Submitted 9 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 101 pages, 35 figures

  43. arXiv:2306.02120  [pdf, other

    cond-mat.mes-hall physics.optics

    Giant Enhancement of Magnonic Frequency Combs by Exceptional Points

    Authors: Congyi Wang, **wei Rao, Zhijian Chen, Kaixin Zhao, Liaoxin Sun, Bimu Yao, Tao Yu, Yi-Pu Wang, Wei Lu

    Abstract: With their incomparable time-frequency accuracy, frequency combs have significantly advanced precision spectroscopy, ultra-sensitive detection, and atomic clocks. Traditional methods to create photonic, phononic, and magnonic frequency combs hinge on material nonlinearities which are often weak, necessitating high power densities to surpass their initiation thresholds, which subsequently limits th… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 7 pages, 4 figures

  44. Search for Boosted Dark Matter in COSINE-100

    Authors: G. Adhikari, N. Carlin, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, J. H. Jo, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, Y. J. Ko, D. H. Lee , et al. (34 additional authors not shown)

    Abstract: We search for energetic electron recoil signals induced by boosted dark matter (BDM) from the galactic center using the COSINE-100 array of NaI(Tl) crystal detectors at the Yangyang Underground Laboratory. The signal would be an excess of events with energies above 4 MeV over the well-understood background. Because no excess of events are observed in a 97.7 kg$\cdot$years exposure, we set limits o… ▽ More

    Submitted 30 October, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: 7 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 131, 201802 (2023)

  45. arXiv:2305.20047  [pdf, other

    cs.CV cs.AI

    LOWA: Localize Objects in the Wild with Attributes

    Authors: Xiaoyuan Guo, Kezhen Chen, **meng Rao, Yawen Zhang, Baochen Sun, Jie Yang

    Abstract: We present LOWA, a novel method for localizing objects with attributes effectively in the wild. It aims to address the insufficiency of current open-vocabulary object detectors, which are limited by the lack of instance-level attribute classification and rare class names. To train LOWA, we propose a hybrid vision-language training strategy to learn object detection and recognition with class names… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  46. arXiv:2305.19215  [pdf, other

    stat.ML cs.LG

    dotears: Scalable, consistent DAG estimation using observational and interventional data

    Authors: Albert Xue, **gyou Rao, Sriram Sankararaman, Harold Pimentel

    Abstract: New biological assays like Perturb-seq link highly parallel CRISPR interventions to a high-dimensional transcriptomic readout, providing insight into gene regulatory networks. Causal gene regulatory networks can be represented by directed acyclic graph (DAGs), but learning DAGs from observational data is complicated by lack of identifiability and a combinatorial solution space. Score-based structu… ▽ More

    Submitted 20 February, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

  47. arXiv:2305.08997  [pdf, ps, other

    stat.ME

    Bayesian predictive inference when integrating a non-probability sample and a probability sample

    Authors: Balgobin Nandram, JNK Rao

    Abstract: We consider the problem of integrating a small probability sample (ps) and a non-probability sample (nps). By definition, for the nps, there are no survey weights, but for the ps, there are survey weights. The key issue is that the nps, although much larger than the ps, can lead to a biased estimator of a finite population quantity but with much smaller variance. We begin with a relatively simple… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 35 pages, 1 figure

  48. arXiv:2305.08589  [pdf, other

    astro-ph.IM hep-ex

    In-orbit background simulation of a type-B CATCH satellite

    Authors: **gyu Xiao, Liqiang Qi, Shuang-Nan Zhang, Lian Tao, Zhengwei Li, Juan Zhang, Xiangyang Wen, Qian-Qing Yin, Yanji Yang, Qingcui Bu, Sheng Yang, Xiao**g Liu, Yiming Huang, Wen Chen, Yong Yang, Huaqiu Liu, Yibo Xu, Shujie Zhao, Xuan Zhang, Pan** Li, Kang Zhao, Ruican Ma, Qingchang Zhao, Rui**g Tang, **hui Rao , et al. (1 additional authors not shown)

    Abstract: The Chasing All Transients Constellation Hunters (CATCH) space mission plans to launch three types of micro-satellites (A, B, and C). The type-B CATCH satellites are dedicated to locating transients and detecting their time-dependent energy spectra. A type-B satellite is equipped with lightweight Wolter-I X-ray optics and an array of position-sensitive multi-pixel Silicon Drift Detectors. To optim… ▽ More

    Submitted 21 July, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 24 pages, 13 figures, 7 tables, accepted for publication in Experimental Astronomy

  49. arXiv:2304.13923  [pdf, other

    cs.CV cs.CL cs.MM

    Retrieval-based Knowledge Augmented Vision Language Pre-training

    Authors: Jiahua Rao, Zifei Shan, Longpo Liu, Yao Zhou, Yuedong Yang

    Abstract: With the recent progress in large-scale vision and language representation learning, Vision Language Pre-training (VLP) models have achieved promising improvements on various multi-modal downstream tasks. Albeit powerful, these models have not fully leveraged world knowledge to their advantage. A key challenge of knowledge-augmented VLP is the lack of clear connections between knowledge and multi-… ▽ More

    Submitted 6 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.09338 by other authors

  50. arXiv:2304.11948  [pdf

    cond-mat.mes-hall

    Perspective on non-Hermitian physics in magnetic systems

    Authors: Tao Yu, J. W. Rao

    Abstract: A perspective on non-Hermitian physics in magnetic systems is addressed in this short article, including exceptional points, exceptional nodal phases, the non-Hermitian SSH model, and the non-Hermitian skin effect.

    Submitted 23 August, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 5 pages. Submitted as a section of Magnonic Roadmap 2024