Showing 101–150 of 709,815 results for author: J.

  1. arXiv:2407.09161  [pdf, other]

    physics.optics cs.ET quant-ph

    Encoding arbitrary Ising Hamiltonians on Spatial Photonic Ising Machines

    Authors: Jason Sakellariou, Alexis Askitopoulos, Georgios Pastras, Symeon I. Tsintzos

    Abstract: Photonic Ising Machines constitute an emergent new paradigm of computation, geared towards tackling combinatorial optimization problems that can be reduced to the problem of finding the ground state of an Ising model. Spatial Photonic Ising Machines have proven to be advantageous for simulating fully connected large-scale spin systems. However, fine control of a general interaction matrix $J$ has…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 7 pages, 4 figures

  2. arXiv:2407.09159  [pdf, other]

    cs.CV

    Weakly-supervised Autism Severity Assessment in Long Videos

    Authors: Abid Ali, Mahmoud Ali, Jean-Marc Odobez, Camilla Barbini, Séverine Dubuisson, Francois Bremond, Susanne Thümmler

    Abstract: Autism Spectrum Disorder (ASD) is a diverse collection of neurobiological conditions marked by challenges in social communication and reciprocal interactions, as well as repetitive and stereotypical behaviors. Atypical behavior patterns in a long, untrimmed video can serve as biomarkers for children with ASD. In this paper, we propose a video-based weakly-supervised method that takes spatio-tempor…

    Submitted 12 July, 2024; originally announced July 2024.

    Journal ref: https://cbmi2024.org/

  3. arXiv:2407.09158  [pdf, ps, other]

    math.RA math.KT

    A non-abelian tensor product of algebras with bracket

    Authors: José Manuel Casas, Emzar Khmaladze, Manuel Ladra

    Abstract: We introduce and study a non-abelian tensor product of two algebras with bracket with compatible actions on each other. We investigate its applications to the universal central extensions and the low-dimensional homology of perfect algebras with bracket.

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 17 pages. arXiv admin note: text overlap with arXiv:2307.15636

    MSC Class: 16E40; 16E99; 16W99; 16B50

  4. arXiv:2407.09153  [pdf]

    cond-mat.mtrl-sci

    Topological Fermi-arc surface state covered by floating electrons on a two-dimensional electride

    Authors: Chan-young Lim, Min-Seok Kim, Dong Cheol Lim, Sunghun Kim, Yeonghoon Lee, Jaehoon Cha, Gyubin Lee, Sang Yong Song, Dinesh Thapa, Jonathan D. Denlinger, Seong-Gon Kim, Sung Wng Kim, Jungpil Seo, Yeongkwan Kim

    Abstract: Two-dimensional electrides can acquire topologically non-trivial phases due to intriguing interplay between the cationic atomic layers and anionic electron layers. However, experimental evidence of topological surface states has yet to be verified. Here, via angle-resolved photoemission spectroscopy (ARPES) and scanning tunnelling microscopy (STM), we probe the magnetic Weyl states of the ferromag…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 22 pages, 6 figures

    Journal ref: Nat. Commun. 15 (2024) 5615

  5. arXiv:2407.09146  [pdf, other]

    cs.LO math.AT math.CT

    Directed univalence in simplicial homotopy type theory

    Authors: Daniel Gratzer, Jonathan Weinberger, Ulrik Buchholtz

    Abstract: Simplicial type theory extends homotopy type theory with a directed path type which internalizes the notion of a homomorphism within a type. This concept has significant applications both within mathematics -- where it allows for synthetic (higher) category theory -- and programming languages -- where it leads to a directed version of the structure identity principle. In this work, we construct th…

    Submitted 12 July, 2024; originally announced July 2024.

    MSC Class: 03B38; 18N60; 18D30; 18B50; 18N45; 55U35; 18N50

    ACM Class: F.4.1

  6. arXiv:2407.09142  [pdf, other]

    cs.CR cs.DC cs.SE

    Securing Confidential Data For Distributed Software Development Teams: Encrypted Container File

    Authors: Tobias J. Bauer, Andreas Aßmuth

    Abstract: In the context of modern software engineering, there is a trend towards Cloud-native software development involving international teams with members from all over the world. Cloud-based version management services like GitHub are commonly used for source code and other files. However, a challenge arises when developers from different companies or organizations share the platform, as sensitive data…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 18 pages, for associated implementation etc., see https://github.com/Hirnmoder/ECF

    Journal ref: International Journal On Advances in Security, vol. 17, no. 1 and 2, pp. 11-28, 2024, ISSN 1942-2636

  7. arXiv:2407.09140  [pdf, other]

    astro-ph.IM

    FlyEye Ground-Based Telescope: Unveiling New Frontiers in Astronomical Science

    Authors: Carmelo Arcidiacono, Matteo Simioni, Roberto Ragazzoni, Piero Gregori, Paolo Lorenzi, Francesco Cerutti, Roberto Ziano, Matteo Bisiani, Roberta Pellegrini, Andrea Guazzora, Silvano Pieri, Marco Dima, Silvio Di Rosa, Simone Zaggia, Jacopo Farinato, Demetrio Magrin, Andrea Grazian, Marco Gullieuszik

    Abstract: The FlyEye design makes its debut in the ESA's NEOSTEL developed by OHB-Italia. This pioneering FlyEye telescope integrates a monolithic 1-meter class primary mirror feeding 16 CCD cameras for discovering Near-Earth Object (NEO) and any class of transient phenomena. OHB-Italia is the prime contractor, receiving extended support from the Italian National Institute for Astrophysics (INAF) in the ESA…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 9 pages, 1 figure, SPIE Astronomical Telescopes + Instrumentation, Ground-based and Airborne Instrumentation for Astronomy X, 16-21 June 2024

  8. arXiv:2407.09139  [pdf, other]

    hep-ex

    Measurement of $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays at Belle II

    Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, et al. (414 additional authors not shown)

    Abstract: We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We det…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures

    Report number: Belle II Preprint 2024-009, KEK Preprint 2024-1

  9. arXiv:2407.09136  [pdf, other]

    cs.CL cs.AI cs.LG

    Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors

    Authors: Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

    Abstract: Large language models (LLMs) present an opportunity to scale high-quality personalized education to all. A promising approach towards this means is to build dialog tutoring models that scaffold students' problem-solving. However, even though existing LLMs perform well in solving reasoning questions, they struggle to precisely detect student's errors and tailor their feedback to these errors. Inspi…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Preprint. Nico Daheim and Jakub Macina contributed equally. Code and dataset can be found under: https://github.com/eth-lre/verify-then-generate

  10. arXiv:2407.09132  [pdf, ps, other]

    astro-ph.IM

    The MICADO first light imager for the ELT: the PSF Reconstruction Software

    Authors: Andrea Grazian, Elisa Portaluri, Matteo Simioni, Carmelo Arcidiacono, Marco Gullieuszik, Johanna Hartke, Daniel Jodlbauer, Fernando Pedichini, Roberto Piazzesi, Piero Vaccari, Benedetta Vulcani, Roland Wagner, Anita Zanella

    Abstract: MICADO is the first-light camera of the ESO ELT, allowing NIR imaging and long-slit spectroscopy assisted by adaptive optics. MICADO is now entering its construction phase, and the software for data reduction is reaching an adequate maturity level. The PSF Reconstruction (PSF-R) of MICADO is a software tool for the blind derivation of the PSF, only using adaptive optics telemetry data. An update o…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 5 pages, 3 figures, Proceedings for the SPIE Astronomical Telescopes and Instrumentation 2024, Adaptive Optics Systems IX, Paper No.13097-234

  11. arXiv:2407.09130  [pdf, other]

    math.ST stat.ME

    On goodness-of-fit testing for self-exciting point processes

    Authors: José C. F. Kling, Mathias Vetter

    Abstract: Despite the wide usage of parametric point processes in theory and applications, a sound goodness-of-fit procedure to test whether a given parametric model is appropriate for data coming from a self-exciting point process has been missing in the literature. In this work, we establish a bootstrap-based goodness-of-fit test which empirically works for all kinds of self-exciting point processes (an…

    Submitted 12 July, 2024; originally announced July 2024.

  12. arXiv:2407.09127  [pdf, other]

    cs.LG

    Robustness of Explainable Artificial Intelligence in Industrial Process Modelling

    Authors: Benedikt Kantz, Clemens Staudinger, Christoph Feilmayr, Johannes Wachlmayr, Alexander Haberl, Stefan Schuster, Franz Pernkopf

    Abstract: eXplainable Artificial Intelligence (XAI) aims at providing understandable explanations of black box models. In this paper, we evaluate current XAI methods by scoring them based on ground truth simulations and sensitivity analysis. To this end, we used an Electric Arc Furnace (EAF) model to better understand the limits and robustness characteristics of XAI methods such as SHapley Additive exPlanat…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures, accepted at the ICML'24 Workshop ML4MS

  13. arXiv:2407.09121  [pdf, other]

    cs.CL cs.AI

    Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

    Authors: Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu

    Abstract: This study addresses a critical gap in safety tuning practices for Large Language Models (LLMs) by identifying and tackling a refusal position bias within safety tuning data, which compromises the models' ability to appropriately refuse generating unsafe content. We introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to harmful prompts at a…

    Submitted 12 July, 2024; originally announced July 2024.

  14. arXiv:2407.09120  [pdf, other]

    cs.LG cs.CL cs.CV

    URRL-IMVC: Unified and Robust Representation Learning for Incomplete Multi-View Clustering

    Authors: Ge Teng, Ting Mao, Chen Shen, Xiang Tian, Xuesong Liu, Yaowu Chen, Jie** Ye

    Abstract: Incomplete multi-view clustering (IMVC) aims to cluster multi-view data that are only partially available. This poses two main challenges: effectively leveraging multi-view information and mitigating the impact of missing views. Prevailing solutions employ cross-view contrastive learning and missing view recovery techniques. However, they either neglect valuable complementary information by focusi…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM SIGKDD 2024

  15. arXiv:2407.09119  [pdf, other]

    quant-ph cond-mat.quant-gas physics.atom-ph

    Enhanced quantum state transfer via feedforward cancellation of optical phase noise

    Authors: Benjamin P. Maddox, Jonathan M. Mortlock, Tom R. Hepworth, Adarsh P. Raghuram, Philip D. Gregory, Alexander Guttridge, Simon L. Cornish

    Abstract: Many experimental platforms for quantum science depend on state control via laser fields. Frequently, however, the control fidelity is limited by optical phase noise. This is exacerbated in stabilized laser systems where high-frequency phase noise is an unavoidable consequence of feedback. Here we implement an optical feedforward technique to suppress laser phase noise in the STIRAP state transfer…

    Submitted 12 July, 2024; originally announced July 2024.

  16. arXiv:2407.09114  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci physics.optics quant-ph

    Spectroscopy of Single CdSe Magic-Sized Nanocrystals

    Authors: Gabriel Nagamine, Julian Santen, Juri G. Crimmann, Aniket S. Mule, Andrew B. Pun, David J. Norris

    Abstract: Chemical syntheses that provide nanocrystals (NCs) with narrow distributions in size and shape are critical for NC research. This has led to the investigation of magic-sized NCs (MSNCs), a class of semiconductor crystallites that grow in discrete steps, potentially offering a single size and shape (i.e., monodispersity). However, the photoluminescence (PL) spectra of CdSe MSNCs measured at room te…

    Submitted 12 July, 2024; originally announced July 2024.

  17. arXiv:2407.09111  [pdf, other]

    cs.AI cs.LG

    Inference Optimization of Foundation Models on AI Accelerators

    Authors: Youngsuk Park, Kailash Budhathoki, Liangfu Chen, Jonas Kübler, Jiaji Huang, Matthäus Kleindessner, Jun Huan, Volkan Cevher, Yida Wang, George Karypis

    Abstract: Powerful foundation models, including large language models (LLMs), with Transformer architectures have ushered in a new era of Generative AI across various industries. Industry and research community have witnessed a large number of new applications, based on those foundation models. Such applications include question and answer, customer services, image and video generation, and code completions…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Tutorial published at KDD 2024. Camera-ready version

  18. arXiv:2407.09104  [pdf, other]

    cs.CR cs.LG

    UserBoost: Generating User-specific Synthetic Data for Faster Enrolment into Behavioural Biometric Systems

    Authors: George Webber, Jack Sturgess, Ivan Martinovic

    Abstract: Behavioural biometric authentication systems entail an enrolment period that is burdensome for the user. In this work, we explore generating synthetic gestures from a few real user gestures with generative deep learning, with the application of training a simple (i.e. non-deep-learned) authentication model. Specifically, we show that utilising synthetic data alongside real data can reduce the numb…

    Submitted 12 July, 2024; originally announced July 2024.

  19. arXiv:2407.09102  [pdf, ps, other]

    math.PR

    Quantitative diffusion approximation for the Neutral $r$-Alleles Wright-Fisher Model with Mutations

    Authors: Peng Chen, Jie Xiong, Lihu Xu, Jiayu Zheng

    Abstract: We apply a Lindeberg principle under the Markov process setting to approximate the Wright-Fisher model with neutral $r$-alleles using a diffusion process, deriving an error rate based on a function class distance involving fourth-order bounded differentiable functions. This error rate consists of a linear combination of the maximum mutation rate and the reciprocal of the population size. Our resul…

    Submitted 12 July, 2024; originally announced July 2024.

  20. arXiv:2407.09095  [pdf, other]

    cs.CR

    TAPFixer: Automatic Detection and Repair of Home Automation Vulnerabilities based on Negated-property Reasoning

    Authors: Yinbo Yu, Yuanqi Xu, Kepu Huang, Jiajia Liu

    Abstract: Trigger-Action Programming (TAP) is a popular end-user programming framework in the home automation (HA) system, which eases users to customize home automation and control devices as expected. However, its simplified syntax also introduces new safety threats to HA systems through vulnerable rule interactions. Accurately fixing these vulnerabilities by logically and physically eliminating their roo…

    Submitted 12 July, 2024; originally announced July 2024.

    Journal ref: USENIX Security 2024

  21. arXiv:2407.09093  [pdf, ps, other]

    cs.LG cs.AI

    On Exact Bit-level Reversible Transformers Without Changing Architectures

    Authors: Guoqiang Zhang, J. P. Lewis, W. B. Kleijn

    Abstract: In the literature, various reversible deep neural networks (DNN) models have been proposed to reduce memory consumption or improve data-throughput in the training process. However, almost all existing reversible DNNs either are constrained to have special structures or are constructed by modifying the original DNN architectures considerably to enable reversibility. In this work, we propose exact b…

    Submitted 12 July, 2024; originally announced July 2024.

  22. arXiv:2407.09091  [pdf, other]

    cs.RO

    Accurate Prior-centric Monocular Positioning with Offline LiDAR Fusion

    Authors: **hao He, Huaiyang Huang, Shuyang Zhang, Jianhao Jiao, Chengju Liu, Ming Liu

    Abstract: Unmanned vehicles usually rely on Global Positioning System (GPS) and Light Detection and Ranging (LiDAR) sensors to achieve high-precision localization results for navigation purpose. However, this combination with their associated costs and infrastructure demands, poses challenges for widespread adoption in mass-market applications. In this paper, we aim to use only a monocular camera to achieve…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ICRA 2024

  23. arXiv:2407.09089  [pdf]

    q-bio.MN

    Lomics: Generation of Pathways and Gene Sets using Large Language Models for Transcriptomic Analysis

    Authors: Chun-Ka Wong, Ali Choo, Eugene C. C. Cheng, Wing-Chun San, Kelvin Chak-Kong Cheng, Yee-Man Lau, Minqing Lin, Fei Li, Wei-Hao Liang, Song-Yan Liao, Kwong-Man Ng, Ivan Fan-Ngai Hung, Hung-Fat Tse, Jason Wing-Hon Wong

    Abstract: Interrogation of biological pathways is an integral part of omics data analysis. Large language models (LLMs) enable the generation of custom pathways and gene sets tailored to specific scientific questions. These targeted sets are significantly smaller than traditional pathway enrichment analysis libraries, reducing multiple hypothesis testing and potentially enhancing statistical power. Lomics (…

    Submitted 12 July, 2024; originally announced July 2024.

  24. arXiv:2407.09082  [pdf, other]

    astro-ph.GA

    The MeerKAT Fornax Survey. III. Ram-pressure stripping of the tidally interacting galaxy NGC 1427A in the Fornax cluster

    Authors: P. Serra, T. A. Oosterloo, P. Kamphuis, G. I. G. Jozsa, W. J. G. de Blok, G. L. Bryan, J. H. van Gorkom, E. Iodice, D. Kleiner, A. Loni, S. I. Loubser, F. M. Maccagni, D. Molnar, R. Peletier, D. J. Pisano, M. Ramatsoku, M. W. L. Smith, M. A. W. Verheijen, N. Zabel

    Abstract: We present MeerKAT Fornax Survey HI observations of NGC 1427A, a blue irregular galaxy with a stellar mass of 2e+9 Msun located near the centre of the Fornax galaxy cluster. Thanks to the excellent resolution (1 to 6 kpc spatially, 1.4 km/s in velocity) and HI column density sensitivity (4e+19/cm^2 to 1e+18/cm^2 depending on resolution), our data deliver new insights on the long-debated interactio…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Astronomy & Astrophysics, accepted. Data available at the MeerKAT Fornax Survey website, https://sites.google.com/inaf.it/meerkatfornaxsurvey

  25. arXiv:2407.09073  [pdf, other]

    cs.CV

    Open Vocabulary Multi-Label Video Classification

    Authors: Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Yao, Trishul Chilimbi

    Abstract: Pre-trained vision-language models (VLMs) have enabled significant progress in open vocabulary computer vision tasks such as image classification, object detection and image segmentation. Some recent works have focused on extending VLMs to open vocabulary single label action classification in videos. However, previous methods fall short in holistic video understanding which requires the ability to…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  26. arXiv:2407.09069  [pdf, other]

    hep-th gr-qc

    Holographic Einstein Ring of Deformed AdS-Schwarzschild Black Holes

    Authors: **-Yu Gui, Xiao-Xiong Zeng, Ke-Jian He, Huan Ye

    Abstract: The Einstein ring of a deformed AdS-Schwarzschild black hole (BH) is investigated under the wave optics framework. When the source is fixed on the AdS boundary, we can obtain the corresponding response function generated on the opposite side of the boundary. Utilizing a specialized optical system equipped with a convex lens enables us to capture an image of the BH's holographic Einstein ring on th…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 12 pages, 21 figures

  27. arXiv:2407.09064  [pdf, other]

    cs.IR cs.LG

    Multi-Modal Dataset Creation for Federated Learning with DICOM Structured Reports

    Authors: Malte Tölle, Lukas Burger, Halvar Kelm, Florian André, Peter Bannas, Gerhard Diller, Norbert Frey, Philipp Garthe, Stefan Groß, Anja Hennemuth, Lars Kaderali, Nina Krüger, Andreas Leha, Simon Martin, Alexander Meyer, Eike Nagel, Stefan Orwat, Clemens Scherer, Moritz Seiffert, Jan Moritz Seliger, Stefan Simm, Tim Friede, Tim Seidler, Sandy Engelhardt

    Abstract: Purpose: Federated training is often hindered by heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance.…

    Submitted 12 July, 2024; originally announced July 2024.

  28. arXiv:2407.09059  [pdf, other]

    cs.CV

    Domain-adaptive Video Deblurring via Test-time Blurring

    Authors: **-Ting He, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Dynamic scene video deblurring aims to remove undesirable blurry artifacts captured during the exposure process. Although previous video deblurring methods have achieved impressive results, they suffer from significant performance drops due to the domain gap between training and testing videos, especially for those captured in real-world scenarios. To address this issue, we propose a domain adapta…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  29. arXiv:2407.09055  [pdf, other]

    stat.ML cs.LG

    Advanced Graph Clustering Methods: A Comprehensive and In-Depth Analysis

    Authors: Timothé Watteau, Aubin Bonnefoy, Simon Illouz-Laurent, Joaquim Jusseau, Serge Iovleff

    Abstract: Graph clustering, which aims to divide a graph into several homogeneous groups, is a critical area of study with applications that span various fields such as social network analysis, bioinformatics, and image segmentation. This paper explores both traditional and more recent approaches to graph clustering. Firstly, key concepts and definitions in graph theory are introduced. The background sectio…

    Submitted 12 July, 2024; originally announced July 2024.

  30. arXiv:2407.09053  [pdf, other]

    cs.RO

    Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing

    Authors: Jun Zhu, Zihao Du, Haotian Xu, Fengbo Lan, Zilong Zheng, Bo Ma, Shengjie Wang, Tao Zhang

    Abstract: Task-aware navigation continues to be a challenging area of research, especially in scenarios involving open vocabulary. Previous studies primarily focus on finding suitable locations for task completion, often overlooking the importance of the robot's pose. However, the robot's orientation is crucial for successfully completing tasks because of how objects are arranged (e.g., to open a refrigerat…

    Submitted 12 July, 2024; originally announced July 2024.

  31. arXiv:2407.09045  [pdf, other]

    cs.IR cs.AI

    Time-Frequency Analysis of Variable-Length WiFi CSI Signals for Person Re-Identification

    Authors: Chen Mao, Chong Tan, **gqi Hu, Min Zheng

    Abstract: Person re-identification (ReID), as a crucial technology in the field of security, plays an important role in security detection and people counting. Current security and monitoring systems largely rely on visual information, which may infringe on personal privacy and be susceptible to interference from pedestrian appearances and clothing in certain scenarios. Meanwhile, the widespread use of rout…

    Submitted 12 July, 2024; originally announced July 2024.

  32. arXiv:2407.09043  [pdf, other]

    cs.AI

    Molecule Language Model with Augmented Pairs and Expertise Transfer

    Authors: Namkyeong Lee, Siddhartha Laghuvarapu, Chanyoung Park, Jimeng Sun

    Abstract: Understanding the molecules and their textual descriptions via molecule language models (MoLM) recently got a surge of interest among researchers. However, unique challenges exist in the field of MoLM due to 1) a limited amount of molecule-text paired data and 2) missing expertise that occurred due to the specialized areas of focus among the experts. To this end, we propose AMOLE, which 1) augment…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ACL 2024 Workshop on Languages and Molecule

  33. arXiv:2407.09041  [pdf, other]

    eess.SP

    Optimization of Long-Haul C+L+S Systems by means of a Closed Form EGN Model

    Authors: Y. Jiang, J. Sarkis, A. Nespola, F. Forghieri, S. Piciaccia, A. Tanzi, M. Ranjbar Zefreh, P. Poggiolini

    Abstract: We investigate C+L+S long-haul systems using a closed-form GN/EGN non-linearity model. We perform accurate launch power and Raman pump optimization. We show a potential 4x throughput increase over legacy C-band systems in 1000 km links, using moderate S-only Raman amplification. We simultaneously achieve extra-flat GSNR, within +/-0.5 dB across the whole C+L+S spectrum.

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: The paper is identical to a manuscript submitted to PTL in June 2024, except this arXiv version has been updated in the references. Ref. [8] and [10] are about CFM6 and its experimental validation

  34. arXiv:2407.09038  [pdf, other]

    eess.IV

    High-Resolution Hyperspectral Video Imaging Using A Hexagonal Camera Array

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Retrieving the reflectance spectrum from objects is an essential task for many classification and detection problems, since many materials and processes have a unique spectral behaviour. In many cases, it is highly desirable to capture hyperspectral images due to the high spectral flexibility. Often, it is even necessary to capture hyperspectral videos or at least to be able to record a hyperspect…

    Submitted 12 July, 2024; originally announced July 2024.

  35. arXiv:2407.09036  [pdf, ps, other]

    math.AG

    On the structure of the complement of skeleton

    Authors: Morgan Brown, Jiachang Xu, Muyuan Zhang

    Abstract: We study the higher dimensional geometry of Berkovich spaces using open fiber disks, which are given by open disks in a relative dimension $1$ fibration. Inspired by birational geometry, we conjecture that the Berkovich skeleton is the complement of the union of all open fiber disks, and prove this conjecture for $\mathcal{X}$ admitting a strictly semistable model with semiample canonical class.

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Comments are welcome!

    MSC Class: 14G22; 14E30

  36. arXiv:2407.09035  [pdf, other]

    eess.IV cs.CV

    GPC: Generative and General Pathology Image Classifier

    Authors: Anh Tien Nguyen, ** Tae Kwak

    Abstract: Deep learning has been increasingly incorporated into various computational pathology applications to improve its efficiency, accuracy, and robustness. Although successful, most previous approaches for image classification have crucial drawbacks. There exist numerous tasks in pathology, but one needs to build a model per task, i.e., a task-specific model, thereby increasing the number of models, t…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: MICCAI-MedAGI 2023 (Best Paper Honorable Mention)

  37. arXiv:2407.09032  [pdf, other]

    math.NA cs.LG

    DRM Revisited: A Complete Error Analysis

    Authors: Yuling Jiao, Ruoxuan Li, Peiying Wu, Jerry Zhijian Yang, **wen Zhang

    Abstract: In this work, we address a foundational question in the theoretical analysis of the Deep Ritz Method (DRM) under the over-parameterization regime: Given a target precision level, how can one determine the appropriate number of training samples, the key architectural parameters of the neural networks, the step size for the projected gradient descent optimization procedure, and the requisite number o…

    Submitted 12 July, 2024; originally announced July 2024.

  38. arXiv:2407.09030  [pdf, other]

    eess.IV cs.CV

    CAMP: Continuous and Adaptive Learning Model in Pathology

    Authors: Anh Tien Nguyen, Keunho Byeon, Kyungeun Kim, Boram Song, Seoung Wan Chae, ** Tae Kwak

    Abstract: There exist numerous diagnostic tasks in pathology. Conventional computational pathology formulates and tackles them as independent and individual image classification problems, thereby resulting in computational inefficiency and high costs. To address the challenges, we propose a generic, unified, and universal framework, called a continuous and adaptive learning model in pathology (CAMP), for pa…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Under review

  39. arXiv:2407.09029  [pdf, other]

    cs.MM cs.CV cs.SD eess.AS

    Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework

    Authors: Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Aobo Kong, Jiaming Zhou, Yong Chen, Wenjia Zeng, Yong Qin

    Abstract: Multimodal emotion recognition systems rely heavily on the full availability of modalities, suffering significant performance declines when modal data is incomplete. To tackle this issue, we present the Cross-Modal Alignment, Reconstruction, and Refinement (CM-ARR) framework, an innovative approach that sequentially engages in cross-modal alignment, reconstruction, and refinement phases to handle…

    Submitted 12 July, 2024; originally announced July 2024.

  40. arXiv:2407.09027  [pdf, other]

    quant-ph

    Exploring the role of criticality in the quantum Otto cycle fueled by the anisotropic quantum Rabi-Stark model

    Authors: He-Guang Xu, Jiasen **, Norton G. de Almeida, G. D. de Moraes Neto

    Abstract: Quantum heat machines, encompassing heat engines, refrigerators, heaters, and accelerators, represent the forefront of quantum thermodynamics, offering a novel paradigm for converting heat energy into useful mechanical work. Leveraging quantum mechanical principles, these machines promise superior efficiency and performance compared to classical counterparts, with potential applications in renewab…

    Submitted 12 July, 2024; originally announced July 2024.

  41. arXiv:2407.09025  [pdf, other]

    cs.AI

    SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

    Authors: Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang

    Abstract: Spreadsheets, with their extensive two-dimensional grids, various layouts, and diverse formatting options, present notable challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method designed to unleash and optimize LLMs' powerful understanding and reasoning capability on spreadsheets. Initially, we propose a vanilla serialization…

    Submitted 12 July, 2024; originally announced July 2024.

  42. arXiv:2407.09024  [pdf, other]

    cs.LG

    Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control

    Authors: Huayu Chen, Kaiwen Zheng, Hang Su, Jun Zhu

    Abstract: Drawing upon recent advances in language model alignment, we formulate offline Reinforcement Learning as a two-stage optimization problem: First pretraining expressive generative policies on reward-free behavior datasets, then fine-tuning these policies to align with task-specific annotations like Q-values. This strategy allows us to leverage abundant and diverse behavior data to enhance generaliz…

    Submitted 12 July, 2024; originally announced July 2024.

  43. arXiv:2407.09021  [pdf, other]

    eess.AS

    Squeeze-and-Excite ResNet-Conformers for Sound Event Localization, Detection, and Distance Estimation for DCASE 2024 Challenge

    Authors: Jun Wei Yeow, Ee-Leng Tan, Jisheng Bai, Santi Peksi, Woon-Seng Gan

    Abstract: This technical report details our systems submitted for Task 3 of the DCASE 2024 Challenge: Audio and Audiovisual Sound Event Localization and Detection (SELD) with Source Distance Estimation (SDE). We address only the audio-only SELD with SDE (SELDDE) task in this report. We propose to improve the existing ResNet-Conformer architectures with Squeeze-and-Excitation blocks in order to introduce add…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Technical report for DCASE 2024 Challenge Task 3

  44. arXiv:2407.09020  [pdf, other]

    cs.CL

    3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection

    Authors: Rina Carines Cabral, Siwen Luo, Soyeon Caren Han, Josiah Poon

    Abstract: The significance of mental health classification is paramount in contemporary society, where digital platforms serve as crucial sources for monitoring individuals' well-being. However, existing social media mental health datasets primarily consist of text-only samples, potentially limiting the efficacy of models trained on such data. Recognising that humans utilise cross-modal information to compr…

    Submitted 12 July, 2024; originally announced July 2024.

  45. arXiv:2407.09017  [pdf, other]

    cs.LG cs.CR cs.IR

    AI-Driven Guided Response for Security Operation Centers with Microsoft Copilot for Security

    Authors: Scott Freitas, Jovan Kalajdjieski, Amir Gharib, Rob McCann

    Abstract: Security operation centers contend with a constant stream of security incidents, ranging from straightforward to highly complex. To address this, we developed Copilot Guided Response (CGR), an industry-scale ML architecture that guides security analysts across three key tasks -- (1) investigation, providing essential historical context by identifying similar incidents; (2) triaging to ascertain th…

    Submitted 12 July, 2024; originally announced July 2024.

  46. arXiv:2407.09016  [pdf, other]

    cs.RO

    OVExp: Open Vocabulary Exploration for Object-Oriented Navigation

    Authors: Meng Wei, Tai Wang, Yilun Chen, Hanqing Wang, Jiangmiao Pang, Xihui Liu

    Abstract: Object-oriented embodied navigation aims to locate specific objects, defined by category or depicted in images. Existing methods often struggle to generalize to open vocabulary goals without extensive training data. While recent advances in Vision-Language Models (VLMs) offer a promising solution by extending object recognition beyond predefined categories, efficient goal-oriented exploration beco…

    Submitted 12 July, 2024; originally announced July 2024.

  47. arXiv:2407.09014  [pdf, other]

    cs.CL

    CompAct: Compressing Retrieved Documents Actively for Question Answering

    Authors: Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang

    Abstract: Retrieval-augmented generation supports language models to strengthen their factual groundings by providing external contexts. However, language models often face challenges when given extensive information, diminishing their effectiveness in solving questions. Context compression tackles this issue by filtering out irrelevant information, but current methods still struggle in realistic scenarios…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Code available at https://github.com/dmis-lab/CompAct

  48. arXiv:2407.09012  [pdf, other]

    cs.CV cs.AI

    TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models

    Authors: Jeongho Kim, Min-Jung Kim, Junsoo Lee, Jaegul Choo

    Abstract: Pose-driven human-image animation diffusion models have shown remarkable capabilities in realistic human video synthesis. Despite the promising results achieved by previous approaches, challenges persist in achieving temporally consistent animation and ensuring robustness with off-the-shelf pose detectors. In this paper, we present TCAN, a pose-driven human image animation method that is robust to…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: The first two authors contributed equally

  49. arXiv:2407.09009  [pdf, other]

    astro-ph.EP

    Probing Cold-to-Temperate Exoplanetary Atmospheres: The Role of Water Condensation on Surface Identification with JWST

    Authors: Ziyu Huang, Xinting Yu, Shang-Min Tsai, Julianne I. Moses, Kazumasa Ohno, Joshua Krissansen-Totton, Xi Zhang, Jonathan Fortney

    Abstract: Understanding the surface temperature and interior structure of cold-to-temperate sub-Neptunes is critical for assessing their habitability, yet direct observations are challenging. In this study, we investigate the impact of water condensation on the atmospheric compositions of sub-Neptunes, focusing on the implications for JWST spectroscopic observations. By modeling the atmospheric photochemist…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 21 pages, 7 figures

  50. arXiv:2407.09005  [pdf, other]

    cs.CV cs.AI eess.IV

    Introducing VaDA: Novel Image Segmentation Model for Maritime Object Segmentation Using New Dataset

    Authors: Yongjin Kim, Jinbum Park, Sanha Kang, Hanguen Kim

    Abstract: The maritime shipping industry is undergoing rapid evolution driven by advancements in computer vision artificial intelligence (AI). Consequently, research on AI-based object recognition models for maritime transportation is steadily growing, leveraging advancements in sensor technology and computing performance. However, object recognition in maritime environments faces challenges such as light r…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 11 pages, 9 figures, whitepaper