Skip to main content

Showing 1–50 of 417,517 results for author: P.

.
  1. arXiv:2407.03321  [pdf, other

    cs.CL cs.AI cs.LG

    Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages

    Authors: Max Zuo, Francisco Piedrahita Velez, Xiaochen Li, Michael L. Littman, Stephen H. Bach

    Abstract: Many recent works have explored using language models for planning problems. One line of research focuses on translating natural language descriptions of planning tasks into structured planning languages, such as the planning domain definition language (PDDL). While this approach is promising, accurately measuring the quality of generated PDDL code continues to pose significant challenges. First,… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.03320  [pdf, other

    cs.CV cs.CL

    InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

    Authors: Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, **gwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao , et al. (2 additional authors not shown)

    Abstract: We present InternLM-XComposer-2.5 (IXC-2.5), a versatile large-vision language model that supports long-contextual input and output. IXC-2.5 excels in various text-image comprehension and composition applications, achieving GPT-4V level capabilities with merely 7B LLM backend. Trained with 24K interleaved image-text contexts, it can seamlessly extend to 96K long contexts via RoPE extrapolation. Th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Technical Report. https://github.com/InternLM/InternLM-XComposer

  3. arXiv:2407.03316  [pdf, other

    nucl-ex hep-ex

    An Upper Limit on the Photoproduction Cross Section of the Spin-Exotic $π_1(1600)$

    Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole , et al. (124 additional authors not shown)

    Abstract: The spin-exotic hybrid meson $π_{1}(1600)$ is predicted to have a large decay rate to the $ωππ$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $γp \to ωπ^+ π^- p$, $γp \to ωπ^0 π^0 p$, and $γp\toωπ^-π^0Δ^{++}$ in the range $E_γ=$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction c… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures plus supplemental materials

  4. arXiv:2407.03314  [pdf, other

    cs.CV cs.CL cs.DB

    BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

    Authors: Zhantao Yang, Ruili Feng, Keyu Yan, Huangji Wang, Zhicai Wang, Shangwen Zhu, Han Zhang, Jie Xiao, **yu Wu, Kai Zhu, Jixuan Chen, Chen-Wei Xie, Chaojie Mao, Yue Yang, Hongyang Zhang, Yu Liu, Fan Cheng

    Abstract: This paper presents Bag-of-Concept Graph (BACON) to gift models with limited linguistic abilities to taste the privilege of Vision Language Models (VLMs) and boost downstream tasks such as detection, visual question answering (VQA), and image generation. Since the visual scenes in physical worlds are structured with complex relations between objects, BACON breaks down annotations into basic minimu… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  5. arXiv:2407.03307  [pdf, other

    eess.IV cs.CV

    HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

    Authors: Yucheng Tang, Yufan He, Vishwesh Nath, Pengfeig Guo, Ruining Deng, Tianyuan Yao, Quan Liu, Can Cui, Mengmeng Yin, Ziyue Xu, Holger Roth, Daguang Xu, Haichun Yang, Yuankai Huo

    Abstract: In digital pathology, the traditional method for deep learning-based image segmentation typically involves a two-stage process: initially segmenting high-resolution whole slide images (WSI) into smaller patches (e.g., 256x256, 512x512, 1024x1024) and subsequently reconstructing them to their original scale. This method often struggles to capture the complex details and vast scope of WSIs. In this… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  6. A Graded Mesh Refinement for 2D Poisson's Equation on Non Convex Polygonal Domains

    Authors: Charuka D. Wickramasinghe, Priyanka Ahire

    Abstract: This work delves into solving the two dimensional Poisson problem through the Finite Element Method which is relevant in various physical scenarios including heat conduction, electrostatics, gravity potential, and fluid dynamics. However, finding exact solutions to these problems can be complicated and challenging due to complexities in the domains such as re-entrant corners, cracks, and discontin… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 18 pages,10 figures

    Journal ref: Asia Pac. J. Math. 2024 11:63

  7. arXiv:2407.03293  [pdf, other

    cond-mat.mes-hall cond-mat.str-el cond-mat.supr-con

    Microscopic theory for electron-phonon coupling in twisted bilayer graphene

    Authors: Ziyan Zhu, Thomas P. Devereaux

    Abstract: The origin of superconductivity in twisted bilayer graphene -- whether phonon-driven or electron-driven -- remains unresolved. The answer to this question is hindered by the absence of a quantitative and efficient model for electron-phonon coupling (EPC). In this work, we develop a first-principles-based microscopic theory to calculate EPC in twisted bilayer graphene for arbitrary twist angles wit… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  8. arXiv:2407.03289  [pdf, other

    cs.IT cs.CR cs.LG

    Correlated Privacy Mechanisms for Differentially Private Distributed Mean Estimation

    Authors: Sajani Vithana, Viveck R. Cadambe, Flavio P. Calmon, Haewon Jeong

    Abstract: Differentially private distributed mean estimation (DP-DME) is a fundamental building block in privacy-preserving federated learning, where a central server estimates the mean of $d$-dimensional vectors held by $n$ users while ensuring $(ε,δ)$-DP. Local differential privacy (LDP) and distributed DP with secure aggregation (SecAgg) are the most common notions of DP used in DP-DME settings with an u… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  9. arXiv:2407.03282  [pdf, other

    cs.CL

    LLM Internal States Reveal Hallucination Risk Faced With a Query

    Authors: Ziwei Ji, Delong Chen, Etsuko Ishii, Samuel Cahyawijaya, Ye** Bang, Bryan Wilie, Pascale Fung

    Abstract: The hallucination problem of Large Language Models (LLMs) significantly limits their reliability and trustworthiness. Humans have a self-awareness process that allows us to recognize what we don't know when faced with queries. Inspired by this, our paper investigates whether LLMs can estimate their own hallucination risk before response generation. We analyze the internal mechanisms of LLMs broadl… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  10. arXiv:2407.03281  [pdf, other

    astro-ph.SR

    Direct evidence of hybrid nature of EUV waves and the reflection of the fast-mode wave

    Authors: Ramesh Chandra, P. F. Chen, Pooja Devi

    Abstract: In current study, we perform the analysis of an extreme ultraviolet (EUV) wave on 2022 March 31. The event originated from the from NOAA active region (AR) 12975 (location: N13W52) in the Atmospheric imaging Assembly (AIA) onboard Solar Dynamics Observatory (SDO) satellite and exactly the west limb in Solar Terrestrial Relations Observatory-Ahead (STEREO-A) observations. The EUV wave was associate… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 figures, 16 pages

  11. arXiv:2407.03278  [pdf, ps, other

    math.CA math.FA

    A survey of the Hornich-Hlawka inequality

    Authors: Dan-Ştefan Marinescu, Constantin P. Niculescu

    Abstract: In this survey, we review the many faces of the Hornich-Hlawka inequality. Several open problems that seem of utmost interest are mentioned.

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 17 pages

    MSC Class: 46B04; 46B20; 26D15

  12. arXiv:2407.03271  [pdf, ps, other

    astro-ph.HE

    Timing of millisecond pulsars in NGC\,6752 -- III. On the presence of non-luminous matter in the cluster's core

    Authors: A. Corongiu, A. Ridolfi, F. Abbate, M. Bailes, A. Possenti, M. Geyer, R. N. Manchester, M. Kramer, P. C. C. Freire, M. Burgay, S. Buchner, F. Camilo

    Abstract: Millisecond pulsars are subject to accelerations in globular clusters that manifest themselves in both the first and second spin period time derivatives, and can be used to explore the mass distribution of the potentials they inhabit. Here we report on over 20 years of pulsar timing observations of five millisecond radio pulsars in the core of the core-collapse globular cluster NGC\,6752 with the… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in ApJ

  13. arXiv:2407.03268  [pdf, other

    cs.CV

    For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives

    Authors: Lia Morra, Antonio Santangelo, Pietro Basci, Luca Piano, Fabio Garcea, Fabrizio Lamberti, Massimo Leone

    Abstract: Social networks are creating a digital world in which the cognitive, emotional, and pragmatic value of the imagery of human faces and bodies is arguably changing. However, researchers in the digital humanities are often ill-equipped to study these phenomena at scale. This work presents FRESCO (Face Representation in E-Societies through Computational Observation), a framework designed to explore th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  14. arXiv:2407.03267  [pdf

    cond-mat.mtrl-sci

    Insulator-to-Metal Transition and Isotropic Gigantic Magnetoresistance in Layered Magnetic Semiconductors

    Authors: Gokul Acharya, Bimal Neupane, Chia-Hsiu Hsu, Xian P. Yang, David Graf, Eun Sang Choi, Krishna Pandey, Md Rafique Un Nabi, Santosh Karki Chhetri, Rabindra Basnet, Sumaya Rahman, Jian Wang, Zhengxin Hu, Bo Da, Hugh Churchill, Guoqing Chang, M. Zahid Hasan, Yuanxi Wang, ** Hu

    Abstract: Magnetotransport, the response of electrical conduction to external magnetic field, acts as an important tool to reveal fundamental concepts behind exotic phenomena and plays a key role in enabling spintronic applications. Magnetotransport is generally sensitive to magnetic field orientations. In contrast, efficient and isotropic modulation of electronic transport, which is useful in technology ap… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 44 pages, 18 figures

  15. arXiv:2407.03262  [pdf, ps, other

    cs.DS cs.LG

    Nearly Linear Sparsification of $\ell_p$ Subspace Approximation

    Authors: David P. Woodruff, Taisuke Yasuda

    Abstract: The $\ell_p$ subspace approximation problem is an NP-hard low rank approximation problem that generalizes the median hyperplane problem ($p = 1$), principal component analysis ($p = 2$), and the center hyperplane problem ($p = \infty$). A popular approach to cope with the NP-hardness of this problem is to compute a strong coreset, which is a small weighted subset of the input points which simultan… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  16. arXiv:2407.03260  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Improvement of the perovskite photodiodes performance via advanced interface engineering with polymer dielectric

    Authors: A. P. Morozov, L. O. Luchnikov, S. Yu. Yurchuk, A. R. Ishteev, P. A. Gostishchev, S. I. Didenko, N. S. Saratovsky, S. S. Kozlov, D. S. Muratov, Yu. N. Luponosov, D. S. Saranin

    Abstract: Halide perovskite-based photodiodes are promising for efficient detection across a broad spectral range. Perovskite absorber thin-films have a microcrystalline morphology, characterized by a high density of surface states and defects at inter-grain interfaces. In this work, we used dielectric-ferroelectric poly(vinylidene-fluoride-trifluoroethylene-P(VDF-TrFE) to modify the bulk interfaces and ele… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  17. arXiv:2407.03249  [pdf, other

    quant-ph cond-mat.quant-gas physics.atom-ph

    Quantum coarsening and collective dynamics on a programmable quantum simulator

    Authors: Tom Manovitz, Sophie H. Li, Sepehr Ebadi, Rhine Samajdar, Alexandra A. Geim, Simon J. Evered, Dolev Bluvstein, Hengyun Zhou, Nazli Uğur Köylüoğlu, Johannes Feldmeier, Pavel E. Dolgirev, Nishad Maskara, Marcin Kalinowski, Subir Sachdev, David A. Huse, Markus Greiner, Vladan Vuletić, Mikhail D. Lukin

    Abstract: Understanding the collective quantum dynamics of nonequilibrium many-body systems is an outstanding challenge in quantum science. In particular, dynamics driven by quantum fluctuations are important for the formation of exotic quantum phases of matter \cite{altman2023quantum}, fundamental high-energy processes \cite{bauer2023highenergy}, quantum metrology \cite{degen2017sensing, li2023scrambling},… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 25 pages, 14 figures

  18. arXiv:2407.03227  [pdf, other

    cs.CL cs.AI cs.DB

    Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning

    Authors: Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan

    Abstract: We focus on Text-to-SQL semantic parsing from the perspective of Large Language Models. Motivated by challenges related to the size of commercial database schemata and the deployability of business intelligence solutions, we propose an approach that dynamically retrieves input database information and uses abstract syntax trees to select few-shot examples for in-context learning. Furthermore, we… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  19. arXiv:2407.03225  [pdf, other

    astro-ph.GA

    The large-scale structure around the Fornax-Eridanus Complex

    Authors: Maria Angela Raj, Petra Awad, Reynier F. Peletier, Rory Smith, Ulrike Kuchner, Rien van de Weygaert, Noam I. Libeskind, Marco Canducci, Peter Tino, Kerstin Bunte

    Abstract: Our objectives are to map the filamentary network around the Fornax-Eridanus Complex and probe the influence of the local environment on galaxy morphology. We employ the novel machine-learning tool, 1-DREAM (1-Dimensional, Recovery, Extraction, and Analysis of Manifolds) to detect and model filaments around the Fornax cluster. We then use the morphology-density relation of galaxies to examine the… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in A&A. 21 pages with 15 figures

  20. arXiv:2407.03220  [pdf, other

    cond-mat.mes-hall

    Impact of planar defects on the reversal time of single magnetic domain nanoparticles

    Authors: Hugo Bocquet, Armin Kleibert, Peter M. Derlet

    Abstract: Recent experimental investigations of individual magnetic nanoparticles reveal a diverse range of magnetic relaxation times which cannot be explained by considering their size, shape, and surface anisotropy, suggesting other factors associated with the internal microstructure of the particles are at play. In this letter, we apply Langer's theory of thermal activation to single magnetic domain fcc… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures

  21. arXiv:2407.03216  [pdf, other

    cs.CV cs.AI

    Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers

    Authors: Sanket Gandhi, Atul, Samanyu Mahajan, Vishal Sharma, Rushil Gupta, Arnab Kumar Mondal, Parag Singla

    Abstract: Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further, ask the following question: "can learning disentangled representation further improve the accuracy of visual dynamics prediction in object-centric models?" While there has been some attempt to le… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  22. arXiv:2407.03215  [pdf, other

    physics.ins-det cond-mat.mtrl-sci cs.DC cs.NI

    Streaming Large-Scale Electron Microscopy Data to a Supercomputing Facility

    Authors: Samuel S. Welborn, Chris Harris, Stephanie M. Ribet, Georgios Varnavides, Colin Ophus, Bjoern Enders, Peter Ercius

    Abstract: Data management is a critical component of modern experimental workflows. As data generation rates increase, transferring data from acquisition servers to processing servers via conventional file-based methods is becoming increasingly impractical. The 4D Camera at the National Center for Electron Microscopy (NCEM) generates data at a nominal rate of 480 Gbit/s (87,000 frames/s) producing a 700 GB… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  23. arXiv:2407.03207  [pdf, other

    physics.ins-det

    Loss rate of ultracold neutrons due to the absorption by trap walls in large material traps

    Authors: Pavel D. Grigoriev, Vladislav D. Kochev, Victor A. Tsyplukhin, Alexander M. Dyugaev, Ilya Ya. Polishchuk

    Abstract: The most accurate neutron lifetime measurements now use the material or magnetic traps of ultracold neutrons (UCN). The precision of these experiments is determined by the accuracy of estimating the neutron loss rate. In material UCN traps the main source of neutron losses is the absorption by trap walls. In this paper we analyze the standard methods and their approximations for the calculation of… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 12 pages, 6 figures

  24. arXiv:2407.03199  [pdf, other

    astro-ph.EP

    BOWIE-ALIGN: How formation and migration histories of giant planets impact atmospheric compositions

    Authors: Anna B. T. Penzlin, Richard A. Booth, James Kirk, James E. Owen, Eva-Maria Ahrer, Duncan A. Christie, Alastair B. Claringbold, Emma Esparza-Borges, M. López-Morales, N. J. Mayne, Mason McCormack, Annabella Meech, Vatsal Panwar, Diana Powell, Denis E. Sergeev, Jake Taylor, Peter J. Wheatley, Maria Zamyatina

    Abstract: Hot Jupiters present a unique opportunity for measuring how planet formation history shapes present-day atmospheric composition. However, due to the myriad pathways influencing composition, a well-constructed sample of planets is needed to determine whether formation history can be accurately traced back from atmospheric composition. To this end, the BOWIE-ALIGN survey will compare the composition… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 11pages 10 figures, (appendix: 6 page, 4 figures), submitted to mnras

  25. arXiv:2407.03198  [pdf, other

    astro-ph.EP

    BOWIE-ALIGN: A JWST comparative survey of aligned vs misaligned hot Jupiters to test the dependence of atmospheric composition on migration history

    Authors: James Kirk, Eva-Maria Ahrer, Anna B. T. Penzlin, James E. Owen, Richard A. Booth, Lili Alderson, Duncan A. Christie, Alastair B. Claringbold, Emma Esparza-Borges, Chloe E. Fisher, Mercedes López-Morales, N. J. Mayne, Mason McCormack, Annabella Meech, Vatsal Panwar, Diana Powell, Jake Taylor, Denis E. Sergeev, Daniel Valentine, Hannah R. Wakeford, Peter J. Wheatley, Maria Zamyatina

    Abstract: A primary objective of exoplanet atmosphere characterisation is to learn about planet formation and evolution, however, this is challenged by degeneracies. To determine whether differences in atmospheric composition can be reliably traced to differences in evolution, we are undertaking a new survey with JWST to compare the compositions of a sample of hot Jupiters that orbit F stars above the Kraft… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 13 pages, 8 figures, submitted to RASTI

  26. arXiv:2407.03192  [pdf, other

    cs.DL cs.CL

    CiteAssist: A System for Automated Preprint Citation and BibTeX Generation

    Authors: Lars Benedikt Kaesberg, Terry Ruas, Jan Philip Wahle, Bela Gipp

    Abstract: We present CiteAssist, a system to automate the generation of BibTeX entries for preprints, streamlining the process of bibliographic annotation. Our system extracts metadata, such as author names, titles, publication dates, and keywords, to create standardized annotations within the document. CiteAssist automatically attaches the BibTeX citation to the end of a PDF and links it on the first page… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Published at SDProc @ ACL 2024

  27. arXiv:2407.03191  [pdf, other

    cond-mat.mtrl-sci physics.chem-ph physics.comp-ph physics.optics quant-ph

    Controlling Plasmonic Catalysis via Strong Coupling with Electromagnetic Resonators

    Authors: Jakub Fojt, Paul Erhart, Christian Schäfer

    Abstract: Plasmonic excitations decay within femtoseconds, leaving non-thermal (often referred to as "hot") charge carriers behind that can be injected into molecular structures to trigger chemical reactions that are otherwise out of reach -- a process known as plasmonic catalysis. In this Letter, we demonstrate that strong coupling between resonator structures and plasmonic nanoparticles can be used to con… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  28. arXiv:2407.03187  [pdf

    cs.CY

    Holistic view of the road transportation system based on real-time data sharing mechanism

    Authors: Li Tao, Dong Xiang, Hao Junfeng, Yin **, Xu Xiaoxue, Lai Maokai, Li Yuan, Peng Ting

    Abstract: Traditional manual driving and single-vehicle-based intelligent driving have limitations in real-time and accurate acquisition of the current driving status and intentions of surrounding vehicles, leading to vehicles typically maintaining appropriate safe distances from each other. Yet, accidents still frequently occur, especially in merging areas; meanwhile, it is difficult to comprehensively obt… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  29. arXiv:2407.03183  [pdf, other

    eess.SY cs.AI

    A Formal Model for Artificial Intelligence Applications in Automation Systems

    Authors: Marvin Schieseck, Philip Topalis, Lasse Reinpold, Felix Gehlhoff, Alexander Fay

    Abstract: The integration of Artificial Intelligence (AI) into automation systems has the potential to enhance efficiency and to address currently unsolved existing technical challenges. However, the industry-wide adoption of AI is hindered by the lack of standardized documentation for the complex compositions of automation systems, AI software, production hardware, and their interdependencies. This paper p… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  30. arXiv:2407.03179  [pdf, other

    cs.CV cs.AI cs.LG

    Motion meets Attention: Video Motion Prompts

    Authors: Qixiang Chen, Lei Wang, Piotr Koniusz, Tom Gedeon

    Abstract: Videos contain rich spatio-temporal information. Traditional methods for extracting motion, used in tasks such as action recognition, often rely on visual contents rather than precise motion features. This phenomenon is referred to as 'blind motion extraction' behavior, which proves inefficient in capturing motions of interest due to a lack of motion-guided cues. Recently, attention mechanisms hav… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Research report

  31. arXiv:2407.03168  [pdf, other

    cs.CV

    LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

    Authors: Jianzhu Guo, Dingyun Zhang, Xiaoqiang Liu, Zhizhou Zhong, Yuan Zhang, Pengfei Wan, Di Zhang

    Abstract: Portrait Animation aims to synthesize a lifelike video from a single source image, using it as an appearance reference, with motion (i.e., facial expressions and head pose) derived from a driving video, audio, text, or generation. Instead of following mainstream diffusion-based methods, we explore and extend the potential of the implicit-keypoint-based framework, which effectively balances computa… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  32. arXiv:2407.03146  [pdf, other

    cs.CY cs.AI cs.CV cs.GT cs.LG

    Enhancing Class Fairness in Classification with A Two-Player Game Approach

    Authors: Yunpeng Jiang, Paul Weng, Yutong Ban

    Abstract: Data augmentation is widely applied and has shown its benefits in different machine learning tasks. However, as recently observed in some downstream tasks, data augmentation may introduce an unfair impact on classifications. While it can improve the performance of some classes, it can actually be detrimental for other classes, which can be problematic in some application domains. In this paper, to… ▽ More

    Submitted 30 May, 2024; originally announced July 2024.

  33. arXiv:2407.03140  [pdf, other

    cs.CV

    Machine Learning Models for Improved Tracking from Range-Doppler Map Images

    Authors: Elizabeth Hou, Ross Greenwood, Piyush Kumar

    Abstract: Statistical tracking filters depend on accurate target measurements and uncertainty estimates for good tracking performance. In this work, we propose novel machine learning models for target detection and uncertainty estimation in range-Doppler map (RDM) images for Ground Moving Target Indicator (GMTI) radars. We show that by using the outputs of these models, we can significantly improve the perf… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  34. arXiv:2407.03138  [pdf, ps, other

    quant-ph

    Superselection rules and bosonic quantum computational resources

    Authors: Eloi Descamps, Nicolas Fabre, Astghik Saharyan, Arne Keller, Pérola Milman

    Abstract: We present a method to systematically identify and classify quantum optical non-classical resources based on the computational power they generate in a bosonic quantum computer. To achieve this, we establish a one-to-one correspondence between arbitrary continuous variable states in a multimode Hilbert space and single photons occupying each a single mode, which are used to define a bosonic quantu… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  35. arXiv:2407.03137  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.IM

    X-Shooting ULLYSES: Massive Stars at low metallicity -- IV. Spectral analysis methods and exemplary results for O stars

    Authors: A. A. C. Sander, J. -C. Bouret, M. Bernini-Peron, J. Puls, F. Backs, S. R. Berlanas, J. M. Bestenlehner, S. A. Brands, A. Herrero, F. Martins, O. Maryeva, D. Pauli, V. Ramachandran, P. A. Crowther, V. M. A. Gómez-González, A. C. Gormaz-Matamala, W. -R. Hamann, D. J. Hillier, R. Kuiper, C. J. K. Larkin, R. R. Lefever, A. Mehner, F. Najarro, L. M. Oskinova, E. C. Schösser , et al. (4 additional authors not shown)

    Abstract: CONTEXT: The spectral analysis of hot, massive stars is a fundamental astrophysical method to obtain their intrinsic properties and their feedback. Quantitative spectroscopy for hot, massive stars requires detailed numerical modeling of the atmosphere and an iterative treatment to obtain the best solution within a given framework. AIMS: We present an overview of different techniques for the quanti… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 18+15 pages, 21+4 figures, under review at A&A, condensed abstract

  36. arXiv:2407.03132  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech

    Authors: Tobias Weise, Philipp Klumpp, Kubilay Can Demir, Paula Andrea Pérez-Toro, Maria Schuster, Elmar Noeth, Bjoern Heismann, Andreas Maier, Seung Hee Yang

    Abstract: This paper introduces a novel combination of two tasks, previously treated separately: acoustic-to-articulatory speech inversion (AAI) and phoneme-to-articulatory (PTA) motion estimation. We refer to this joint task as acoustic phoneme-to-articulatory speech inversion (APTAI) and explore two different approaches, both working speaker- and text-independently during inference. We use a multi-task le… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: to be published in Interspeech 2024 proceedings

  37. arXiv:2407.03128  [pdf

    cond-mat.mtrl-sci physics.optics

    Thorium doped strontium fluoride crystal: a unique candidate for solid nuclear optical clock material

    Authors: Qiaorui Gong, Shanming Li, Shulong Zhang, Siliang Tao, Guoliang Deng, Peixiong Zhang, Chengchun Zhao, Yin Hang, Shining Zhu, Longsheng Ma

    Abstract: We report a candidate with unique advantages in the cultivation of solid-state nuclear clock material, Th:SrF2 crystal. It not only has a segregation coefficient close to 1, which can achieve highly efficient and uniform do** of Th, but also ensures a high transmittance (~69% at 150 nm) while achieving extremely high do** concentration (232Th>6*10^20 cm^(-3). In addition, SrF2 crystal will not… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  38. arXiv:2407.03119  [pdf, other

    quant-ph

    Entanglement-assisted authenticated BB84 protocol

    Authors: Pol Julià Farré, Vladlen Galetsky, Soham Ghosh, Janis Nötzel, Christian Deppe

    Abstract: This work delivers a novel user-server authentication procedure exploiting the features of maximally entangled pairs in both an idealistic noiseless scenario and a moderately noisy one. Additionally, we leverage the specific features of our design, which are conveniently suited for inlaying it into the well known BB84 quantum communication protocol. We first define a trivial extension of our initi… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures

  39. arXiv:2407.03110  [pdf, other

    cs.SD cs.AI eess.AS

    A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)

    Authors: Lam Pham, Phat Lam, Tin Nguyen, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we present a toolchain for a comprehensive audio/video analysis by leveraging deep learning based multimodal approach. To this end, different specific tasks of Speech to Text (S2T), Acoustic Scene Classification (ASC), Acoustic Event Detection (AED), Visual Object Detection (VOD), Image Captioning (IC), and Video Captioning (VC) are conducted and integrated into the toolchain. By co… ▽ More

    Submitted 2 May, 2024; originally announced July 2024.

  40. arXiv:2407.03093  [pdf, other

    cs.SE cs.AI cs.CR cs.LG

    Revisiting the Performance of Deep Learning-Based Vulnerability Detection on Realistic Datasets

    Authors: Partha Chakraborty, Krishna Kanth Arumugam, Mahmoud Alfadel, Meiyappan Nagappan, Shane McIntosh

    Abstract: The impact of software vulnerabilities on everyday software systems is significant. Despite deep learning models being proposed for vulnerability detection, their reliability is questionable. Prior evaluations show high recall/F1 scores of up to 99%, but these models underperform in practical scenarios, particularly when assessed on entire codebases rather than just the fixing commit. This paper i… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    ACM Class: D.2; I.2

    Journal ref: 10.1109/TSE.2024.3423712

  41. arXiv:2407.03091  [pdf, other

    cs.RO cs.NI

    Performance Comparison of ROS2 Middlewares for Multi-robot Mesh Networks in Planetary Exploration

    Authors: Loïck Pierre Chovet, Gabriel Manuel Garcia, Abhishek Bera, Antoine Richard, Kazuya Yoshida, Miguel Angel Olivares-Mendez

    Abstract: Recent advancements in Multi-Robot Systems (MRS) and mesh network technologies pave the way for innovative approaches to explore extreme environments. The Artemis Accords, a series of international agreements, have further catalyzed this progress by fostering cooperation in space exploration, emphasizing the use of cutting-edge technologies. In parallel, the widespread adoption of the Robot Operat… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: PrePrint

  42. arXiv:2407.03087  [pdf, other

    quant-ph

    Improved finite-size key rates for discrete-modulated continuous variable quantum key distribution under coherent attacks

    Authors: Carlos Pascual-García, Stefan Bäuml, Mateus Araújo, Rotem Liss, Antonio Acín

    Abstract: Continuous variable quantum key distribution (CVQKD) with discrete modulation combines advantages of CVQKD, such as the implementability using readily available technologies, with advantages of discrete variable quantum key distribution, such as easier error correction procedures. We consider a prepare-and-measure CVQKD protocol, where Alice chooses from a set of four coherent states and Bob perfo… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 24 pages, 5 figures

  43. arXiv:2407.03080  [pdf, other

    cs.LG cs.AI

    Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios

    Authors: Patricia A. Apellániz, Ana Jiménez, Borja Arroyo Galende, Juan Parras, Santiago Zazo

    Abstract: While synthetic tabular data generation using Deep Generative Models (DGMs) offers a compelling solution to data scarcity and privacy concerns, their effectiveness relies on substantial training data, often unavailable in real-world applications. This paper addresses this challenge by proposing a novel methodology for generating realistic and reliable synthetic tabular data with DGMs in limited re… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 19 pages, 6 Figures

    MSC Class: I.2.0

  44. arXiv:2407.03076  [pdf, other

    cs.CL

    A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

    Authors: Ramakrishna Appicharla, Baban Gain, Santanu Pal, Asif Ekbal, Pushpak Bhattacharyya

    Abstract: In document-level neural machine translation (DocNMT), multi-encoder approaches are common in encoding context and source sentences. Recent studies \cite{li-etal-2020-multi-encoder} have shown that the context encoder generates noise and makes the model robust to the choice of context. This paper further investigates this observation by explicitly modelling context encoding through multi-task lear… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted to EAMT 2024 (poster)

  45. arXiv:2407.03058  [pdf, other

    hep-lat cond-mat.str-el hep-th physics.comp-ph quant-ph

    Tensor Networks for Lattice Gauge Theories beyond one dimension: a Roadmap

    Authors: Giuseppe Magnifico, Giovanni Cataldi, Marco Rigobello, Peter Majcen, Daniel Jaschke, Pietro Silvi, Simone Montangero

    Abstract: Tensor network methods are a class of numerical tools and algorithms to study many-body quantum systems in and out of equilibrium, based on tailored variational wave functions. They have found significant applications in simulating lattice gauge theories approaching relevant problems in high-energy physics. Compared to Monte Carlo methods, they do not suffer from the sign problem, allowing them to… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 14 pages, 6 figures

  46. arXiv:2407.03034  [pdf, ps, other

    eess.IV cs.AI

    Attention Incorporated Network for Sharing Low-rank, Image and K-space Information during MR Image Reconstruction to Achieve Single Breath-hold Cardiac Cine Imaging

    Authors: Siying Xu, Kerstin Hammernik, Andreas Lingg, Jens Kuebler, Patrick Krumm, Daniel Rueckert, Sergios Gatidis, Thomas Kuestner

    Abstract: Cardiac Cine Magnetic Resonance Imaging (MRI) provides an accurate assessment of heart morphology and function in clinical practice. However, MRI requires long acquisition times, with recent deep learning-based methods showing great promise to accelerate imaging and enhance reconstruction quality. Existing networks exhibit some common limitations that constrain further acceleration possibilities,… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  47. arXiv:2407.03033  [pdf

    cs.CV

    ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation

    Authors: Chang Li, Pengfei Zhang, Yu Wang

    Abstract: Currently the semantic segmentation task of multispectral remotely sensed imagery (MSRSI) faces the following problems: 1) Usually, only single domain feature (i.e., space domain or frequency domain) is considered; 2) downsampling operation in encoder generally leads to the accuracy loss of edge extraction; 3) multichannel features of MSRSI are not fully considered; and 4) prior knowledge of remot… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  48. arXiv:2407.03032  [pdf, other

    cs.CL

    Strategies for Arabic Readability Modeling

    Authors: Juan Piñeros Liberato, Bashar Alhafni, Muhamed Al Khalil, Nizar Habash

    Abstract: Automatic readability assessment is relevant to building NLP applications for education, content analysis, and accessibility. However, Arabic readability assessment is a challenging task due to Arabic's morphological richness and limited readability resources. In this paper, we present a set of experimental results on Arabic readability assessment using a diverse range of approaches, from rule-bas… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted to ArabicNLP 2024, ACL

  49. arXiv:2407.03031  [pdf, other

    gr-qc

    Entangled pairs in evaporating black holes without event horizons

    Authors: Ivan Agullo, Paula Calizaya Cabrera, Beatriz Elizaga Navascués

    Abstract: Investigations into Hawking radiation often assume a black hole model featuring an event horizon, despite the growing consensus that such causal structures may not exist in nature. While this assumption is not crucial for deriving the local properties of radiation at future null infinity, it plays a significant role in discussions about Hawking partners -- the field modes that purify Hawking radia… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 31 pages, 5 figures, 3 appendices

  50. Differentially Processed Optimized Collaborative Rich Text Editor

    Authors: Nishtha Jatana, Mansehej Singh, Charu Gupta, Geetika Dhand, Shaily Malik, Pankaj Dadheech, Nagender Aneja, Sandhya Aneja

    Abstract: A collaborative real-time text editor is an application that allows multiple users to edit a document simultaneously and merge their contributions automatically. It can be made collaborative by implementing a conflict resolution algorithm either on the client side (in peer-to-peer collaboration) or on the server side (when using web sockets and a central server to monitor state changes). Although… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Journal ref: Multimedia Tools and Applications (2024)