Skip to main content

Showing 1–50 of 428 results for author: Milan

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00179  [pdf, other

    cs.GR

    Standardized Data-Parallel Rendering Using ANARI

    Authors: Ingo Wald, Stefan Zellmann, Jefferson Amstutz, Qi Wu, Kevin Griffin, Milan Jaros, Stefan Wesner

    Abstract: We propose and discuss a paradigm that allows for expressing \emph{data-parallel} rendering with the classically non-parallel ANARI API. We propose this as a new standard for data-parallel sci-vis rendering, describe two different implementations of this paradigm, and use multiple sample integrations into existing apps to show how easy it is to adopt this paradigm, and what can be gained from doin… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2406.12502  [pdf, other

    cs.CL

    Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency

    Authors: Leonidas Gee, Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci

    Abstract: Code Language Models have been trained to generate accurate solutions, typically with no regard for runtime. On the other hand, previous works that explored execution optimisation have observed corresponding drops in functional correctness. To that end, we introduce Code-Optimise, a framework that incorporates both correctness (passed, failed) and runtime (quick, slow) as learning signals via self… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Under review at ARR (for EMNLP 2024)

  3. arXiv:2406.12422  [pdf, other

    cs.CL

    Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of Czech

    Authors: Milan Straka, Jana Straková

    Abstract: We present an open-source web service for Czech morphosyntactic analysis. The system combines a deep learning model with rescoring by a high-precision morphological dictionary at inference time. We show that our hybrid method surpasses two competitive baselines: While the deep learning model ensures generalization for out-of-vocabulary words and better disambiguation, an improvement over an existi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to TSD 2024

  4. arXiv:2405.20994  [pdf, other

    cs.IR cs.CL

    CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking

    Authors: Josef Vonášek, Milan Straka, Rostislav Krč, Lenka Lasoňová, Ekaterina Egorova, Jana Straková, Jakub Náplava

    Abstract: We present CWRCzech, Click Web Ranking dataset for Czech, a 100M query-document Czech click dataset for relevance ranking with user behavior data collected from search engine logs of Seznam.cz. To the best of our knowledge, CWRCzech is the largest click dataset with raw text published so far. It provides document positions in the search results as well as information about user behavior: 27.6M cli… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted to SIGIR 2024

  5. arXiv:2405.14058  [pdf, other

    cs.AI cs.LG eess.SY

    Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates

    Authors: Udayan Mandal, Guy Amir, Haoze Wu, Ieva Daukantas, Fletcher Lee Newell, Umberto J. Ravaioli, Baoluo Meng, Michael Durling, Milan Ganai, Tobey Shim, Guy Katz, Clark Barrett

    Abstract: Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the "black box" nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functions… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  6. arXiv:2405.13583  [pdf, other

    cs.LO

    Tools at the Frontiers of Quantitative Verification

    Authors: Roman Andriushchenko, Alexander Bork, Carlos E. Budde, Milan Češka, Kush Grover, Ernst Moritz Hahn, Arnd Hartmanns, Bryant Israelsen, Nils Jansen, Joshua Jeppson, Sebastian Junges, Maximilian A. Köhl, Bettina Könighofer, Jan Křetínský, Tobias Meggendorfer, David Parker, Stefan Pranger, Tim Quatmann, Enno Ruijters, Landon Taylor, Matthias Volk, Maximilian Weininger, Zhen Zhang

    Abstract: The analysis of formal models that include quantitative aspects such as timing or probabilistic choices is performed by quantitative verification tools. Broad and mature tool support is available for computing basic properties such as expected rewards on basic models such as Markov chains. Previous editions of QComp, the comparison of tools for the analysis of quantitative formal models, focused o… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  7. arXiv:2405.09948  [pdf, other

    cs.CL

    Mitigating Text Toxicity with Counterfactual Generation

    Authors: Milan Bhan, Jean-Noel Vittaut, Nina Achache, Victor Legrand, Nicolas Chesneau, Annabelle Blangero, Juliette Murris, Marie-Jeanne Lesot

    Abstract: Toxicity mitigation consists in rephrasing text in order to remove offensive or harmful meaning. Neural natural language processing (NLP) models have been widely used to target and mitigate textual toxicity. However, existing methods fail to detoxify text while preserving the initial non-toxic meaning at the same time. In this work, we propose to apply counterfactual generation methods from the eX… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  8. arXiv:2405.09186  [pdf, other

    cs.CL

    HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants

    Authors: Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci

    Abstract: Language models (LMs) as conversational assistants recently became popular tools that help people accomplish a variety of tasks. These typically result from adapting LMs pretrained on general domain text sequences through further instruction-tuning and possibly preference optimisation methods. The evaluation of such LMs would ideally be performed using human judgement, however, this is not scalabl… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted to NACCL 2024 main conference

  9. arXiv:2405.03069  [pdf, ps, other

    math.LO cs.AI cs.CC cs.LO

    On Probabilistic and Causal Reasoning with Summation Operators

    Authors: Duligur Ibeling, Thomas F. Icard, Milan Mossé

    Abstract: Ibeling et al. (2023). axiomatize increasingly expressive languages of causation and probability, and Mosse et al. (2024) show that reasoning (specifically the satisfiability problem) in each causal language is as difficult, from a computational complexity perspective, as reasoning in its merely probabilistic or "correlational" counterpart. Introducing a summation operator to capture common device… ▽ More

    Submitted 18 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  10. arXiv:2405.02770  [pdf, other

    cs.LG

    PhilHumans: Benchmarking Machine Learning for Personal Health

    Authors: Vadim Liventsev, Vivek Kumar, Allmin Pradhap Singh Susaiyah, Zixiu Wu, Ivan Rodin, Asfand Yaar, Simone Balloccu, Marharyta Beraziuk, Sebastiano Battiato, Giovanni Maria Farinella, Aki Härmä, Rim Helaoui, Milan Petkovic, Diego Reforgiato Recupero, Ehud Reiter, Daniele Riboni, Raymond Sterling

    Abstract: The use of machine learning in Healthcare has the potential to improve patient outcomes as well as broaden the reach and affordability of Healthcare. The history of other application areas indicates that strong benchmarks are essential for the development of intelligent systems. We present Personal Health Interfaces Leveraging HUman-MAchine Natural interactions (PhilHumans), a holistic suite of be… ▽ More

    Submitted 16 May, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

  11. arXiv:2404.14202  [pdf, other

    cs.LG stat.ML

    An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints

    Authors: Jung-hun Kim, Milan Vojnovic, Se-Young Yun

    Abstract: In this study, we consider the infinitely many-armed bandit problems in a rested rotting setting, where the mean reward of an arm may decrease with each pull, while otherwise, it remains unchanged. We explore two scenarios regarding the rotting of rewards: one in which the cumulative amount of rotting is bounded by $V_T$, referred to as the slow-rotting case, and the other in which the cumulative… ▽ More

    Submitted 24 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  12. arXiv:2404.14167  [pdf, other

    cs.RO

    A multi-robot system for the detection of explosive devices

    Authors: Ken Hasselmann, Mario Malizia, Rafael Caballero, Fabio Polisano, Shashank Govindaraj, Jakob Stigler, Oleksii Ilchenko, Milan Bajic, Geert De Cubber

    Abstract: In order to clear the world of the threat posed by landmines and other explosive devices, robotic systems can play an important role. However, the development of such field robots that need to operate in hazardous conditions requires the careful consideration of multiple aspects related to the perception, mobility, and collaboration capabilities of the system. In the framework of a European challe… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Journal ref: IEEE ICRA Workshop on Field Robotics 2024

  13. arXiv:2404.10271  [pdf, other

    cs.LG cs.AI cs.CL cs.CY cs.GT

    Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback

    Authors: Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mossé, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, William S. Zwicker

    Abstract: Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as hel** to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level prin… ▽ More

    Submitted 4 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 15 pages, 4 figures

    MSC Class: 68T01; 68T50; 91B14; 91B12 ACM Class: I.2.0; I.2.7; K.4.2; I.2.m; J.4

  14. arXiv:2404.05839  [pdf, other

    cs.CL

    ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin

    Authors: Milan Straka, Jana Straková, Federica Gamba

    Abstract: We present LatinPipe, the winning submission to the EvaLatin 2024 Dependency Parsing shared task. Our system consists of a fine-tuned concatenation of base and large pre-trained LMs, with a dot-product attention head for parsing and softmax classification heads for morphology to jointly learn both dependency parsing and morphological analysis. It is trained by sampling from seven publicly availabl… ▽ More

    Submitted 29 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted to LT4HALA 2024

  15. arXiv:2404.03555  [pdf, other

    cs.CL

    From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization

    Authors: Botond Barta, Dorina Lakatos, Attila Nagy, Milán Konor Nyist, Judit Ács

    Abstract: Training summarization models requires substantial amounts of training data. However for less resourceful languages like Hungarian, openly available models and datasets are notably scarce. To address this gap our paper introduces HunSum-2 an open-source Hungarian corpus suitable for training abstractive and extractive summarization models. The dataset is assembled from segments of the Common Crawl… ▽ More

    Submitted 12 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  16. arXiv:2404.02840  [pdf, ps, other

    cs.DC

    A Survey on Error-Bounded Lossy Compression for Scientific Datasets

    Authors: Sheng Di, **yang Liu, Kai Zhao, Xin Liang, Robert Underwood, Zhaorui Zhang, Milan Shah, Yafan Huang, Jiajun Huang, Xiaodong Yu, Congrong Ren, Hanqi Guo, Grant Wilkins, Dingwen Tao, Jiannan Tian, Sian **, Zizhe Jian, Daoce Wang, MD Hasanur Rahman, Boyuan Zhang, Jon C. Calhoun, Guanpeng Li, Kazutomo Yoshii, Khalid Ayed Alharthi, Franck Cappello

    Abstract: Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. Many error-bounded lossy compressors have been developed for a wide range of parallel and distributed use cases for years. These lossy compressors are designed with distinct compression models and design principles, such that each… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: submitted to ACM Computing journal, requited to be 35 pages including references

  17. arXiv:2403.14489  [pdf, other

    cs.CV

    Adversary-Robust Graph-Based Learning of WSIs

    Authors: Saba Heidari Gheshlaghi, Milan Aryal, Nasim Yahyasoltani, Masoud Ganji

    Abstract: Enhancing the robustness of deep learning models against adversarial attacks is crucial, especially in critical domains like healthcare where significant financial interests heighten the risk of such attacks. Whole slide images (WSIs) are high-resolution, digitized versions of tissue samples mounted on glass slides, scanned using sophisticated imaging equipment. The digital analysis of WSIs presen… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  18. arXiv:2403.13763  [pdf, other

    cs.CV

    Practical End-to-End Optical Music Recognition for Pianoform Music

    Authors: Jiří Mayer, Milan Straka, Jan Hajič jr., Pavel Pecina

    Abstract: The majority of recent progress in Optical Music Recognition (OMR) has been achieved with Deep Learning methods, especially models following the end-to-end paradigm, reading input images and producing a linear sequence of tokens. Unfortunately, many music scores, especially piano music, cannot be easily converted to a linear sequence. This has led OMR researchers to use custom linearized encodings… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 15+4 pages, 6 figures

    ACM Class: I.4.9; J.5

  19. arXiv:2403.08843  [pdf, other

    cs.AI

    Fuzzy Fault Trees Formalized

    Authors: Thi Kim Nhung Dang, Milan Lopuhaä-Zwakenberg, Mariëlle Stoelinga

    Abstract: Fault tree analysis is a vital method of assessing safety risks. It helps to identify potential causes of accidents, assess their likelihood and severity, and suggest preventive measures. Quantitative analysis of fault trees is often done via the dependability metrics that compute the system's failure behaviour over time. However, the lack of precise data is a major obstacle to quantitative analys… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 14 pages

  20. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  21. arXiv:2402.16996  [pdf, other

    cs.HC cs.LG cs.SD eess.AS q-bio.NC

    Towards Decoding Brain Activity During Passive Listening of Speech

    Authors: Milán András Fodor, Tamás Gábor Csapó, Frigyes Viktor Arthur

    Abstract: The aim of the study is to investigate the complex mechanisms of speech perception and ultimately decode the electrical changes in the brain accruing while listening to speech. We attempt to decode heard speech from intracranial electroencephalographic (iEEG) data using deep learning methods. The goal is to aid the advancement of brain-computer interface (BCI) technology for speech synthesis, and,… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 27 pages, 7 figures

  22. arXiv:2402.14053  [pdf, other

    math.CO cs.IT

    Self-adhesivity in lattices of abstract conditional independence models

    Authors: Tobias Boege, Janneke H. Bolt, Milan Studený

    Abstract: We introduce an algebraic concept of the frame for abstract conditional independence (CI) models, together with basic operations with respect to which such a frame should be closed: copying and marginalization. Three standard examples of such frames are (discrete) probabilistic CI structures, semi-graphoids and structural semi-graphoids. We concentrate on those frames which are closed under the op… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 32 pages, 4 figures

    MSC Class: 62B10 (primary) 06A15; 68T27; 68V05; 05B35 (secondary)

  23. arXiv:2402.12038  [pdf, other

    cs.LG cs.CL

    Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations

    Authors: Milan Bhan, Jean-Noel Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot

    Abstract: Incorporating natural language rationales in the prompt and In-Context Learning (ICL) have led to a significant improvement of Large Language Models (LLMs) performance. However, generating high-quality rationales require human-annotation or the use of auxiliary proxy models. In this work, we propose Self-AMPLIFY to automatically generate rationales from post hoc explanation methods applied to Smal… ▽ More

    Submitted 17 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  24. arXiv:2402.11044  [pdf, other

    math.CO cs.CG

    Star-Forest Decompositions of Complete Graphs

    Authors: Todor Antić, Jelena Glišić, Milan Milivojčević

    Abstract: We deal with the problem of decomposing a complete geometric graph into plane star-forests. In particular, we disprove a recent conjecture by Pach, Saghafian and Schnider by constructing for each $n$ a complete geometric graph on $n$ vertices which can be decomposed into $\frac{n}{2}+1$ plane star-forests. Additionally we prove that for even $n$, every decomposition of complete abstract graph on… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 12 pages, 6 figures

    MSC Class: 05C10 (Primary) 68R10 (Secondary) ACM Class: G.2

  25. arXiv:2402.01155  [pdf, other

    cs.CL

    CABINET: Content Relevance based Noise Reduction for Table Question Answering

    Authors: Sohan Patnaik, Heril Changwal, Milan Aggarwal, Sumit Bhatia, Yaman Kumar, Balaji Krishnamurthy

    Abstract: Table understanding capability of Large Language Models (LLMs) has been extensively studied through the task of question-answering (QA) over tables. Typically, only a small part of the whole table is relevant to derive the answer for a given question. The irrelevant parts act as noise and are distracting information, resulting in sub-optimal performance due to the vulnerability of LLMs to noise. T… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (spotlight)

  26. arXiv:2401.16986  [pdf, other

    stat.ML cs.LG

    Causal Machine Learning for Cost-Effective Allocation of Development Aid

    Authors: Milan Kuzmanovic, Dennis Frauen, Tobias Hatt, Stefan Feuerriegel

    Abstract: The Sustainable Development Goals (SDGs) of the United Nations provide a blueprint of a better future by 'leaving no one behind', and, to achieve the SDGs by 2030, poor countries require immense volumes of development aid. In this paper, we develop a causal machine learning framework for predicting heterogeneous treatment effects of aid disbursements to inform effective aid allocation. Specificall… ▽ More

    Submitted 15 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  27. arXiv:2401.16015  [pdf

    cs.LO

    Querying Fault and Attack Trees: Property Specification on a Water Network

    Authors: Stefano M. Nicoletti, Milan Lopuhaä-Zwakenberg, E. Moritz Hahn, Mariëlle Stoelinga

    Abstract: We provide an overview of three different query languages whose objective is to specify properties on the highly popular formalisms of fault trees (FTs) and attack trees (ATs). These are BFL, a Boolean Logic for FTs, PFL, a probabilistic extension of BFL and ATM, a logic for security metrics on ATs. We validate the framework composed by these three logics by applying them to the case study of a wa… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  28. arXiv:2401.12346  [pdf, other

    cs.CR

    Fuzzy quantitative attack tree analysis

    Authors: Thi Kim Nhung Dang, Milan Lopuhaä-Zwakenberg, Mariëlle Stoelinga

    Abstract: Attack trees are important for security, as they help to identify weaknesses and vulnerabilities in a system. Quantitative attack tree analysis supports a number security metrics, which formulate important KPIs such as the shortest, most likely and cheapest attacks. A key bottleneck in quantitative analysis is that the values are usually not known exactly, due to insufficient data and/or lack of… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 23 pages, 6 figures, FASE2024

  29. Automation of Triangle Ruler-and-Compass Constructions Using Constraint Solvers

    Authors: Milan Banković

    Abstract: In this paper, we present an approach to automated solving of triangle ruler-and-compass construction problems using finite-domain constraint solvers. The constraint model is described in the MiniZinc modeling language, and is based on the automated planning. The main benefit of using general constraint solvers for such purpose, instead of develo** dedicated tools, is that we can rely on the… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: In Proceedings ADG 2023, arXiv:2401.10725

    Journal ref: EPTCS 398, 2024, pp. 62-72

  30. arXiv:2401.10008  [pdf, ps, other

    cs.CR math.CT

    Attack tree metrics are operad algebras

    Authors: Milan Lopuhaä-Zwakenberg

    Abstract: Attack Trees (ATs) are a widely used tool for security analysis. ATs can be employed in quantitative security analysis through metrics, which assign a security value to an AT. Many different AT metrics exist, and there exist multiple general definitions that aim to study a wide variety of AT metrics at once. However, these all have drawbacks: they do not capture all metrics, and they do not easily… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  31. arXiv:2401.03461  [pdf

    cs.CY cs.AI

    Amplification of Addictive New Media Features in the Metaverse

    Authors: Ljubisa Bojic, Joerg Matthes, Milan Cabarkapa

    Abstract: The emergence of the metaverse, envisioned as a hyperreal virtual universe facilitating boundless human interaction, stands to revolutionize our conception of media, with significant impacts on addiction, creativity, relationships, and social polarization. This paper aims to dissect the addictive potential of the metaverse due to its immersive and interactive features, scrutinize the effects of it… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 1 figure, 1 table

  32. arXiv:2401.02940  [pdf, other

    quant-ph cs.LG

    Digital-analog quantum learning on Rydberg atom arrays

    Authors: Jonathan Z. Lu, Lucy Jiao, Kristina Wolinski, Milan Kornjača, Hong-Ye Hu, Sergio Cantu, Fangli Liu, Susanne F. Yelin, Sheng-Tao Wang

    Abstract: We propose hybrid digital-analog learning algorithms on Rydberg atom arrays, combining the potentially practical utility and near-term realizability of quantum learning with the rapidly scaling architectures of neutral atoms. Our construction requires only single-qubit operations in the digital setting and global driving according to the Rydberg Hamiltonian in the analog setting. We perform a comp… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 22 pages, 20 figures

  33. arXiv:2312.13927  [pdf, other

    cs.LG cs.AI

    On the Convergence of Loss and Uncertainty-based Active Learning Algorithms

    Authors: Daniel Haimovich, Dima Karamshuk, Fridolin Linder, Niek Tax, Milan Vojnovic

    Abstract: We investigate the convergence rates and data sample sizes required for training a machine learning model using a stochastic gradient descent (SGD) algorithm, where data points are sampled based on either their loss value or uncertainty value. These training methods are particularly relevant for active learning and data subset selection problems. For SGD with a constant step size update, we presen… ▽ More

    Submitted 11 June, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  34. arXiv:2312.12649  [pdf, other

    eess.IV cs.CV

    Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation

    Authors: Fahim Ahmed Zaman, Mathews Jacob, Amanda Chang, Kan Liu, Milan Sonka, Xiaodong Wu

    Abstract: Diffusion models have shown impressive performance for image generation, often times outperforming other generative models. Since their introduction, researchers have extended the powerful noise-to-image denoising pipeline to discriminative tasks, including image segmentation. In this work we propose a conditional score-based generative modeling framework for medical image segmentation which relie… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, conference

  35. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  36. arXiv:2312.11176  [pdf, other

    cs.LG cs.CE math.NA

    Harnessing the Power of Neural Operators with Automatically Encoded Conservation Laws

    Authors: Ning Liu, Yiming Fan, Xianyi Zeng, Milan Klöwer, Lu Zhang, Yue Yu

    Abstract: Neural operators (NOs) have emerged as effective tools for modeling complex physical systems in scientific machine learning. In NOs, a central characteristic is to learn the governing physical laws directly from data. In contrast to other machine learning applications, partial knowledge is often known a priori about the physical system at hand whereby quantities such as mass, energy and momentum a… ▽ More

    Submitted 4 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  37. arXiv:2312.09545  [pdf

    cs.CL cs.AI cs.CY

    GPT-4 Surpassing Human Performance in Linguistic Pragmatics

    Authors: Ljubisa Bojic, Predrag Kovacevic, Milan Cabarkapa

    Abstract: As Large Language Models (LLMs) become increasingly integrated into everyday life, their capabilities to understand and emulate human cognition are under steady examination. This study investigates the ability of LLMs to comprehend and interpret linguistic pragmatics, an aspect of communication that considers context and implied meanings. Using Grice's communication principles, LLMs and human subj… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 19 pages, 1 figure, 2 tables

  38. Performance evaluation of Private and Public Blockchains for multi-cloud service federation

    Authors: Adam Zahir, Milan Groshev, Kiril Antevski, Carlos J. Bernardos, Constantine Ayimba, Antonio de la Oliva

    Abstract: The stringent low-latency, high reliability, availability and resilience requirements of 6G use cases will present challenges to cloud providers. Currently, cloud providers lack simple, efficient, and secure implementation of provisioning solutions that meet these challenges. Multi-cloud federation is a promising approach. In this paper, we evaluate the application of private and public blockchain… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, conference

  39. arXiv:2312.07947  [pdf, other

    cs.CR

    Adaptive Differentially Quantized Subspace Perturbation (ADQSP): A Unified Framework for Privacy-Preserving Distributed Average Consensus

    Authors: Qiongxiu Li, Jaron Skovsted Gundersen, Milan Lopuhaa-Zwakenberg, Richard Heusdens

    Abstract: Privacy-preserving distributed average consensus has received significant attention recently due to its wide applicability. Based on the achieved performances, existing approaches can be broadly classified into perfect accuracy-prioritized approaches such as secure multiparty computation (SMPC), and worst-case privacy-prioritized approaches such as differential privacy (DP). Methods of the first c… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  40. arXiv:2312.07192  [pdf, other

    cs.NI cs.RO

    waveSLAM: Empowering Accurate Indoor Map** Using Off-the-Shelf Millimeter-wave Self-sensing

    Authors: Pablo Picazo, Milan Groshev, Alejandro Blanco, Claudio Fiandrino, Antonio de la Oliva, Joerg Widmer

    Abstract: This paper presents the design, implementation and evaluation of waveSLAM, a low-cost mobile robot system that uses the millimetre wave (mmWave) communication devices to enhance the indoor map** process targeting environments with reduced visibility or glass/mirror walls. A unique feature of waveSLAM is that it only leverages existing Commercial-Off-The-Shelf (COTS) hardware (Lidar and mmWave ra… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Journal ref: VTC FALL 2023

  41. arXiv:2312.05836  [pdf, other

    cs.DS

    Fault tree reliability analysis via squarefree polynomials

    Authors: Milan Lopuhaä-Zwakenberg

    Abstract: Fault tree (FT) analysis is a prominent risk assessment method in industrial systems. Unreliability is one of the key safety metrics in quantitative FT analysis. Existing algorithms for unreliability analysis are based on binary decision diagrams, for which it is hard to give time complexity guarantees beyond a worst-case exponential bound. In this paper, we present a novel method to calculate FT… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: To be presented at MODELSWARD 2024

  42. arXiv:2311.17791  [pdf, other

    eess.IV cs.CV

    U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation

    Authors: Yaopeng Peng, Milan Sonka, Danny Z. Chen

    Abstract: In this paper, we introduce U-Net v2, a new robust and efficient U-Net variant for medical image segmentation. It aims to augment the infusion of semantic information into low-level features while simultaneously refining high-level features with finer details. For an input image, we begin by extracting multi-level features with a deep neural network encoder. Next, we enhance the feature map of eac… ▽ More

    Submitted 30 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  43. arXiv:2311.17243  [pdf, other

    cs.CV eess.IV

    PHG-Net: Persistent Homology Guided Medical Image Classification

    Authors: Yaopeng Peng, Hongxiao Wang, Milan Sonka, Danny Z. Chen

    Abstract: Modern deep neural networks have achieved great successes in medical image analysis. However, the features captured by convolutional neural networks (CNNs) or Transformers tend to be optimized for pixel intensities and neglect key anatomical structures such as connected components and loops. In this paper, we propose a persistent homology guided approach (PHG-Net) that explores topological feature… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted by WACV 2024

  44. arXiv:2311.16712  [pdf, other

    q-bio.QM cs.DC

    Onedata4Sci: Life science data management solution based on Onedata

    Authors: Tomáš Svoboda, Tomáš Raček, Josef Handl, Jozef Sabo, Adrián Rošinec, Łukasz Opioła, Wojciech Jesionek, Milan Ešner, Markéta Pernisová, Natallia Madzia Valasevich, Aleš Křenek, Radka Svobodová

    Abstract: Life-science experimental methods generate vast and ever-increasing volumes of data, which provide highly valuable research resources. However, management of these data is nontrivial and applicable software solutions are currently subject to intensive development. The solutions mainly fall into one of the two groups: general data management systems (e.g. Onedata, iRODS, B2SHARE, CERNBox) or very s… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  45. arXiv:2311.14391  [pdf, other

    cs.CL

    ÚFAL CorPipe at CRAC 2023: Larger Context Improves Multilingual Coreference Resolution

    Authors: Milan Straka

    Abstract: We present CorPipe, the winning entry to the CRAC 2023 Shared Task on Multilingual Coreference Resolution. Our system is an improved version of our earlier multilingual coreference pipeline, and it surpasses other participants by a large margin of 4.5 percent points. CorPipe first performs mention detection, followed by coreference linking via an antecedent-maximization approach on the retrieved s… ▽ More

    Submitted 7 December, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted to CRAC 2023 (the Sixth Workshop on Computational Models of Reference, Anaphora and Coreference)

  46. arXiv:2311.07222  [pdf, other

    physics.ao-ph cs.LG physics.comp-ph

    Neural General Circulation Models for Weather and Climate

    Authors: Dmitrii Kochkov, Janni Yuval, Ian Langmore, Peter Norgaard, Jamie Smith, Griffin Mooers, Milan Klöwer, James Lottes, Stephan Rasp, Peter Düben, Sam Hatfield, Peter Battaglia, Alvaro Sanchez-Gonzalez, Matthew Willson, Michael P. Brenner, Stephan Hoyer

    Abstract: General circulation models (GCMs) are the foundation of weather and climate prediction. GCMs are physics-based simulators which combine a numerical solver for large-scale dynamics with tuned representations for small-scale processes such as cloud formation. Recently, machine learning (ML) models trained on reanalysis data achieved comparable or better skill than GCMs for deterministic weather fore… ▽ More

    Submitted 7 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 92 pages, 54 figures

  47. arXiv:2310.18192  [pdf, other

    eess.IV cs.CV

    Artifact-Robust Graph-Based Learning in Digital Pathology

    Authors: Saba Heidari Gheshlaghi, Milan Aryal, Nasim Yahyasoltani, Masoud Ganji

    Abstract: Whole slide images~(WSIs) are digitized images of tissues placed in glass slides using advanced scanners. The digital processing of WSIs is challenging as they are gigapixel images and stored in multi-resolution format. A common challenge with WSIs is that perturbations/artifacts are inevitable during storing the glass slides and digitizing them. These perturbations include motion, which often ari… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  48. arXiv:2310.16999  [pdf, other

    cs.CV cs.LG eess.IV

    Trust, but Verify: Robust Image Segmentation using Deep Learning

    Authors: Fahim Ahmed Zaman, Xiaodong Wu, Weiyu Xu, Milan Sonka, Raghuraman Mudumbai

    Abstract: We describe a method for verifying the output of a deep neural network for medical image segmentation that is robust to several classes of random as well as worst-case perturbations i.e. adversarial attacks. This method is based on a general approach recently developed by the authors called "Trust, but Verify" wherein an auxiliary verification network produces predictions about certain masked feat… ▽ More

    Submitted 19 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 5 Pages, 8 Figures, conference

  49. arXiv:2310.12892  [pdf, other

    cs.CL

    A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems

    Authors: Songbo Hu, Han Zhou, Moy Yuan, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Anna Korhonen, Ivan Vulić

    Abstract: Achieving robust language technologies that can perform well across the world's many languages is a central goal of multilingual NLP. In this work, we take stock of and empirically analyse task performance disparities that exist between multilingual task-oriented dialogue (ToD) systems. We first define new quantitative measures of absolute and relative equivalence in system performance, capturing… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  50. arXiv:2310.06203  [pdf, ps, other

    math.CO cs.DM

    Graphs with three and four distinct eigenvalues based on circulants

    Authors: Milan Bašić

    Abstract: In this paper, we aim to address the open questions raised in various recent papers regarding characterization of circulant graphs with three or four distinct eigenvalues in their spectra. Our focus is on providing characterizations and constructing classes of graphs falling under this specific category. We present a characterization of circulant graphs with prime number order and unitary Cayley g… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 24 pages

    MSC Class: 05C50; 05E30; 11A07; 11A15; 11A25