Showing 1–50 of 110 results for author: Weber, M

  1. arXiv:2406.19509  [pdf]

    cs.DB

    Semantic orchestration and exploitation of material data: A dataspace solution demonstrated on steel and copper applications

    Authors: Yoav Nahshon, Lukas Morand, Matthias Büschelberger, Dirk Helm, Kiran Kumaraswamy, Paul Zierep, Matthias Weber, Pablo de Andrés

    Abstract: In the field of materials science and manufacturing, a vast amount of heterogeneous data exists, encompassing measurement and simulation data, machine data, publications, and more. This data serves as the bedrock of valuable knowledge that can be leveraged for various engineering applications. However, efficiently storing and handling such diverse data remain significantly challenging, often due t…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.07550  [pdf, other]

    cs.CV

    An Image is Worth 32 Tokens for Reconstruction and Generation

    Authors: Qihang Yu, Mark Weber, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen

    Abstract: Recent advancements in generative models have highlighted the crucial role of image tokenization in the efficient synthesis of high-resolution images. Tokenization, which transforms images into latent representations, reduces computational demands compared to directly processing pixels and enhances the effectiveness and efficiency of the generation process. Prior methods, such as VQGAN, typically…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: A compact 1D Image Tokenization method, leading to SOTA generation performance while being substantially faster. Project page at https://yucornetto.github.io/projects/titok.html

  3. arXiv:2406.01461  [pdf, other]

    cs.LG math.DG stat.ML

    Hardness of Learning Neural Networks under the Manifold Hypothesis

    Authors: Bobak T. Kiani, Jason Wang, Melanie Weber

    Abstract: The manifold hypothesis presumes that high-dimensional data lies on or near a low-dimensional manifold. While the utility of encoding geometric structure has been demonstrated empirically, rigorous analysis of its impact on the learnability of neural networks is largely missing. Several recent results have established hardness results for learning feedforward and equivariant neural networks under…

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.14477  [pdf, other]

    cs.LG cs.CV

    LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models

    Authors: Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber

    Abstract: Advances in latent diffusion models (LDMs) have revolutionized high-resolution image generation, but the design space of the autoencoder that is central to these systems remains underexplored. In this paper, we introduce LiteVAE, a family of autoencoders for LDMs that leverage the 2D discrete wavelet transform to enhance scalability and computational efficiency over standard variational autoencode…

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.03314  [pdf, other]

    cs.CV cs.LG

    Deep Learning-based Point Cloud Registration for Augmented Reality-guided Surgery

    Authors: Maximilian Weber, Daniel Wild, Jens Kleesiek, Jan Egger, Christina Gsaxner

    Abstract: Point cloud registration aligns 3D point clouds using spatial transformations. It is an important task in computer vision, with applications in areas such as augmented reality (AR) and medical imaging. This work explores the intersection of two research trends: the integration of AR into image-guided surgery and the use of deep learning for point cloud registration. The main objective is to evalua…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures; accepted at IEEE ISBI 2024

  6. arXiv:2404.19535  [pdf, other]

    physics.app-ph cs.ET

    Ferroelectrically-enhanced Schottky barrier transistors for Logic-in-Memory applications

    Authors: Daniele Nazzari, Lukas Wind, Masiar Sistani, Dominik Mayr, Kihye Kim, Walter M. Weber

    Abstract: Artificial neural networks (ANNs) have had an enormous impact on a multitude of sectors, from research to industry, generating an unprecedented demand for tailor-suited hardware platforms. Their training and execution is highly memory-intensive, clearly evidencing the limitations affecting the currently available hardware based on the von Neumann architecture, which requires frequent data shuttlin…

    Submitted 30 April, 2024; originally announced April 2024.

  7. arXiv:2404.19109  [pdf, other]

    cs.LG q-fin.GN

    The Shape of Money Laundering: Subgraph Representation Learning on the Blockchain with the Elliptic2 Dataset

    Authors: Claudio Bellei, Muhua Xu, Ross Phillips, Tom Robinson, Mark Weber, Tim Kaler, Charles E. Leiserson, Arvind, Jie Chen

    Abstract: Subgraph representation learning is a technique for analyzing local structures (or shapes) within complex networks. Enabled by recent developments in scalable Graph Neural Networks (GNNs), this approach encodes relational information at a subgroup level (multiple connected nodes) rather than at a node level of abstraction. We posit that certain domain applications, such as anti-money laundering (A…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  8. arXiv:2404.07654  [pdf, ps, other]

    cs.CL

    rollama: An R package for using generative large language models through Ollama

    Authors: Johannes B. Gruber, Maximilian Weber

    Abstract: rollama is an R package that wraps the Ollama API, which allows you to run different Generative Large Language Models (GLLM) locally. The package and learning material focus on making it easy to use Ollama for annotating textual or image data with open-source models as well as use these models for document embedding. But users can use or extend rollama to do essentially anything else that is pos…

    Submitted 11 April, 2024; originally announced April 2024.

  9. arXiv:2403.17778  [pdf, other]

    cs.AI cs.DB cs.DL

    Towards a FAIR Documentation of Workflows and Models in Applied Mathematics

    Authors: Marco Reidelbach, Björn Schembera, Marcus Weber

    Abstract: Modeling-Simulation-Optimization workflows play a fundamental role in applied mathematics. The Mathematical Research Data Initiative, MaRDI, responded to this by developing a FAIR and machine-interpretable template for a comprehensive documentation of such workflows. MaRDMO, a Plugin for the Research Data Management Organiser, enables scientists from diverse fields to document and publish their wo…

    Submitted 26 March, 2024; originally announced March 2024.

    ACM Class: H.3.3; H.3.7; E.0

  10. arXiv:2403.07621  [pdf, other]

    cs.CV

    Smartphone region-wise image indoor localization using deep learning for indoor tourist attraction

    Authors: Gabriel Toshio Hirokawa Higa, Rodrigo Stuqui Monzani, Jorge Fernando da Silva Cecatto, Maria Fernanda Balestieri Mariano de Souza, Vanessa Aparecida de Moraes Weber, Hemerson Pistori, Edson Takashi Matsubara

    Abstract: Smart indoor tourist attractions, such as smart museums and aquariums, usually require a significant investment in indoor localization devices. The smartphone Global Positional Systems use is unsuitable for scenarios where dense materials such as concrete and metal block weaken the GPS signals, which is the most common scenario in an indoor tourist attraction. Deep learning makes it possible to pe…

    Submitted 12 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  11. arXiv:2403.07137  [pdf, other]

    eess.IV cs.CV cs.LG

    Exploring Cluster Analysis in Nelore Cattle Visual Score Attribution

    Authors: Alexandre de Oliveira Bezerra, Rodrigo Goncalves Mateus, Vanessa Ap. de Moraes Weber, Fabricio de Lima Weber, Yasmin Alves de Arruda, Rodrigo da Costa Gomes, Gabriel Toshio Hirokawa Higa, Hemerson Pistori

    Abstract: Assessing the biotype of cattle through human visual inspection is a very common and important practice in precision cattle breeding. This paper presents the results of a correlation analysis between scores produced by humans for Nelore cattle and a variety of measurements that can be derived from images or other instruments. It also presents a study using the k-means algorithm to generate new way…

    Submitted 11 March, 2024; originally announced March 2024.

  12. I see an IC: A Mixed-Methods Approach to Study Human Problem-Solving Processes in Hardware Reverse Engineering

    Authors: René Walendy, Markus Weber, Jingjie Li, Steffen Becker, Carina Wiesen, Malte Elson, Younghyun Kim, Kassem Fawaz, Nikol Rummel, Christof Paar

    Abstract: Trust in digital systems depends on secure hardware, often assured through Hardware Reverse Engineering (HRE). This work develops methods for investigating human problem-solving processes in HRE, an underexplored yet critical aspect. Since reverse engineers rely heavily on visual information, eye tracking holds promise for studying their cognitive processes. To gain further insights, we additional…

    Submitted 23 February, 2024; originally announced February 2024.

  13. arXiv:2401.01869  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    On the hardness of learning under symmetries

    Authors: Bobak T. Kiani, Thien Le, Hannah Lawrence, Stefanie Jegelka, Melanie Weber

    Abstract: We study the problem of learning equivariant neural networks via gradient descent. The incorporation of known symmetries ("equivariance") into neural nets has empirically improved the performance of learning pipelines, in domains ranging from biology to computer vision. However, a rich yet separate line of learning theoretic research has demonstrated that actually learning shallow, fully-connected…

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 52 pages, 4 figures

  14. arXiv:2401.00284  [pdf, other]

    cs.CL

    Evaluation is all you need. Prompting Generative Large Language Models for Annotation Tasks in the Social Sciences. A Primer using Open Models

    Authors: Maximilian Weber, Merle Reichardt

    Abstract: This paper explores the use of open generative Large Language Models (LLMs) for annotation tasks in the social sciences. The study highlights the challenges associated with proprietary models, such as limited reproducibility and privacy concerns, and advocates for the adoption of open (source) models that can be operated on independent devices. Two examples of annotation tasks, sentiment analysis…

    Submitted 30 December, 2023; originally announced January 2024.

  15. arXiv:2312.10904  [pdf]

    cs.AI

    Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)

    Authors: Sabrina Toro, Anna V Anagnostopoulos, Sue Bello, Kai Blumberg, Rhiannon Cameron, Leigh Carmody, Alexander D Diehl, Damion Dooley, William Duncan, Petra Fey, Pascale Gaudet, Nomi L Harris, Marcin Joachimiak, Leila Kiani, Tiago Lubiana, Monica C Munoz-Torres, Shawn O'Neil, David Osumi-Sutherland, Aleix Puig, Justin P Reese, Leonore Reiser, Sofia Robb, Troy Ruemping, James Seager, Eric Sid, et al. (5 additional authors not shown)

    Abstract: Background: Ontologies are fundamental components of informatics infrastructure in domains such as biomedical, environmental, and food sciences, representing consensus knowledge in an accurate and computable form. However, their construction and maintenance demand substantial resources and necessitate substantial collaboration between domain experts, curators, and ontology experts. We present Dyna…

    Submitted 12 June, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

  16. arXiv:2312.10188  [pdf, other]

    cs.LG

    WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data

    Authors: Maurice Weber, Carlo Siebenschuh, Rory Butler, Anton Alexandrov, Valdemar Thanner, Georgios Tsolakis, Haris Jabbar, Ian Foster, Bo Li, Rick Stevens, Ce Zhang

    Abstract: We introduce WordScape, a novel pipeline for the creation of cross-disciplinary, multilingual corpora comprising millions of pages with annotations for document layout detection. Relating visual and textual items on document pages has gained further significance with the advent of multimodal models. Various approaches proved effective for visual question answering or layout segmentation. However,…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks

  17. arXiv:2312.06552  [pdf, other]

    cs.CE

    Open Data-Driven Automation of Residential Distribution Grid Modeling with Minimal Data Requirements

    Authors: Moritz Weber, Luc Janecke, Hüseyin K. Çakmak, Veit Hagenmeyer

    Abstract: In the present paper, we introduce a new method for the automated generation of residential distribution grid models based on novel building load estimation methods and a two-stage optimization for the generation of the 20 kV and 400 V grid topologies. Using the introduced load estimation methods, various open or proprietary data sources can be utilized to estimate the load of residential building…

    Submitted 12 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 11 pages, 12 figures, submitted to IEEE Transactions on Smart Grid

  18. arXiv:2311.14864  [pdf, other]

    cs.LG stat.ML

    Effective Structural Encodings via Local Curvature Profiles

    Authors: Lukas Fesser, Melanie Weber

    Abstract: Structural and Positional Encodings can significantly improve the performance of Graph Neural Networks in downstream tasks. Recent literature has begun to systematically investigate differences in the structural properties that these approaches encode, as well as performance trade-offs between them. However, the question of which structural properties yield the most effective encoding remains open…

    Submitted 13 March, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  19. arXiv:2311.07422  [pdf, other]

    cs.PL

    Sidekick compilation with xDSL

    Authors: Mathieu Fehr, Michel Weber, Christian Ulmann, Alexandre Lopoukhine, Martin Lücke, Théo Degioanni, Michel Steuwer, Tobias Grosser

    Abstract: Traditionally, compiler researchers either conduct experiments within an existing production compiler or develop their own prototype compiler; both options come with trade-offs. On one hand, prototyping in a production compiler can be cumbersome, as they are often optimized for program compilation speed at the expense of software simplicity and development speed. On the other hand, the transition…

    Submitted 16 June, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 14 pages, 15 figures; updated twice to include acknowledgements

  20. arXiv:2310.17347  [pdf, other]

    cs.CV

    CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling

    Authors: Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber

    Abstract: While conditional diffusion models are known to have good coverage of the data distribution, they still face limitations in output diversity, particularly when sampled with a high classifier-free guidance scale for optimal image quality or when trained on small datasets. We attribute this problem to the role of the conditioning signal in inference and offer an improved sampling strategy for diffus…

    Submitted 13 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICLR 2024

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR 2024)

  21. arXiv:2309.10048  [pdf]

    cs.HC

    What does ChatGPT know about natural science and engineering?

    Authors: Lukas Schulze Balhorn, Jana M. Weber, Stefan Buijsman, Julian R. Hildebrandt, Martina Ziefle, Artur M. Schweidtmann

    Abstract: ChatGPT is a powerful language model from OpenAI that is arguably able to comprehend and generate text. ChatGPT is expected to have a large impact on society, research, and education. An essential step to understand ChatGPT's expected impact is to study its domain-specific answering capabilities. Here, we perform a systematic empirical assessment of its abilities to answer questions across the nat…

    Submitted 18 September, 2023; originally announced September 2023.

  22. arXiv:2309.09384  [pdf, other]

    cs.LG stat.ML

    Mitigating Over-Smoothing and Over-Squashing using Augmentations of Forman-Ricci Curvature

    Authors: Lukas Fesser, Melanie Weber

    Abstract: While Graph Neural Networks (GNNs) have been successfully leveraged for learning on graph-structured data across domains, several potential pitfalls have been described recently. Those include the inability to accurately leverage information encoded in long-range connections (over-squashing), as well as difficulties distinguishing the learned representations of nearby nodes with growing network de…

    Submitted 19 March, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  23. arXiv:2309.05740  [pdf, other]

    cs.CR cs.HC

    REVERSIM: A Game-Based Environment to Study Human Aspects in Hardware Reverse Engineering

    Authors: Steffen Becker, René Walendy, Markus Weber, Carina Wiesen, Nikol Rummel, Christof Paar

    Abstract: Hardware Reverse Engineering (HRE) is a technique for analyzing Integrated Circuits (ICs). Experts employ HRE for security-critical tasks, such as detecting Trojans or intellectual property violations. They rely not only on their experience and customized tools but also on their cognitive abilities. Conducting controlled experiments to assess the cognitive processes involved in HRE can open new av…

    Submitted 24 March, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  24. Learning Control Policies for Variable Objectives from Offline Data

    Authors: Marc Weber, Phillip Swazinna, Daniel Hein, Steffen Udluft, Volkmar Sterzing

    Abstract: Offline reinforcement learning provides a viable approach to obtain advanced control strategies for dynamical systems, in particular when direct interaction with the environment is not available. In this paper, we introduce a conceptual extension for model-based policy search methods, called variable objective policy (VOP). With this approach, policies are trained to generalize efficiently over a…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 8 pages, 7 figures

    Journal ref: 2023 IEEE Symposium Series on Computational Intelligence

  25. arXiv:2307.10155  [pdf, other]

    cs.SI cs.DM cs.LG math.CO stat.ML

    Curvature-based Clustering on Graphs

    Authors: Yu Tian, Zachary Lubberts, Melanie Weber

    Abstract: Unsupervised node clustering (or community detection) is a classical graph learning task. In this paper, we study algorithms, which exploit the geometry of the graph to identify densely connected substructures, which form clusters or communities. Our method implements discrete Ricci curvatures and their associated geometric flows, under which the edge weights of the graph evolve to reveal its comm…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 65 pages, 19 figures

    MSC Class: 05C82; 05C10; 53C21; 68R10; 05C75

  26. arXiv:2307.02378  [pdf, other]

    math.DG cs.LG math.AP stat.ML

    Continuum Limits of Ollivier's Ricci Curvature on data clouds: pointwise consistency and global lower bounds

    Authors: Nicolas Garcia Trillos, Melanie Weber

    Abstract: Let $\mathcal{M} \subseteq \mathbb{R}^d$ denote a low-dimensional manifold and let $\mathcal{X}= \{ x_1, \dots, x_n \}$ be a collection of points uniformly sampled from $\mathcal{M}$. We study the relationship between the curvature of a random geometric graph built from $\mathcal{X}$ and the curvature of the manifold $\mathcal{M}$ via continuum limits of Ollivier's discrete Ricci curvature. We pro…

    Submitted 5 July, 2023; originally announced July 2023.

  27. arXiv:2306.15938  [pdf, other]

    cs.LG cs.AI cs.NI stat.AP

    Interpretable Anomaly Detection in Cellular Networks by Learning Concepts in Variational Autoencoders

    Authors: Amandeep Singh, Michael Weber, Markus Lange-Hegermann

    Abstract: This paper addresses the challenges of detecting anomalies in cellular networks in an interpretable way and proposes a new approach using variational autoencoders (VAEs) that learn interpretable representations of the latent space for each Key Performance Indicator (KPI) in the dataset. This enables the detection of anomalies based on reconstruction loss and Z-scores. We ensure the interpretabilit…

    Submitted 28 June, 2023; originally announced June 2023.

    ACM Class: C.2.m; C.2.3; G.3; I.2.6; I.5.3

  28. arXiv:2306.06474  [pdf, other]

    math.CO cs.DM math.MG

    Augmentations of Forman's Ricci Curvature and their Applications in Community Detection

    Authors: Lukas Fesser, Sergio Serrano de Haro Iváñez, Karel Devriendt, Melanie Weber, Renaud Lambiotte

    Abstract: The notion of curvature on graphs has recently gained traction in the networks community, with the Ollivier-Ricci curvature (ORC) in particular being used for several tasks in network analysis, such as community detection. In this work, we choose a different approach and study augmentations of the discretization of the Ricci curvature proposed by Forman (AFRC). We empirically and theoretically inv…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: 22 pages, 11 figures

    MSC Class: 05C82; 05C10 (Primary) 53C21; 68R10; 05C75 (Secondary)

  29. arXiv:2305.09011  [pdf, other]

    eess.IV cs.CV

    The Brain Tumor Segmentation (BraTS) Challenge 2023: Brain MR Image Synthesis for Tumor Segmentation (BraSyn)

    Authors: Hongwei Bran Li, Gian Marco Conte, Syed Muhammad Anwar, Florian Kofler, Ivan Ezhov, Koen van Leemput, Marie Piraud, Maria Diaz, Byrone Cole, Evan Calabrese, Jeff Rudie, Felix Meissen, Maruf Adewole, Anastasia Janas, Anahita Fathi Kazerooni, Dominic LaBella, Ahmed W. Moawad, Keyvan Farahani, James Eddy, Timothy Bergquist, Verena Chung, Russell Takeshi Shinohara, Farouk Dako, Walter Wiggins, Zachary Reitman, et al. (43 additional authors not shown)

    Abstract: Automated brain tumor segmentation methods have become well-established and reached performance levels offering clear clinical utility. These methods typically rely on four input magnetic resonance imaging (MRI) modalities: T1-weighted images with and without contrast enhancement, T2-weighted images, and FLAIR images. However, some sequences are often missing in clinical practice due to time const…

    Submitted 28 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Technical report of BraSyn

  30. arXiv:2305.08992  [pdf, other]

    eess.IV cs.CV cs.LG

    The Brain Tumor Segmentation (BraTS) Challenge 2023: Local Synthesis of Healthy Brain Tissue via Inpainting

    Authors: Florian Kofler, Felix Meissen, Felix Steinbauer, Robert Graf, Eva Oswald, Ezequiel de la Rosa, Hongwei Bran Li, Ujjwal Baid, Florian Hoelzl, Oezguen Turgut, Izabela Horvath, Diana Waldmannstetter, Christina Bukas, Maruf Adewole, Syed Muhammad Anwar, Anastasia Janas, Anahita Fathi Kazerooni, Dominic LaBella, Ahmed W Moawad, Keyvan Farahani, James Eddy, Timothy Bergquist, Verena Chung, Russell Takeshi Shinohara, Farouk Dako, et al. (43 additional authors not shown)

    Abstract: A myriad of algorithms for the automatic analysis of brain MR images is available to support clinicians in their decision-making. For brain tumor patients, the image acquisition time series typically starts with a scan that is already pathological. This poses problems, as many algorithms are designed to analyze healthy brains and provide no guarantees for images featuring lesions. Examples include…

    Submitted 9 August, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 5 pages, 1 figure

  31. arXiv:2304.12906  [pdf, other]

    cs.LG stat.ML

    The Score-Difference Flow for Implicit Generative Modeling

    Authors: Romann M. Weber

    Abstract: Implicit generative modeling (IGM) aims to produce samples of synthetic data matching the characteristics of a target data distribution. Recent work (e.g. score-matching networks, diffusion models) has approached the IGM problem from the perspective of pushing synthetic source data toward the target distribution via dynamical perturbations or flows in the ambient space. In this direction, we prese…

    Submitted 18 July, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 25 pages, 5 figures, 4 tables. To appear in Transactions on Machine Learning Research (TMLR)

    Journal ref: Transactions on Machine Learning Research (2023)

  32. arXiv:2303.13006  [pdf, other]

    cs.CV cs.GR cs.LG

    Controllable Inversion of Black-Box Face Recognition Models via Diffusion

    Authors: Manuel Kansy, Anton Raël, Graziana Mignone, Jacek Naruniec, Christopher Schroers, Markus Gross, Romann M. Weber

    Abstract: Face recognition models embed a face image into a low-dimensional identity vector containing abstract encodings of identity-specific facial features that allow individuals to be distinguished from one another. We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e. black-box setting). A variety of methods have been propose…

    Submitted 30 September, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: 8 pages main paper + 23 pages supplementary material. Moderate revisions from v1 (different template, added user study, wording). Presented at AMFG workshop at ICCV 2023. Project page: https://studios.disneyresearch.com/2023/10/02/controllable-inversion-of-black-box-face-recognition-models-via-diffusion/

    ACM Class: I.2; I.3.3; I.4

  33. arXiv:2303.02473  [pdf]

    cs.SI physics.soc-ph

    Disparity in the Evolving COVID-19 Collaboration Network

    Authors: Huimin Xu, Redoan Rahman, Ajay Jaiswal, Julia Fensel, Abhinav Peri, Ka-mesh Peri, Griffin M Weber, Ying Ding

    Abstract: The COVID 19 pandemic has paused many ongoing research projects and unified researchers' attention to focus on COVID 19 related issues. Our project traces 712294 scientists' publications related to COVID 19 for two years, from January 2020 to December 2021, to detect the dynamic evolution patterns of the COVID 19 collaboration network over time. By studying the collaboration network of COVID 19 sc…

    Submitted 4 March, 2023; originally announced March 2023.

  34. arXiv:2212.08204  [pdf, other]

    cs.CL cs.CY

    LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension

    Authors: Wenyue Hua, Yuchen Zhang, Zhe Chen, Josie Li, Melanie Weber

    Abstract: The application of Natural Language Processing (NLP) to specialized domains, such as the law, has recently received a surge of interest. As many legal services rely on processing and analyzing large collections of documents, automating such tasks with NLP tools emerges as a key challenge. Many popular language models, such as BERT or RoBERTa, are general-purpose models, which have limitations on p…

    Submitted 15 December, 2022; originally announced December 2022.

  35. arXiv:2211.16943  [pdf, other]

    quant-ph cs.LG

    Predicting Properties of Quantum Systems with Conditional Generative Models

    Authors: Haoxiang Wang, Maurice Weber, Josh Izaac, Cedric Yen-Yu Lin

    Abstract: Machine learning has emerged recently as a powerful tool for predicting properties of quantum many-body systems. For many ground states of gapped Hamiltonians, generative models can learn from measurements of a single quantum state to reconstruct the state accurately enough to predict local observables. Alternatively, classification and regression models can predict local observables by learning f…

    Submitted 3 March, 2024; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: 10 pages, 14 figures, 5 pages appendix. Open-source code is available at https://github.com/PennyLaneAI/generative-quantum-states

  36. arXiv:2208.10773  [pdf, other]

    cs.CV cs.CR cs.LG

    Adversarial Vulnerability of Temporal Feature Networks for Object Detection

    Authors: Svetlana Pavlitskaya, Nikolai Polley, Michael Weber, J. Marius Zöllner

    Abstract: Taking into account information across the temporal domain helps to improve environment perception in autonomous driving. However, it has not been studied so far whether temporally fused neural networks are vulnerable to deliberately generated perturbations, i.e. adversarial attacks, or whether temporal history is an inherent defense against them. In this work, we study whether temporal feature ne…

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at ECCV 2022 SAIAD workshop

  37. arXiv:2208.05013  [pdf, other]

    math.OC cs.CC cs.DS math.FA

    Computing Brascamp-Lieb Constants through the lens of Thompson Geometry

    Authors: Melanie Weber, Suvrit Sra

    Abstract: This paper studies algorithms for efficiently computing Brascamp-Lieb constants, a task that has recently received much interest. In particular, we reduce the computation to a nonlinear matrix-valued iteration, whose convergence we analyze through the lens of fixed-point methods under the well-known Thompson metric. This approach permits us to obtain (weakly) polynomial time guarantees, and it off…

    Submitted 14 April, 2024; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Under Review

    MSC Class: 46N10; 49Q99; 53Z50; 68W40

  38. Physical Pooling Functions in Graph Neural Networks for Molecular Property Prediction

    Authors: Artur M. Schweidtmann, Jan G. Rittig, Jana M. Weber, Martin Grohe, Manuel Dahmen, Kai Leonhard, Alexander Mitsos

    Abstract: Graph neural networks (GNNs) are emerging in chemical engineering for the end-to-end learning of physicochemical properties based on molecular graphs. A key element of GNNs is the pooling function which combines atom feature vectors into molecular fingerprints. Most previous works use a standard pooling function to predict a variety of properties. However, unsuitable pooling functions can lead to…

    Submitted 27 July, 2022; originally announced July 2022.

    Journal ref: Computers and Chemical Engineering Volume 172, April 2023, 108202

  39. arXiv:2206.11426  [pdf, other]

    math.OC cs.LG

    On a class of geodesically convex optimization problems solved via Euclidean MM methods

    Authors: Melanie Weber, Suvrit Sra

    Abstract: We study geodesically convex (g-convex) problems that can be written as a difference of Euclidean convex functions. This structure arises in several optimization problems in statistics and machine learning, e.g., for matrix scaling, M-estimators for covariances, and Brascamp-Lieb inequalities. Our work offers efficient algorithms that on the one hand exploit g-convexity to ensure global optimality…

    Submitted 20 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Under Review

  40. Graph Machine Learning for Design of High-Octane Fuels

    Authors: Jan G. Rittig, Martin Ritzert, Artur M. Schweidtmann, Stefanie Winkler, Jana M. Weber, Philipp Morsch, K. Alexander Heufer, Martin Grohe, Alexander Mitsos, Manuel Dahmen

    Abstract: Fuels with high-knock resistance enable modern spark-ignition engines to achieve high efficiency and thus low CO2 emissions. Identification of molecules with desired autoignition properties indicated by a high research octane number and a high octane sensitivity is therefore of great practical relevance and can be supported by computer-aided molecular design (CAMD). Recent developments in the fiel…

    Submitted 14 October, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: manuscript (26 pages, 9 figures, 2 tables), supporting information (12 pages, 8 figures, 1 table)

    Journal ref: AIChE Journal 69 (4), e17971, 2023

  41. arXiv:2205.15494  [pdf, other]

    cs.LG cs.CY

    Certifying Some Distributional Fairness with Subpopulation Decomposition

    Authors: Mintong Kang, Linyi Li, Maurice Weber, Yang Liu, Ce Zhang, Bo Li

    Abstract: Extensive efforts have been made to understand and improve the fairness of machine learning models based on observational metrics, especially in high-stakes domains such as medical insurance, education, and hiring decisions. However, there is a lack of certified fairness considering the end-to-end performance of an ML model. In this paper, we first formulate the certified fairness of an ML model t…

    Submitted 18 November, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022, 38 pages, 11 pages for the main text

  42. arXiv:2203.12560  [pdf, other]

    cs.CV

    DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation

    Authors: Aysim Toker, Lukas Kondmann, Mark Weber, Marvin Eisenberger, Andrés Camero, Jingliang Hu, Ariadna Pregel Hoderlein, Çağlar Şenaras, Timothy Davis, Daniel Cremers, Giovanni Marchisio, Xiao Xiang Zhu, Laura Leal-Taixé

    Abstract: Earth observation is a fundamental tool for monitoring the evolution of land use in specific areas of interest. Observing and precisely defining change, in this context, requires both time-series data and pixel-wise segmentations. To that end, we propose the DynamicEarthNet dataset that consists of daily, multi-spectral satellite observations of 75 selected areas of interest distributed over the g…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022, evaluation webpage: https://codalab.lisn.upsaclay.fr/competitions/2882

  43. arXiv:2202.01679  [pdf, other]

    cs.LG

    Certifying Out-of-Domain Generalization for Blackbox Functions

    Authors: Maurice Weber, Linyi Li, Boxin Wang, Zhikuan Zhao, Bo Li, Ce Zhang

    Abstract: Certifying the robustness of model performance under bounded data distribution drifts has recently attracted intensive interest under the umbrella of distributional robustness. However, existing techniques either make strong assumptions on the model class and loss functions that can be certified, such as smoothness expressed via Lipschitz continuity of gradients, or require to solve complex optimi…

    Submitted 30 July, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 39th International Conference on Machine Learning (ICML) 2022

  44. arXiv:2201.07032  [pdf, other]

    cs.AI math.RA

    The Mathematics of Comparing Objects

    Authors: Marcus Weber, Konstantin Fackeldey

    Abstract: "After reading two different crime stories, an artificial intelligence concludes that in both stories the police has found the murderer just by random." -- To what extent, and under which assumptions, is this a description of a realistic scenario?

    Submitted 30 March, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    MSC Class: 06Exx; ACM Class: J.5

  45. arXiv:2112.10074  [pdf, other]

    eess.IV cs.CV cs.LG

    QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results

    Authors: Raghav Mehta, Angelos Filos, Ujjwal Baid, Chiharu Sako, Richard McKinley, Michael Rebsamen, Katrin Datwyler, Raphael Meier, Piotr Radojewski, Gowtham Krishnan Murugesan, Sahil Nalawade, Chandan Ganesh, Ben Wagner, Fang F. Yu, Baowei Fei, Ananth J. Madhuranthakam, Joseph A. Maldjian, Laura Daza, Catalina Gomez, Pablo Arbelaez, Chengliang Dai, Shuo Wang, Hadrien Reynaud, Yuan-han Mo, Elsa Angelini, et al. (67 additional authors not shown)

    Abstract: Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying…

    Submitted 23 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): https://www.melba-journal.org/papers/2022:026.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  46. arXiv:2109.09946  [pdf, other]

    cs.CY cs.AI cs.LG stat.ML

    Identifying biases in legal data: An algorithmic fairness perspective

    Authors: Jackson Sargent, Melanie Weber

    Abstract: The need to address representation biases and sentencing disparities in legal case data has long been recognized. Here, we study the problem of identifying and measuring biases in large-scale legal case data from an algorithmic fairness perspective. Our approach utilizes two regression models: A baseline that represents the decisions of a "typical" judge as given by the data and a "fair" judge tha…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: EEAMO 2021

    ACM Class: K.4; K.5

  47. arXiv:2107.02314  [pdf, other]

    cs.CV

    The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic Classification

    Authors: Ujjwal Baid, Satyam Ghodasara, Suyash Mohan, Michel Bilello, Evan Calabrese, Errol Colak, Keyvan Farahani, Jayashree Kalpathy-Cramer, Felipe C. Kitamura, Sarthak Pati, Luciano M. Prevedello, Jeffrey D. Rudie, Chiharu Sako, Russell T. Shinohara, Timothy Bergquist, Rong Chai, James Eddy, Julia Elliott, Walter Reade, Thomas Schaffter, Thomas Yu, Jiaxin Zheng, Ahmed W. Moawad, Luiz Otavio Coelho, Olivia McDonnell, et al. (78 additional authors not shown)

    Abstract: The BraTS 2021 challenge celebrates its 10th anniversary and is jointly organized by the Radiological Society of North America (RSNA), the American Society of Neuroradiology (ASNR), and the Medical Image Computing and Computer Assisted Interventions (MICCAI) society. Since its inception, BraTS has been focusing on being a common benchmarking venue for brain glioma segmentation algorithms, with wel…

    Submitted 12 September, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: 19 pages, 2 figures, 1 table

  48. arXiv:2106.09748  [pdf, other]

    cs.CV

    DeepLab2: A TensorFlow Library for Deep Labeling

    Authors: Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen

    Abstract: DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a state-of-the-art and easy-to-use TensorFlow codebase for general dense pixel prediction problems in computer vision. DeepLab2 includes all our recently developed DeepLab model variants with pretrained checkpoints as well as model training and evaluation code, allowing the community to reproduce and further improve upon the sta…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 4-page technical report. The first three authors contributed equally to this work

  49. arXiv:2106.02045  [pdf, other]

    cs.DC math.NA physics.ins-det

    Least-squares fitting of Gaussian spots on graphics processing units

    Authors: Marcel Leutenegger, Michael Weber

    Abstract: The investigation of samples with a spatial resolution in the nanometer range relies on the precise and stable positioning of the sample. Due to inherent mechanical instabilities of typical sample stages in optical microscopes, it is usually required to control and/or monitor the sample position during the acquisition. The tracking of sparsely distributed fiducial markers at high speed allows stab…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 16 pages

  50. arXiv:2105.05874  [pdf, other]

    eess.IV cs.CV

    The Federated Tumor Segmentation (FeTS) Challenge

    Authors: Sarthak Pati, Ujjwal Baid, Maximilian Zenk, Brandon Edwards, Micah Sheller, G. Anthony Reina, Patrick Foley, Alexey Gruzdev, Jason Martin, Shadi Albarqouni, Yong Chen, Russell Taki Shinohara, Annika Reinke, David Zimmerer, John B. Freymann, Justin S. Kirby, Christos Davatzikos, Rivka R. Colen, Aikaterini Kotrotsou, Daniel Marcus, Mikhail Milchenko, Arash Nazeri, Hassan Fathallah-Shaykh, Roland Wiest, Andras Jakab, et al. (7 additional authors not shown)

    Abstract: This manuscript describes the first challenge on Federated Learning, namely the Federated Tumor Segmentation (FeTS) challenge 2021. International challenges have become the standard for validation of biomedical image analysis methods. However, the actual performance of participating (even the winning) algorithms on "real-world" clinical data often remains unclear, as the data included in challenge…

    Submitted 13 May, 2021; v1 submitted 12 May, 2021; originally announced May 2021.