Showing 1–38 of 38 results for author: Arnold, S

Searching in archive cs.
  1. arXiv:2407.00764  [pdf, other]

    cs.CL cs.AI

    Characterizing Stereotypical Bias from Privacy-preserving Pre-Training

    Authors: Stefan Arnold, Rene Gröbner, Annika Schreiner

    Abstract: Differential Privacy (DP) can be applied to raw text by exploiting the spatial arrangement of words in an embedding space. We investigate the implications of such text privatization on Language Models (LMs) and their tendency towards stereotypical associations. Since previous studies documented that linguistic proficiency correlates with stereotypical bias, one could assume that techniques for tex…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.18620  [pdf, other]

    cs.DL cs.AI

    Documentation Practices of Artificial Intelligence

    Authors: Stefan Arnold, Dilara Yesilbas, Rene Gröbner, Dominik Riedelbauch, Maik Horn, Sven Weinzierl

    Abstract: Artificial Intelligence (AI) faces persistent challenges in terms of transparency and accountability, which requires rigorous documentation. Through a literature review on documentation practices, we provide an overview of prevailing trends, persistent issues, and the multifaceted interplay of factors influencing the documentation. Our examination of key characteristics such as scope, target audie…

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.13121  [pdf, other]

    cs.CL cs.AI cs.IR

    Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

    Authors: Jinhyuk Lee, Anthony Chen, Zhuyun Dai, Dheeru Dua, Devendra Singh Sachan, Michael Boratko, Yi Luan, Sébastien M. R. Arnold, Vincent Perot, Siddharth Dalmia, Hexiang Hu, Xudong Lin, Panupong Pasupat, Aida Amini, Jeremy R. Cole, Sebastian Riedel, Iftekhar Naim, Ming-Wei Chang, Kelvin Guu

    Abstract: Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire corpora of information offers numerous advantages. It enhances user-friendliness by eliminating the need for specialized knowledge of tools, provides robust end-to-…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 29 pages. Dataset available at https://github.com/google-deepmind/loft

  4. arXiv:2404.12631  [pdf]

    cs.NE cs.AI

    Breaching the Bottleneck: Evolutionary Transition from Reward-Driven Learning to Reward-Agnostic Domain-Adapted Learning in Neuromodulated Neural Nets

    Authors: Solvi Arnold, Reiji Suzuki, Takaya Arita, Kimitoshi Yamazaki

    Abstract: Advanced biological intelligence learns efficiently from an information-rich stream of stimulus information, even when feedback on behaviour quality is sparse or absent. Such learning exploits implicit assumptions about task domains. We refer to such learning as Domain-Adapted Learning (DAL). In contrast, AI learning algorithms rely on explicit externally provided measures of behaviour quality to…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures

    ACM Class: I.2.6

  5. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  6. arXiv:2402.17937  [pdf, other]

    cs.RO

    Can an LLM-Powered Socially Assistive Robot Effectively and Safely Deliver Cognitive Behavioral Therapy? A Study With University Students

    Authors: Mina J. Kian, Mingyu Zong, Katrin Fischer, Abhyuday Singh, Anna-Maria Velentza, Pau Sang, Shriya Upadhyay, Anika Gupta, Misha A. Faruki, Wallace Browning, Sebastien M. R. Arnold, Bhaskar Krishnamachari, Maja J. Mataric

    Abstract: Cognitive behavioral therapy (CBT) is a widely used therapeutic method for guiding individuals toward restructuring their thinking patterns as a means of addressing anxiety, depression, and other challenges. We developed a large language model (LLM)-powered prompt-engineered socially assistive robot (SAR) that guides participants through interactive CBT at-home exercises. We evaluated the performa…

    Submitted 27 February, 2024; originally announced February 2024.

  7. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  8. arXiv:2310.11363  [pdf, other]

    cs.CL

    Disentangling the Linguistic Competence of Privacy-Preserving BERT

    Authors: Stefan Arnold, Nils Kemmerzell, Annika Schreiner

    Abstract: Differential Privacy (DP) has been tailored to address the unique challenges of text-to-text privatization. However, text-to-text privatization is known for degrading the performance of language models when trained on perturbed text. Employing a series of interpretation techniques on the internal representations extracted from BERT trained on perturbed pre-text, we intend to disentangle at the lin…

    Submitted 17 October, 2023; originally announced October 2023.

  9. arXiv:2310.07899  [pdf, other]

    cs.AI cs.RO

    RoboCLIP: One Demonstration is Enough to Learn Robot Policies

    Authors: Sumedh A Sontakke, Jesse Zhang, Sébastien M. R. Arnold, Karl Pertsch, Erdem Bıyık, Dorsa Sadigh, Chelsea Finn, Laurent Itti

    Abstract: Reward specification is a notoriously difficult problem in reinforcement learning, requiring extensive expert supervision to design robust reward functions. Imitation learning (IL) methods attempt to circumvent these problems by utilizing expert demonstrations but typically require a large number of in-domain expert demonstrations. Inspired by advances in the field of Video-and-Language Models (VL…

    Submitted 11 October, 2023; originally announced October 2023.

  10. arXiv:2306.01471  [pdf, other]

    cs.CL cs.CR cs.LG

    Guiding Text-to-Text Privatization by Syntax

    Authors: Stefan Arnold, Dilara Yesilbas, Sven Weinzierl

    Abstract: Metric Differential Privacy is a generalization of differential privacy tailored to address the unique challenges of text-to-text privatization. By adding noise to the representation of words in the geometric space of embeddings, words are replaced with words located in the proximity of the noisy representation. Since embeddings are trained based on word co-occurrences, this mechanism ensures that…

    Submitted 2 June, 2023; originally announced June 2023.

  11. arXiv:2306.01457  [pdf, other]

    cs.CL cs.LG

    Driving Context into Text-to-Text Privatization

    Authors: Stefan Arnold, Dilara Yesilbas, Sven Weinzierl

    Abstract: Metric Differential Privacy enables text-to-text privatization by adding calibrated noise to the vector of a word derived from an embedding space and projecting this noisy vector back to a discrete vocabulary using a nearest neighbor search. Since words are substituted without context, this mechanism is expected to fall short at finding substitutes for words with ambiguous meanings, such…

    Submitted 2 June, 2023; originally announced June 2023.

  12. arXiv:2305.01827  [pdf, other]

    eess.IV cs.CV cs.LG

    Cortical analysis of heterogeneous clinical brain MRI scans for large-scale neuroimaging studies

    Authors: Karthik Gopinath, Douglas N. Greve, Sudeshna Das, Steve Arnold, Colin Magdamo, Juan Eugenio Iglesias

    Abstract: Surface analysis of the cortex is ubiquitous in human neuroimaging with MRI, e.g., for cortical registration, parcellation, or thickness estimation. The convoluted cortical geometry requires isotropic scans (e.g., 1mm MPRAGEs) and good gray-white matter contrast for 3D reconstruction. This precludes the analysis of most brain MRI scans acquired for clinical purposes. Analyzing such scans would ena…

    Submitted 2 May, 2023; originally announced May 2023.

  13. arXiv:2302.06009  [pdf, other]

    cs.LG cs.CV

    Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL

    Authors: Sébastien M. R. Arnold, Fei Sha

    Abstract: We study how to transfer representations pretrained on source tasks to target tasks in visual percept based RL. We analyze two popular approaches: freezing or finetuning the pretrained representations. Empirical studies on a set of popular tasks reveal several properties of pretrained representations. First, finetuning is required even when pretrained representations perfectly capture the informat…

    Submitted 12 February, 2023; originally announced February 2023.

  14. A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

    Authors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee, et al. (22 additional authors not shown)

    Abstract: Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through th…

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in Neural Networks

  15. arXiv:2210.07436  [pdf, other]

    cs.CV cs.HC cs.LG

    Smart Headset, Computer Vision and Machine Learning for Efficient Prawn Farm Management

    Authors: Mingze Xi, Ashfaqur Rahman, Chuong Nguyen, Stuart Arnold, John McCulloch

    Abstract: Understanding the growth and distribution of the prawns is critical for optimising the feed and harvest strategies. An inadequate understanding of prawn growth can lead to reduced financial gain, for example, crops are harvested too early. The key to maintaining a good understanding of prawn growth is frequent sampling. However, the most commonly adopted sampling practice, the cast net approach, i…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Submitted to Elsevier Aquacultural Engineering

    ACM Class: I.4; J.0

  16. Robust machine learning segmentation for large-scale analysis of heterogeneous clinical brain MRI datasets

    Authors: Benjamin Billot, Colin Magdamo, You Cheng, Steven E. Arnold, Sudeshna Das, Juan. E. Iglesias

    Abstract: Every year, millions of brain MRI scans are acquired in hospitals, which is a figure considerably larger than the size of any research dataset. Therefore, the ability to analyse such scans could transform neuroimaging research. Yet, their potential remains untapped, since no automated algorithm is robust enough to cope with the high variability in clinical acquisitions (MR contrasts, resolutions,…

    Submitted 4 January, 2023; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: under review, extension of MICCAI 2022 paper

  17. arXiv:2206.10920  [pdf]

    cs.RO cs.AI

    Recognising Affordances in Predicted Futures to Plan with Consideration of Non-canonical Affordance Effects

    Authors: Solvi Arnold, Mami Kuroishi, Tadashi Adachi, Kimitoshi Yamazaki

    Abstract: We propose a novel system for action sequence planning based on a combination of affordance recognition and a neural forward model predicting the effects of affordance execution. By performing affordance recognition on predicted futures, we avoid reliance on explicit affordance effect definitions for multi-step planning. Because the system learns affordance effects from experience data, the system…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 8 pages, 8 figures, video: http://youtu.be/4naJ5IghHcg

    ACM Class: I.2.9; I.2.6

  18. arXiv:2206.02840  [pdf, other]

    cs.RO cs.CV

    Spatial Acoustic Projection for 3D Imaging Sonar Reconstruction

    Authors: Sascha Arnold, Bilal Wehbe

    Abstract: In this work we present a novel method for reconstructing 3D surfaces using a multi-beam imaging sonar. We integrate the intensities measured by the sonar from different viewpoints for fixed cell positions in a 3D grid. For each cell we integrate a feature vector that holds the mean intensity for a discretized range of viewpoints. Based on the feature vectors and independent sparse range measureme…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Preprint

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA) 2022

  19. Deep Learning for Prawn Farming: Forecasting and Anomaly Detection

    Authors: Joel Janek Dabrowski, Ashfaqur Rahman, Andrew Hellicar, Mashud Rana, Stuart Arnold

    Abstract: We present a decision support system for managing water quality in prawn ponds. The system uses various sources of data and deep learning models in a novel way to provide 24-hour forecasting and anomaly detection of water quality parameters. It provides prawn farmers with tools to proactively avoid a poor growing environment, thereby optimising growth and reducing the risk of losing stock. This is…

    Submitted 12 May, 2022; originally announced May 2022.

    Journal ref: Advances in Knowledge Discovery and Data Mining. PAKDD 2022. Lecture Notes in Computer Science, vol 13282. Springer, Cham

  20. arXiv:2203.01969  [pdf, other]

    eess.IV cs.CV

    Robust Segmentation of Brain MRI in the Wild with Hierarchical CNNs and no Retraining

    Authors: Benjamin Billot, Magdamo Colin, Sean E. Arnold, Sudeshna Das, Juan. E. Iglesias

    Abstract: Retrospective analysis of brain MRI scans acquired in the clinic has the potential to enable neuroimaging studies with sample sizes much larger than those found in research datasets. However, analysing such clinical images "in the wild" is challenging, since subjects are scanned with highly variable protocols (MR contrast, resolution, orientation, etc.). Nevertheless, recent advances in convolutio…

    Submitted 4 January, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: MICCAI 2022

  21. arXiv:2202.07808  [pdf, other]

    cs.LG

    Policy Learning and Evaluation with Randomized Quasi-Monte Carlo

    Authors: Sebastien M. R. Arnold, Pierre L'Ecuyer, Liyu Chen, Yi-fan Chen, Fei Sha

    Abstract: Reinforcement learning constantly deals with hard integrals, for example when computing expectations in policy evaluation and policy iteration. These integrals are rarely analytically solvable and typically estimated with the Monte Carlo method, which induces high variance in policy values and gradients. In this work, we propose to replace Monte Carlo samples with low-discrepancy point sets. We co…

    Submitted 21 February, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022 camera ready; more info at: http://seba1511.net/projects/qrl/

  22. arXiv:2108.01662  [pdf, other]

    cs.LG cs.AI cs.CV

    Uniform Sampling over Episode Difficulty

    Authors: Sébastien M. R. Arnold, Guneet S. Dhillon, Avinash Ravichandran, Stefano Soatto

    Abstract: Episodic training is a core ingredient of few-shot learning to train models on tasks with limited labelled data. Despite its success, episodic training remains largely understudied, prompting us to ask the question: what is the best way to sample episodes? In this paper, we first propose a method to approximate episode sampling distributions based on their difficulty. Building on this method, we p…

    Submitted 15 January, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: NeurIPS'21 camera ready

  23. arXiv:2108.00775  [pdf, other]

    cs.IR cs.CL

    Self-supervised Answer Retrieval on Clinical Notes

    Authors: Paul Grundmann, Sebastian Arnold, Alexander Löser

    Abstract: Retrieving answer passages from long documents is a complex task requiring semantic understanding of both discourse and document context. We approach this challenge specifically in a clinical scenario, where doctors retrieve cohorts of patients based on diagnoses and other latent medical aspects. We introduce CAPR, a rule-based self-supervision objective for training Transformer language models fo…

    Submitted 2 August, 2021; originally announced August 2021.

  24. arXiv:2104.07255  [pdf, other]

    cs.LG cs.CV

    Embedding Adaptation is Still Needed for Few-Shot Learning

    Authors: Sébastien M. R. Arnold, Fei Sha

    Abstract: Constructing new and more challenging tasksets is a fruitful methodology to analyse and understand few-shot classification methods. Unfortunately, existing approaches to building those tasksets are somewhat unsatisfactory: they either assume train and test task distributions to be identical -- which leads to overly optimistic evaluations -- or take a "worst-case" philosophy -- which typically requ…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: In submission

  25. arXiv:2103.11226  [pdf, other]

    cs.LG

    Demystifying the Effects of Non-Independence in Federated Learning

    Authors: Stefan Arnold, Dilara Yesilbas

    Abstract: Federated Learning (FL) enables statistical models to be built on user-generated data without compromising data security and user privacy. For this reason, FL is well suited for on-device learning from mobile devices where data is abundant and highly privatized. Constrained by the temporal availability of mobile devices, only a subset of devices is accessible to participate in the iterative protoc…

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: 8 pages, 7 figures

  26. arXiv:2103.08137  [pdf]

    cs.RO cs.AI

    Cloth Manipulation Planning on Basis of Mesh Representations with Incomplete Domain Knowledge and Voxel-to-Mesh Estimation

    Authors: Solvi Arnold, Daisuke Tanaka, Kimitoshi Yamazaki

    Abstract: We consider the problem of open-goal planning for robotic cloth manipulation. Core of our system is a neural network trained as a forward model of cloth behaviour under manipulation, with planning performed through backpropagation. We introduce a neural network-based routine for estimating mesh representations from voxel input, and perform planning in mesh format internally. We address the problem…

    Submitted 12 November, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: 27 pages, 13 figures

  27. arXiv:2008.12284  [pdf, ps, other]

    cs.LG cs.CV cs.RO stat.ML

    learn2learn: A Library for Meta-Learning Research

    Authors: Sébastien M. R. Arnold, Praateek Mahajan, Debajyoti Datta, Ian Bunner, Konstantinos Saitas Zarkias

    Abstract: Meta-learning researchers face two fundamental issues in their empirical work: prototyping and reproducibility. Researchers are prone to make mistakes when prototyping new algorithms and tasks because modern meta-learning methods rely on unconventional functionalities of machine learning frameworks. In turn, reproducing existing results becomes a tedious endeavour -- a situation exacerbated by the…

    Submitted 27 August, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Software available at: https://github.com/learnables/learn2learn

  28. Learning Contextualized Document Representations for Healthcare Answer Retrieval

    Authors: Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A. Gers, Alexander Löser

    Abstract: We present Contextual Discourse Vectors (CDV), a distributed document representation for efficient answer retrieval from long healthcare documents. Our approach is based on structured query tuples of entities and aspects from free text and medical taxonomies. Our model leverages a dual encoder architecture with hierarchical LSTM layers and multi-task training to encode the position of clinical ent…

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: The Web Conference 2020 (WWW '20)

  29. arXiv:1910.13603  [pdf, other]

    cs.LG stat.ML

    When MAML Can Adapt Fast and How to Assist When It Cannot

    Authors: Sébastien M. R. Arnold, Shariq Iqbal, Fei Sha

    Abstract: Model-Agnostic Meta-Learning (MAML) and its variants have achieved success in meta-learning tasks on many datasets and settings. On the other hand, we have just started to understand and analyze how they are able to adapt fast to new tasks. For example, one popular hypothesis is that the algorithms learn good representations for transfer, as in multi-task learning. In this work, we contribute by p…

    Submitted 24 January, 2021; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at AISTATS 2021

  30. arXiv:1910.01249  [pdf, other]

    cs.LG stat.ML

    Analyzing the Variance of Policy Gradient Estimators for the Linear-Quadratic Regulator

    Authors: James A. Preiss, Sébastien M. R. Arnold, Chen-Yu Wei, Marius Kloft

    Abstract: We study the variance of the REINFORCE policy gradient estimator in environments with continuous state and action spaces, linear dynamics, quadratic cost, and Gaussian noise. These simple environments allow us to derive bounds on the estimator variance in terms of the environment and noise parameters. We compare the predictions of our bounds to the empirical variance in simulation experiments.

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 Workshop on Optimization Foundations for Reinforcement Learning. 7 pages + 6 pages appendix

  31. arXiv:1906.03532  [pdf, other]

    cs.LG math.OC stat.ML

    Reducing the variance in online optimization by transporting past gradients

    Authors: Sébastien M. R. Arnold, Pierre-Antoine Manzagol, Reza Babanezhad, Ioannis Mitliagkas, Nicolas Le Roux

    Abstract: Most stochastic optimization methods use gradients once before discarding them. While variance reduction methods have shown that reusing past gradients can be beneficial when there is a finite number of datapoints, they do not easily extend to the online setting. One issue is the staleness due to using past gradients. We propose to correct this staleness using the idea of implicit gradient transpo…

    Submitted 18 June, 2019; v1 submitted 8 June, 2019; originally announced June 2019.

    Comments: Open-source implementation available at: https://github.com/seba-1511/igt.pth

  32. arXiv:1902.04793  [pdf, other]

    cs.CL

    SECTOR: A Neural Model for Coherent Topic Segmentation and Classification

    Authors: Sebastian Arnold, Rudolf Schneider, Philippe Cudré-Mauroux, Felix A. Gers, Alexander Löser

    Abstract: When searching for information, a human reader first glances over a document, spots relevant sections and then focuses on a few sentences for resolving her intention. However, the high variance of document structure complicates to identify the salient topic of a given section at a glance. To tackle this challenge, we present SECTOR, a model to support machine reading systems by segmenting document…

    Submitted 13 February, 2019; originally announced February 2019.

    Comments: Author's final version, accepted for publication at TACL, 2019

  33. arXiv:1805.08011  [pdf, other]

    cs.RO

    Robust Model-Aided Inertial Localization for Autonomous Underwater Vehicles

    Authors: Sascha Arnold, Lashika Medagoda

    Abstract: This paper presents a manifold based Unscented Kalman Filter that applies a novel strategy for inertial, model-aiding and Acoustic Doppler Current Profiler (ADCP) measurement incorporation. The filter is capable of observing and utilizing the Earth rotation for heading estimation with a tactical grade IMU, and utilizes information from the vehicle model during DVL drop outs. The drag and thrust mo…

    Submitted 27 November, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2018, Accepted

  34. arXiv:1709.05070  [pdf, other]

    cs.LG cs.RO

    Shapechanger: Environments for Transfer Learning

    Authors: Sébastien M. R. Arnold, Tsam Kiu Pun, Théo-Tim J. Denisart, Francisco J. Valero-Cuevas

    Abstract: We present Shapechanger, a library for transfer reinforcement learning specifically designed for robotic tasks. We consider three types of knowledge transfer---from simulation to simulation, from simulation to real, and from real to real---and a wide range of tasks with continuous states and actions. Shapechanger is under active development and open-sourced at: https://github.com/seba-1511/shapech…

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: Presented at the SoCal 2017 Robotics Symposium

  35. arXiv:1709.05069  [pdf, other]

    cs.LG

    Accelerating SGD for Distributed Deep-Learning Using Approximated Hessian Matrix

    Authors: Sébastien M. R. Arnold, Chunming Wang

    Abstract: We introduce a novel method to compute a rank $m$ approximation of the inverse of the Hessian matrix in the distributed regime. By leveraging the differences in gradients and parameters of multiple Workers, we are able to efficiently implement a distributed approximation of the Newton-Raphson method. We also present preliminary results which underline advantages and challenges of second-order meth…

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: ICLR17 Workshop Track

  36. arXiv:1609.03666  [pdf, other]

    cs.LG stat.ML

    A Greedy Algorithm to Cluster Specialists

    Authors: Sébastien Arnold

    Abstract: Several recent deep neural networks experiments leverage the generalist-specialist paradigm for classification. However, no formal study compared the performance of different clustering algorithms for class assignment. In this paper we perform such a study, suggest slight modifications to the clustering procedures, and propose a novel algorithm designed to optimize the performance of the specia…

    Submitted 12 September, 2016; originally announced September 2016.

  37. arXiv:1608.06757  [pdf, other]

    cs.CL

    Robust Named Entity Recognition in Idiosyncratic Domains

    Authors: Sebastian Arnold, Felix A. Gers, Torsten Kilias, Alexander Löser

    Abstract: Named entity recognition often fails in idiosyncratic domains. That causes a problem for depending tasks, such as entity linking and relation extraction. We propose a generic and robust approach for high-recall named entity recognition. Our approach is easy to train and offers strong generalization over diverse domain-specific language, such as news documents (e.g. Reuters) or biomedical text (e.g…

    Submitted 24 August, 2016; originally announced August 2016.

    Comments: 8 pages, 1 figure

  38. On Cloud-based Oversubscription

    Authors: Rachel Householder, Scott Arnold, Robert Green

    Abstract: Rising trends in the number of customers turning to the cloud for their computing needs has made effective resource allocation imperative for cloud service providers. In order to maximize profits and reduce waste, providers have started to explore the role of oversubscribing cloud resources. However, the benefits of cloud-based oversubscription are not without inherent risks. This paper attempts t…

    Submitted 5 March, 2014; v1 submitted 19 February, 2014; originally announced February 2014.

    Comments: 7 pages, 3 figures

    Journal ref: International Journal of Engineering Trends and Technology(IJETT), V8(8),425-431 February 2014. ISSN:2231-5381