Skip to main content

Showing 1–8 of 8 results for author: Barker, D

Searching in archive cs. Search in all archives.
.
  1. Thelxinoë: Recognizing Human Emotions Using Pupillometry and Machine Learning

    Authors: Darlene Barker, Haim Levkowitz

    Abstract: In this study, we present a method for emotion recognition in Virtual Reality (VR) using pupillometry. We analyze pupil diameter responses to both visual and auditory stimuli via a VR headset and focus on extracting key features in the time-domain, frequency-domain, and time-frequency domain from VR generated data. Our approach utilizes feature selection to identify the most impactful features usi… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 14 pages, 9 figures, 1 table, journal

    Journal ref: Machine Learning and Applications: An International Journal (MLAIJ), vol. 11, no. 1, pp. 1-14, Mar. 2024

  2. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2306.11706  [pdf, other

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz , et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de… ▽ More

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  4. arXiv:2210.06397  [pdf, other

    cs.OH

    Star Anagram Detection and Classification

    Authors: Jason Parker, Dan Barker

    Abstract: A star anagram is a rearrangement of the letters of one word to produce another word where no letter retains its original neighbors. These maximally shuffled anagrams are rare, comprising only about 5.7% of anagrams in English. They can also be depicted as unicursal polygons with varying forms, including the eponymous stars. We develop automated methods for detecting stars among other anagrams and… ▽ More

    Submitted 18 September, 2022; originally announced October 2022.

    Comments: 14 pages, 14 figures in main article. Appendix contains several thousand figures over 250+ pages. In preparation for submission to Computational Geometry

    MSC Class: I.3.5

  5. arXiv:2010.06666  [pdf, other

    cs.CL

    Probing for Multilingual Numerical Understanding in Transformer-Based Language Models

    Authors: Devin Johnson, Denise Mak, Drew Barker, Lexi Loessberg-Zahl

    Abstract: Natural language numbers are an example of compositional structures, where larger numbers are composed of operations on smaller numbers. Given that compositional reasoning is a key to natural language understanding, we propose novel multilingual probing tasks tested on DistilBERT, XLM, and BERT to investigate for evidence of compositional reasoning over numerical data in various natural language n… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: BlackboxNLP (EMNLP 2020)

  6. arXiv:2009.14711  [pdf, other

    cs.RO cs.CV cs.LG

    S3K: Self-Supervised Semantic Keypoints for Robotic Manipulation via Multi-View Consistency

    Authors: Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov, David Barker, Rugile Pevceviciute, Thomas Rothörl, Christopher Schuster, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

    Abstract: A robot's ability to act is fundamentally constrained by what it can perceive. Many existing approaches to visual representation learning utilize general-purpose training criteria, e.g. image reconstruction, smoothness in latent space, or usefulness for control, or else make use of large datasets annotated with specific features (bounding boxes, segmentations, etc.). However, both approaches often… ▽ More

    Submitted 13 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: 11 pages, supplementary material available at: https://sites.google.com/view/2020-s3k/home

  7. arXiv:1909.12200  [pdf, other

    cs.RO cs.LG

    Scaling data-driven robotics with reward sketching and batch reinforcement learning

    Authors: Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

    Abstract: We present a framework for data-driven robotics that makes use of a large dataset of recorded robot experience and scales to several tasks using learned reward functions. We show how to apply this framework to accomplish three different object manipulation tasks on a real robot platform. Given demonstrations of a task together with task-agnostic recorded experience, we use a special form of human… ▽ More

    Submitted 4 June, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Project website: https://sites.google.com/view/data-driven-robotics/

    Journal ref: Robotics: Science and Systems Conference 2020

  8. arXiv:1810.01531  [pdf, other

    cs.RO

    A Practical Approach to Insertion with Variable Socket Position Using Deep Reinforcement Learning

    Authors: Mel Vecerik, Oleg Sushkov, David Barker, Thomas Rothörl, Todd Hester, Jon Scholz

    Abstract: Insertion is a challenging haptic and visual control problem with significant practical value for manufacturing. Existing approaches in the model-based robotics community can be highly effective when task geometry is known, but are complex and cumbersome to implement, and must be tailored to each individual problem by a qualified engineer. Within the learning community there is a long history of i… ▽ More

    Submitted 8 October, 2018; v1 submitted 2 October, 2018; originally announced October 2018.