
Showing 1–15 of 15 results for author: Davoodi, E

  1. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2303.01074  [pdf, other]

    cs.GT cs.LG

    Learning not to Regret

    Authors: David Sychrovský, Michal Šustr, Elnaz Davoodi, Michael Bowling, Marc Lanctot, Martin Schmid

    Abstract: The literature on game-theoretic equilibrium finding predominantly focuses on single games or their repeated play. Nevertheless, numerous real-world scenarios feature playing a game sampled from a distribution of similar, but not identical games, such as playing poker with different public cards or trading correlated assets on the stock market. As these similar games feature similar equilibria, we…

    Submitted 19 February, 2024; v1 submitted 2 March, 2023; originally announced March 2023.

  4. arXiv:2211.01480  [pdf, other]

    cs.MA cs.CL cs.HC

    Over-communicate no more: Situated RL agents learn concise communication protocols

    Authors: Aleksandra Kalinowska, Elnaz Davoodi, Florian Strub, Kory W Mathewson, Ivana Kajic, Michael Bowling, Todd D Murphey, Patrick M Pilarski

    Abstract: While it is known that communication facilitates cooperation in multi-agent settings, it is unclear how to design artificial agents that can learn to effectively and efficiently communicate with each other. Much research on communication emergence uses reinforcement learning (RL) and explores unsituated communication in one-step referential tasks -- the tasks are not temporally interactive and lac…

    Submitted 2 November, 2022; originally announced November 2022.

  5. arXiv:2205.12191  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization

    Authors: Aishwarya Agrawal, Ivana Kajić, Emanuele Bugliarello, Elnaz Davoodi, Anita Gergely, Phil Blunsom, Aida Nematzadeh

    Abstract: Vision-and-language (V&L) models pretrained on large-scale multimodal data have demonstrated strong performance on various tasks such as image captioning and visual question answering (VQA). The quality of such models is commonly assessed by measuring their performance on unseen data that typically comes from the same distribution as the training data. However, when evaluated under out-of-distribu…

    Submitted 1 April, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: Findings of EACL 2023. Aishwarya, Ivana, Emanuele and Aida had equal first author contributions. Elnaz and Anita had equal contributions. Aida and Aishwarya had equal senior contributions

  6. arXiv:2203.09498  [pdf, other]

    cs.AI cs.CL cs.LG cs.MA

    The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents

    Authors: Patrick M. Pilarski, Andrew Butcher, Elnaz Davoodi, Michael Bradley Johanson, Dylan J. A. Brenneis, Adam S. R. Parker, Leslie Acker, Matthew M. Botvinick, Joseph Modayil, Adam White

    Abstract: Learned communication between agents is a powerful tool when approaching decision-making problems that are hard to overcome by any single agent in isolation. However, continual coordination and communication learning between machine agents or human-machine partnerships remains a challenging open problem. As a stepping stone toward solving the continual communication learning problem, in this paper…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 54 pages, 29 figures, 4 tables

  7. arXiv:2201.03709  [pdf, other]

    cs.AI cs.LG cs.MA

    Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making

    Authors: Andrew Butcher, Michael Bradley Johanson, Elnaz Davoodi, Dylan J. A. Brenneis, Leslie Acker, Adam S. R. Parker, Adam White, Joseph Modayil, Patrick M. Pilarski

    Abstract: In this paper, we contribute a multi-faceted study into Pavlovian signalling -- a process by which learned, temporally extended predictions made by one agent inform decision-making by another agent. Signalling is intimately connected to time and timing. In service of generating and receiving signals, humans and other animals are known to represent time, determine time since past events, predict th…

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: 9 pages, 7 figures

  8. arXiv:2112.07774  [pdf, other]

    cs.AI cs.HC cs.MA

    Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study

    Authors: Dylan J. A. Brenneis, Adam S. Parker, Michael Bradley Johanson, Andrew Butcher, Elnaz Davoodi, Leslie Acker, Matthew M. Botvinick, Joseph Modayil, Adam White, Patrick M. Pilarski

    Abstract: Artificial intelligence systems increasingly involve continual learning to enable flexibility in general situations that are not encountered during system training. Human interaction with autonomous systems is broadly studied, but research has hitherto under-explored interactions that occur while the system is actively learning, and can noticeably change its behaviour in minutes. In this pilot stu…

    Submitted 22 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  9. arXiv:2112.03178  [pdf, other]

    cs.AI cs.GT cs.LG

    Student of Games: A unified learning algorithm for both perfect and imperfect information games

    Authors: Martin Schmid, Matej Moravcik, Neil Burch, Rudolf Kadlec, Josh Davidson, Kevin Waugh, Nolan Bard, Finbarr Timbers, Marc Lanctot, G. Zacharias Holland, Elnaz Davoodi, Alden Christianson, Michael Bowling

    Abstract: Games have a long history as benchmarks for progress in artificial intelligence. Approaches using search and learning produced strong performance across many perfect information games, and approaches using game-theoretic reasoning and learning demonstrated strong performance for specific imperfect information poker variants. We introduce Student of Games, a general-purpose algorithm that unifies p…

    Submitted 15 November, 2023; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Published in Science Advances

    Journal ref: Science Advances 9, eadg3256 (2023)

  10. arXiv:2101.04237  [pdf, other]

    cs.AI cs.LG

    Solving Common-Payoff Games with Approximate Policy Iteration

    Authors: Samuel Sokota, Edward Lockhart, Finbarr Timbers, Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot

    Abstract: For artificially intelligent learning systems to have widespread applicability in real-world settings, it is important that they be able to operate decentrally. Unfortunately, decentralized control is difficult -- computing even an epsilon-optimal joint policy is a NEXP-complete problem. Nevertheless, a recently rediscovered insight -- that a team of agents can coordinate via common knowledge -- h…

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: AAAI 2021

  11. arXiv:1709.02843  [pdf, ps, other]

    cs.CL

    CLaC at SemEval-2016 Task 11: Exploring linguistic and psycho-linguistic Features for Complex Word Identification

    Authors: Elnaz Davoodi, Leila Kosseim

    Abstract: This paper describes the system deployed by the CLaC-EDLK team to the "SemEval 2016, Complex Word Identification task". The goal of the task is to identify if a given word in a given context is "simple" or "complex". Our system relies on linguistic features and cognitive complexity. We used several supervised models, however the Random Forest model outperformed the others. Overall our best configu…

    Submitted 8 September, 2017; originally announced September 2017.

    Comments: In Proceedings of the International Workshop on Semantic Evaluation (SemEval-2016), a workshop of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-2016) pp 982-985. June 16-17, San Diego, California

  12. arXiv:1708.05857  [pdf, other]

    cs.CL

    The CLaC Discourse Parser at CoNLL-2015

    Authors: Majid Laali, Elnaz Davoodi, Leila Kosseim

    Abstract: This paper describes our submission (kosseim15) to the CoNLL-2015 shared task on shallow discourse parsing. We used the UIMA framework to develop our parser and used ClearTK to add machine learning functionality to the UIMA framework. Overall, our parser achieves a result of 17.3 F1 on the identification of discourse relations on the blind CoNLL-2015 test set, ranking in sixth place.

    Submitted 19 August, 2017; originally announced August 2017.

    Comments: Proceedings of the Nineteenth Conference on Computational Natural Language Learning Shared Task (CoNLL 2015). Beijing, China

  13. arXiv:1708.05800  [pdf, ps, other]

    cs.CL

    On the Contribution of Discourse Structure on Text Complexity Assessment

    Authors: Elnaz Davoodi, Leila Kosseim

    Abstract: This paper investigates the influence of discourse features on text complexity assessment. To do so, we created two data sets based on the Penn Discourse Treebank and the Simple English Wikipedia corpora and compared the influence of coherence, cohesion, surface, lexical and syntactic features to assess text complexity. Results show that with both data sets coherence features are more correlated…

    Submitted 18 August, 2017; originally announced August 2017.

    Comments: In Proceedings of the 17th Annual SigDial Meeting on Discourse and Dialogue (SigDial 2016). pp 166-174. September 13-15. Los Angeles, USA

  14. arXiv:1708.05797  [pdf, other]

    cs.CL

    CLaC @ QATS: Quality Assessment for Text Simplification

    Authors: Elnaz Davoodi, Leila Kosseim

    Abstract: This paper describes our approach to the 2016 QATS quality assessment shared task. We trained three independent Random Forest classifiers in order to assess the quality of the simplified texts in terms of grammaticality, meaning preservation and simplicity. We used the language model of Google-Ngram as feature to predict the grammaticality. Meaning preservation is predicted using two complementary…

    Submitted 18 August, 2017; originally announced August 2017.

    Comments: In Proceedings of the Workshop Shared task on Quality Assessment for Text Simplification (QATS-2016), a workshop of the 10th Language Resources and Evaluation Conference (LREC-2016), pp. 53-56, May 23-28, Portoroz, Slovenia

  15. arXiv:1708.03541  [pdf, ps, other]

    cs.CL

    Automatic Identification of AltLexes using Monolingual Parallel Corpora

    Authors: Elnaz Davoodi, Leila Kosseim

    Abstract: The automatic identification of discourse relations is still a challenging task in natural language processing. Discourse connectives, such as "since" or "but", are the most informative cues to identify explicit relations; however discourse parsers typically use a closed inventory of such connectives. As a result, discourse relations signaled by markers outside these inventories (i.e. AltLexes) ar…

    Submitted 11 August, 2017; originally announced August 2017.

    Comments: 6 pages, Proceedings of Recent Advances in Natural Language Processing (RANLP 2017)