Skip to main content

Showing 1–20 of 20 results for author: Jiménez, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15793  [pdf, other

    cs.SE cs.AI cs.CL cs.HC cs.LG

    SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

    Authors: John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

    Abstract: Language model (LM) agents are increasingly being used to automate complicated tasks in digital environments. Just as humans benefit from powerful software applications, such as integrated development environments, for complex tasks like software engineering, we posit that LM agents represent a new category of end users with their own needs and abilities, and would benefit from specially-built int… ▽ More

    Submitted 30 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Code, data, and demo available at https://swe-agent.com

  2. arXiv:2310.06770  [pdf, other

    cs.CL cs.AI cs.SE

    SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

    Authors: Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

    Abstract: Language models have outpaced our ability to evaluate them effectively, but for their future development it is essential to study the frontier of their capabilities. We find real-world software engineering to be a rich, sustainable, and challenging testbed for evaluating the next generation of language models. To this end, we introduce SWE-bench, an evaluation framework consisting of $2,294$ softw… ▽ More

    Submitted 5 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Data, code, and leaderboard are available at https://www.swebench.com ICLR 2024, https://openreview.net/forum?id=VTF8yNQM66

  3. arXiv:2308.16670  [pdf, other

    cs.SE

    Safety of the Intended Functionality Concept Integration into a Validation Tool Suite

    Authors: Víctor J. Expósito Jiménez, Bernhard Winkler, Joaquim M. Castella Triginer, Heiko Scharke, Hannes Schneider, Eugen Brenner, Georg Macher

    Abstract: Nowadays, the increasing complexity of Advanced Driver Assistance Systems (ADAS) and Automated Driving (AD) means that the industry must move towards a scenario-based approach to validation rather than relying on established technology-based methods. This new focus also requires the validation process to take into account Safety of the Intended Functionality (SOTIF), as many scenarios may trigger… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  4. arXiv:2305.15093  [pdf, other

    cs.CL cs.AI cs.LG

    C-STS: Conditional Semantic Textual Similarity

    Authors: Ameet Deshpande, Carlos E. Jimenez, Howard Chen, Vishvak Murahari, Victoria Graf, Tanmay Rajpurohit, Ashwin Kalyan, Danqi Chen, Karthik Narasimhan

    Abstract: Semantic textual similarity (STS), a cornerstone task in NLP, measures the degree of similarity between a pair of sentences, and has broad application in fields such as information retrieval and natural language understanding. However, sentence similarity can be inherently ambiguous, depending on the specific aspect of interest. We resolve this ambiguity by proposing a novel task called Conditiona… ▽ More

    Submitted 6 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Published in EMNLP 2023

  5. arXiv:2302.12441  [pdf, other

    cs.LG cs.CL

    MUX-PLMs: Data Multiplexing for High-throughput Language Models

    Authors: Vishvak Murahari, Ameet Deshpande, Carlos E. Jimenez, Izhak Shafran, Mingqiu Wang, Yuan Cao, Karthik Narasimhan

    Abstract: The widespread adoption of large language models such as ChatGPT and Bard has led to unprecedented demand for these technologies. The burgeoning cost of inference for ever-increasing model sizes coupled with hardware shortages has limited affordable access and poses a pressing need for efficiency approaches geared towards high throughput and performance. Multi-input multi-output (MIMO) algorithms… ▽ More

    Submitted 22 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  6. Triggering Conditions Analysis and Use Case for Validation of ADAS/ADS Functions

    Authors: Víctor J. Expósito Jiménez, Helmut Martin, Christian Schwarzl, Georg Macher, Eugen Brenner

    Abstract: Safety in the automotive domain is a well-known topic, which has been in constant development in the past years. The complexity of new systems that add more advanced components in each function has opened new trends that have to be covered from the safety perspective. In this case, not only specifications and requirements have to be covered but also scenarios, which cover all relevant information… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  7. State of the Art Study of the Safety Argumentation Frameworks for Automated Driving System Safety

    Authors: Ilona Cieslik, Víctor J. Expósito Jiménez, Helmut Martin, Heiko Scharke, Hannes Schneider

    Abstract: The automotive industry is experiencing a transition from assisted to highly automated driving. New concepts for validation of Automated Driving System (ADS) include amongst other a shift from a "technology based" approach to a "scenario based" assessment. The safety validation and type approval process of ADS are seen as the biggest challenges for the automotive industry today. Having in mind a v… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  8. arXiv:2209.13372  [pdf

    cs.SE cs.CY

    CSRE4SOC (CSR evaluation for software companies)

    Authors: Elisa Jimenez, Coral Calero, Maria Ángeles Moraga

    Abstract: Software development companies are increasingly concerned about their impact on the environment. This is translated into the incorporation of actions related to software sustainability in their Corporate Social Responsibility (CSR) document. CSR reflects a company's obligations to society and the environment. However, we have found that companies do not always have the necessary knowledge to be ab… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  9. arXiv:2206.08308  [pdf

    eess.IV cs.CR cs.CV cs.LG

    Deepfake histological images for enhancing digital pathology

    Authors: Kianoush Falahkheirkhah, Saumya Tiwari, Kevin Yeh, Sounak Gupta, Loren Herrera-Hernandez, Michael R. McCarthy, Rafael E. Jimenez, John C. Cheville, Rohit Bhargava

    Abstract: An optical microscopic examination of thinly cut stained tissue on glass slides prepared from a FFPE tissue blocks is the gold standard for tissue diagnostics. In addition, the diagnostic abilities and expertise of any pathologist is dependent on their direct experience with common as well as rarer variant morphologies. Recently, deep learning approaches have been used to successfully show a high… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  10. arXiv:2203.07613  [pdf, other

    cs.CL cs.CV

    CARETS: A Consistency And Robustness Evaluative Test Suite for VQA

    Authors: Carlos E. Jimenez, Olga Russakovsky, Karthik Narasimhan

    Abstract: We introduce CARETS, a systematic test suite to measure consistency and robustness of modern VQA models through a series of six fine-grained capability tests. In contrast to existing VQA test sets, CARETS features balanced question generation to create pairs of instances to test models, with each pair focusing on a specific capability such as rephrasing, logical symmetry or image obfuscation. We e… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  11. arXiv:2202.09318  [pdf, other

    cs.LG cs.AI

    DataMUX: Data Multiplexing for Neural Networks

    Authors: Vishvak Murahari, Carlos E. Jimenez, Runzhe Yang, Karthik Narasimhan

    Abstract: In this paper, we introduce data multiplexing (DataMUX), a technique that enables deep neural networks to process multiple inputs simultaneously using a single compact representation. DataMUX demonstrates that neural networks are capable of generating accurate predictions over mixtures of inputs, resulting in increased throughput with minimal extra memory requirements. Our approach uses two key co… ▽ More

    Submitted 14 November, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022

  12. arXiv:2010.10835  [pdf, other

    cs.RO

    Radar detection rate comparison through a mobile robot platform at the ZalaZONE proving ground

    Authors: Victor J. Exposito Jimenez, Christian Schwarzl, Szilard Josvai

    Abstract: Since an automotive driving vehicle is controlled by Advanced Driver-Assistance Systems (ADAS) / Automated Driving (AD) functions, the selected sensors for the perception process become a key component of the system. Therefore, the necessity of ensuring precise data is crucial. But the correctness of the data is not the only part that has to be ensured, the limitations of the different technologie… ▽ More

    Submitted 22 October, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

  13. arXiv:2009.11741  [pdf

    cs.SE cs.NI

    Dynamic Buffer Sizing for Out-of-Order Event Compensation for Time-Sensitive Applications

    Authors: Wolfgang Weiss, Victor J. Exposito Jimenez, Herwig Zeiner

    Abstract: Today's sensor network implementations often comprise various types of nodes connected with different types of networks. These and various other aspects influence the delay of transmitting data and therefore of out-of-order data occurrences. This turns into a crucial problem in time-sensitive applications where data must be processed promptly and decisions must be reliable. In this paper, we wer… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

  14. Evaluation of an indoor localization system for a mobile robot

    Authors: Victor J. Exposito Jimenez, Christian Schwarzl, Helmut Martin

    Abstract: Although indoor localization has been a wide researched topic, obtained results may not fit the requirements that some domains need. Most approaches are not able to precisely localize a fast moving object even with a complex installation, which makes their implementation in the automated driving domain complicated. In this publication, common technologies were analyzed and a commercial product, ca… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Journal ref: 2019 IEEE International Conference on Connected Vehicles and Expo (ICCVE)

  15. arXiv:2005.12783  [pdf, other

    cs.DC cs.CY stat.AP

    CoronaSurveys: Using Surveys with Indirect Reporting to Estimate the Incidence and Evolution of Epidemics

    Authors: Oluwasegun Ojo, Augusto García-Agundez, Benjamin Girault, Harold Hernández, Elisa Cabana, Amanda García-García, Payman Arabshahi, Carlos Baquero, Paolo Casari, Ednaldo José Ferreira, Davide Frey, Chryssis Georgiou, Mathieu Goessens, Anna Ishchenko, Ernesto Jiménez, Oleksiy Kebkal, Rosa Lillo, Raquel Menezes, Nicolas Nicolaou, Antonio Ortega, Paul Patras, Julian C Roberts, Efstathios Stavrakis, Yuichi Tanaka, Antonio Fernández Anta

    Abstract: The world is suffering from a pandemic called COVID-19, caused by the SARS-CoV-2 virus. National governments have problems evaluating the reach of the epidemic, due to having limited resources and tests at their disposal. This problem is especially acute in low and middle-income countries (LMICs). Hence, any simple, cheap and flexible means of evaluating the incidence and evolution of the epidemic… ▽ More

    Submitted 26 June, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: Presented at The KDD Workshop on Humanitarian Map**, San Diego, California USA, August 24, 2020

  16. An Overview of Wireless IoT Protocol Security in the Smart Home Domain

    Authors: Stefan Marksteiner, Víctor Juan Expósito Jiménez, Heribert Vallant, Herwig Zeiner

    Abstract: While the application of IoT in smart technologies becomes more and more proliferated, the pandemonium of its protocols becomes increasingly confusing. More seriously, severe security deficiencies of these protocols become evident, as time-to- market is a key factor, which satisfaction comes at the price of a less thorough security design and testing. This applies especially to the smart home doma… ▽ More

    Submitted 22 January, 2018; originally announced January 2018.

    Comments: 8 pages, 4 figures

    Journal ref: Proceedings of the Joint 13th CTTE and 10th CMI Conference on Internet of Things Business Models, Users, and Networks, 2017

  17. arXiv:1604.03435  [pdf, other

    cs.NI

    Simulation of Underwater RF Wireless Sensor Networks using Castalia

    Authors: Sergio Valcarcel Macua, Santiago Zazo, Javier Zazo, Marina Pérez Jiménez, Iván Pérez-Álvarez, Eugenio Jiménez, Joaquín Hernández Brito

    Abstract: We use real measurements of the underwater channel to simulate a whole underwater RF wireless sensor networks, including propagation impairments (e.g., noise, interference), radio hardware (e.g., modulation scheme, bandwidth, transmit power), hardware limitations (e.g., clock drift, transmission buffer) and complete MAC and routing protocols. The results should be useful for designing centralized… ▽ More

    Submitted 12 April, 2016; originally announced April 2016.

    Comments: Underwater Communications and Networking 2016

  18. arXiv:1110.1842  [pdf, ps, other

    cs.DC cs.DS

    Failure Detectors in Homonymous Distributed Systems (with an Application to Consensus)

    Authors: Sergio Arévalo, Antonio Fernández Anta, Damien Imbs, Ernesto Jiménez, Michel Raynal

    Abstract: This paper addresses the consensus problem in homonymous distributed systems where processes are prone to crash failures and have no initial knowledge of the system membership ("homonymous" means that several processes may have the same identifier). New classes of failure detectors suited to these systems are first defined. Among them, the classes HΩ and HΣ are introduced that are the homonymous c… ▽ More

    Submitted 27 November, 2011; v1 submitted 9 October, 2011; originally announced October 2011.

  19. arXiv:0712.3980  [pdf, ps, other

    cs.DC

    Distributed Slicing in Dynamic Systems

    Authors: Antonio Fernandez, Vincent Gramoli, Ernesto Jimenez, Anne-Marie Kermarrec, Michel Raynal

    Abstract: Peer to peer (P2P) systems are moving from application specific architectures to a generic service oriented design philosophy. This raises interesting problems in connection with providing useful P2P middleware services capable of dealing with resource assignment and management in a large-scale, heterogeneous and unreliable environment. The slicing service, has been proposed to allow for an auto… ▽ More

    Submitted 26 December, 2007; originally announced December 2007.

    Report number: ICDCS07

    Journal ref: Dans The 27th International Conference on Distributed Computing Systems (ICDCS'07) (2007) 66

  20. arXiv:cs/0612035  [pdf, ps, other

    cs.DC

    Distributed Slicing in Dynamic Systems

    Authors: Antonio Fernandez, Vincent Gramoli, Ernesto Jimenez, Anne-Marie Kermarrec, Michel Raynal

    Abstract: Peer to peer (P2P) systems are moving from application specific architectures to a generic service oriented design philosophy. This raises interesting problems in connection with providing useful P2P middleware services that are capable of dealing with resource assignment and management in a large-scale, heterogeneous and unreliable environment. One such service, the slicing service, has been pr… ▽ More

    Submitted 6 December, 2006; originally announced December 2006.