Skip to main content

Showing 1–19 of 19 results for author: Sra, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16176  [pdf, other

    cs.AI cs.CL cs.LG

    GraphEval2000: Benchmarking and Improving Large Language Models on Graph Datasets

    Authors: Qiming Wu, Zichen Chen, Will Corcoran, Misha Sra, Ambuj K. Singh

    Abstract: Large language models (LLMs) have achieved remarkable success in natural language processing (NLP), demonstrating significant capabilities in processing and understanding text data. However, recent studies have identified limitations in LLMs' ability to reason about graph-structured data. To address this gap, we introduce GraphEval2000, the first comprehensive graph dataset, comprising 40 graph da… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPs 2024 Dataset and Benchmark track, under review

    MSC Class: H.2.8; I.2.6; I.2.7

  2. EntangleVR++: Evaluating the Potential of using Entanglement in an Interactive VR Scene Creation System

    Authors: Mengyu Chen, Marko Peljhan, Misha Sra

    Abstract: Interactive digital stories provide a sense of flexibility and freedom to players by allowing them to make choices at key junctions. These choices advance the narrative and determine, to some degree, how the story evolves for that player. As shown in prior work, the ability to control or participate in the construction of the narrative can give the player a high level of agency that results in a s… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Preprint for Frontiers in Virtual Reality, December 2023

    ACM Class: H.5.1

    Journal ref: Front. Virtual Real. 4:1252551 (2023)

  3. ConnectVR: A Trigger-Action Interface for Creating Agent-based Interactive VR Stories

    Authors: Mengyu Chen, Marko Peljhan, Misha Sra

    Abstract: The demand for interactive narratives is growing with increasing popularity of VR and video gaming. This presents an opportunity to create interactive storytelling experiences that allow players to engage with a narrative from a first person perspective, both, immersively in VR and in 3D on a computer. However, for artists and storytellers without programming experience, authoring such experiences… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Preprint for 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR)

    ACM Class: H.5.1

    Journal ref: in 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR), Orlando, FL, USA, 2024 pp. 286-297

  4. arXiv:2406.14373  [pdf, other

    cs.AI cs.CL cs.CY cs.HC cs.MA

    Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory

    Authors: Gordon Dai, Weijia Zhang, **han Li, Siqi Yang, Chidera Onochie lbe, Srihas Rao, Arthur Caetano, Misha Sra

    Abstract: The emergence of Large Language Models (LLMs) and advancements in Artificial Intelligence (AI) offer an opportunity for computational social science research at scale. Building upon prior explorations of LLM agent design, our work introduces a simulated agent society where complex social relationships dynamically form and evolve over time. Agents are imbued with psychological drives and placed in… ▽ More

    Submitted 1 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  5. DanceGen: Supporting Choreography Ideation and Prototy** with Generative AI

    Authors: Yimeng Liu, Misha Sra

    Abstract: Choreography creation requires high proficiency in artistic and technical skills. Choreographers typically go through four stages to create a dance piece: preparation, studio, performance, and reflection. This process is often individualized, complicated, and challenging due to multiple constraints at each stage. To assist choreographers, most prior work has focused on designing digital tools to s… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: ACM Conference on Designing Interactive Systems (DIS '24)

  6. arXiv:2404.11120  [pdf, other

    cs.CV

    TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

    Authors: Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen

    Abstract: Despite many attempts to leverage pre-trained text-to-image models (T2I) like Stable Diffusion (SD) for controllable image editing, producing good predictable results remains a challenge. Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e.g., with a specific object or person), or on optimizing the weights, text… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  7. Exploring AI-assisted Ideation and Prototy** for Choreography

    Authors: Yimeng Liu, Misha Sra

    Abstract: Choreography creation is a multimodal endeavor, demanding cognitive abilities to develop creative ideas and technical expertise to convert choreographic ideas into physical dance movements. Previous endeavors have sought to reduce the complexities in the choreography creation process in both dimensions. Among them, non-AI-based systems have focused on reinforcing cognitive activities by hel** an… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  8. arXiv:2311.08614  [pdf, other

    cs.CL cs.AI

    XplainLLM: A QA Explanation Dataset for Understanding LLM Decision-Making

    Authors: Zichen Chen, Jianda Chen, Mitali Gaidhani, Ambuj Singh, Misha Sra

    Abstract: Large Language Models (LLMs) have recently made impressive strides in natural language understanding tasks. Despite their remarkable performance, understanding their decision-making process remains a big challenge. In this paper, we look into bringing some transparency to this process by introducing a new explanation dataset for question answering (QA) tasks that integrates knowledge graphs (KGs)… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 17 pages, 6 figures, 7 tables. Our dataset is available at: https://github.com/chen-zichen/XplainLLM_dataset.git

  9. arXiv:2303.16537  [pdf, other

    cs.CL

    LMExplainer: a Knowledge-Enhanced Explainer for Language Models

    Authors: Zichen Chen, Ambuj K Singh, Misha Sra

    Abstract: Large language models (LLMs) such as GPT-4 are very powerful and can process different kinds of natural language processing (NLP) tasks. However, it can be difficult to interpret the results due to the multi-layer nonlinear model structure and millions of parameters. A lack of clarity and understanding of how the language models (LMs) work can make them unreliable, difficult to trust, and potentia… ▽ More

    Submitted 3 August, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 12 pages, 1 figure, 7 tables, and 3 case studies

  10. arXiv:2303.06277  [pdf, other

    cs.CV

    SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction

    Authors: Avinash Ajit Nargund, Misha Sra

    Abstract: 3D human motion prediction is a research area of high significance and a challenge in computer vision. It is useful for the design of many applications including robotics and autonomous driving. Traditionally, autogregressive models have been used to predict human motion. However, these models have high computation needs and error accumulation that make it difficult to use them for realtime applic… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

  11. CardsVR: A Two-Person VR Experience with Passive Haptic Feedback from a Deck of Playing Cards

    Authors: Andrew Huard, Mengyu Chen, Misha Sra

    Abstract: Presence in virtual reality (VR) is meaningful for remotely connecting with others and facilitating social interactions despite great distance while providing a sense of "being there." This work presents CardsVR, a two-person VR experience that allows remote participants to play a game of cards together. An entire deck of tracked cards are used to recreate the sense of playing cards in-person. Pri… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  12. arXiv:2207.04508  [pdf

    cs.HC

    Adaptive Virtual Neuroarchitecture

    Authors: Abhinandan Jain, Pattie Maes, Misha Sra

    Abstract: Our surrounding environment impacts our cognitive-emotional processes on a daily basis and shapes our physical, psychological and social wellbeing. Although the effects of the built environment on our psycho-physiological processes are well studied, virtual environment design with a potentially similar impact on the user, has received limited attention. Based on the influence of space design on a… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

  13. arXiv:2110.02950  [pdf, other

    cs.CL cs.CY cs.LG

    Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer

    Authors: Wenda Xu, Michael Saxon, Misha Sra, William Yang Wang

    Abstract: Expert-layman text style transfer technologies have the potential to improve communication between members of scientific communities and the general public. High-quality information produced by experts is often filled with difficult jargon laypeople struggle to understand. This is a particularly notable issue in the medical domain, where layman are often confused by medical text online. At present… ▽ More

    Submitted 18 December, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: 12 pages, 8 tables, 3 figures. AAAI 2022 Conference Paper

  14. Exploratory Design of a Hands-free Video Game Controller for a Quadriplegic Individual

    Authors: Atieh Taheri, Ziv Weissman, Misha Sra

    Abstract: From colored pixels to hyper-realistic 3D landscapes of virtual reality, video games have evolved immensely over the last few decades. However, video game input still requires two-handed dexterous finger manipulations for simultaneous joystick and trigger or mouse and keyboard presses. In this work, we explore the design of a hands-free game control method using realtime facial expression recognit… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: Published in: Augmented Humans Conference 2021

  15. arXiv:2108.12661  [pdf, other

    cs.HC

    SceneAR: Scene-based Micro Narratives for Sharing and Remixing in Augmented Reality

    Authors: Mengyu Chen, Andrés Monroy-Hernández, Misha Sra

    Abstract: Short-form digital storytelling has become a popular medium for millions of people to express themselves. Traditionally, this medium uses primarily 2D media such as text (e.g., memes), images (e.g., Instagram), gifs (e.g., Giphy), and videos (e.g., TikTok, Snapchat). To expand the modalities from 2D to 3D media, we present SceneAR, a smartphone application for creating sequential scene-based micro… ▽ More

    Submitted 28 August, 2021; originally announced August 2021.

    Comments: To be published in 2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)

  16. arXiv:2107.02965  [pdf, other

    cs.HC

    Telelife: The Future of Remote Living

    Authors: Jason Orlosky, Misha Sra, Kenan Bektaş, Huaishu Peng, Jeeeun Kim, Nataliya Kos'myna, Tobias Hollerer, Anthony Steed, Kiyoshi Kiyokawa, Kaan Akşit

    Abstract: In recent years, everyday activities such as work and socialization have steadily shifted to more remote and virtual settings. With the COVID-19 pandemic, the switch from physical to virtual has been accelerated, which has substantially affected various aspects of our lives, including business, education, commerce, healthcare, and personal life. This rapid and large-scale switch from in-person to… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  17. arXiv:2106.14014  [pdf, other

    eess.IV cs.MM

    Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

    Authors: Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

    Abstract: Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure. In addition, the recent COVID-19 pandemic fueled a surge in the use of video conferencing tools. Since videos take up considerable bandwidth (~100 Kbps to a few Mbps), improved video com… ▽ More

    Submitted 2 April, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

    Comments: 11 pages, 8 figures, 2 table. Addition of statistical analysis of results. Reorganization and rewriting of text to make it clearer

  18. arXiv:1512.02922  [pdf, other

    cs.HC

    MetaSpace II: Object and full-body tracking for interaction and navigation in social VR

    Authors: Misha Sra, Chris Schmandt

    Abstract: MetaSpace II (MS2) is a social Virtual Reality (VR) system where multiple users can not only see and hear but also interact with each other, grasp and manipulate objects, walk around in space, and get tactile feedback. MS2 allows walking in physical space by tracking each user's skeleton in real-time and allows users to feel by employing passive haptics i.e., when users touch or manipulate an obje… ▽ More

    Submitted 9 December, 2015; originally announced December 2015.

    Comments: 10 pages, 9 figures. Video: http://living.media.mit.edu/projects/metaspace-ii/

    ACM Class: H.5.1

  19. arXiv:1512.02921  [pdf, other

    cs.HC

    Design Strategies for Playful Technologies to Support Light-intensity Physical Activity in the Workplace

    Authors: Misha Sra, Chris Schmandt

    Abstract: Moderate to vigorous intensity physical activity has an established preventative role in obesity, cardiovascular disease, and diabetes. However recent evidence suggests that sitting time affects health negatively independent of whether adults meet prescribed physical activity guidelines. Since many of us spend long hours daily sitting in front of a host of electronic screens, this is cause for con… ▽ More

    Submitted 9 December, 2015; originally announced December 2015.

    Comments: 11 pages, 5 figures. Video: http://living.media.mit.edu/projects/see-saw/