Skip to main content

Showing 1–30 of 30 results for author: Bansal, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11409  [pdf, other

    cs.CL cs.AI

    CodeGemma: Open Code Models Based on Gemma

    Authors: CodeGemma Team, Heri Zhao, Jeffrey Hui, Joshua Howland, Nam Nguyen, Siqi Zuo, Andrea Hu, Christopher A. Choquette-Choo, **gyue Shen, Joe Kelley, Kshitij Bansal, Luke Vilnis, Mateo Wirth, Paul Michel, Peter Choy, Pratik Joshi, Ravin Kumar, Sarmad Hashmi, Shubham Agrawal, Zhitao Gong, Jane Fine, Tris Warkentin, Ale Jakse Hartman, Bin Ni, Kathy Korevec , et al. (2 additional authors not shown)

    Abstract: This paper introduces CodeGemma, a collection of specialized open code models built on top of Gemma, capable of a variety of code and natural language generation tasks. We release three model variants. CodeGemma 7B pretrained (PT) and instruction-tuned (IT) variants have remarkably resilient natural language understanding, excel in mathematical reasoning, and match code capabilities of other open… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: v1: 11 pages, 4 figures, 5 tables. v2: Update metadata

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2208.03849  [pdf, other

    cs.CV cs.AI

    RadSegNet: A Reliable Approach to Radar Camera Fusion

    Authors: Kshitiz Bansal, Keshav Rungta, Dinesh Bharadia

    Abstract: Perception systems for autonomous driving have seen significant advancements in their performance over last few years. However, these systems struggle to show robustness in extreme weather conditions because sensors like lidars and cameras, which are the primary sensors in a sensor suite, see a decline in performance under these conditions. In order to solve this problem, camera-radar fusion syste… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

  4. Pointillism: Accurate 3D bounding box estimation with multi-radars

    Authors: Kshitiz Bansal, Keshav Rungta, Siyuan Zhu, Dinesh Bharadia

    Abstract: Autonomous perception requires high-quality environment sensing in the form of 3D bounding boxes of dynamic objects. The primary sensors used in automotive systems are light-based cameras and LiDARs. However, they are known to fail in adverse weather conditions. Radars can potentially solve this problem as they are barely affected by adverse weather conditions. However, specular reflections of wir… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted in SenSys '20. Dataset has been made publicly available

    Journal ref: Proceedings of the 18th Conference on Embedded Networked Sensor Systems. Pages 340-353, 2020

  5. arXiv:2112.07549  [pdf, ps, other

    cs.IT

    Sequential Change Detection through Empirical Distribution and Universal Codes

    Authors: Vikrant Malik, R. K. Bansal

    Abstract: Universal compression algorithms have been studied in the past for sequential change detection, where they have been used to estimate the post-change distribution in the modified version of the Cumulative Sum (CUSUM) Test. In this paper, we introduce a modified CUSUM test where the pre-change distribution is also unknown and an empirical version of the pre-change distribution is used to implement… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  6. arXiv:2112.01938  [pdf, other

    cs.CL cs.AI cs.LG

    Shapes of Emotions: Multimodal Emotion Recognition in Conversations via Emotion Shifts

    Authors: Harsh Agarwal, Keshav Bansal, Abhinav Joshi, Ashutosh Modi

    Abstract: Emotion Recognition in Conversations (ERC) is an important and active research area. Recent work has shown the benefits of using multiple modalities (e.g., text, audio, and video) for the ERC task. In a conversation, participants tend to maintain a particular emotional state unless some stimuli evokes a change. There is a continuous ebb and flow of emotions in a conversation. Inspired by this obse… ▽ More

    Submitted 7 November, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 13 pages, Accepted at Workshop on Performance and Interpretability Evaluations of Multimodal, Multipurpose, Massive-Scale Models, COLING 2022

  7. arXiv:2007.10819  [pdf, other

    cs.CL cs.LG

    BAKSA at SemEval-2020 Task 9: Bolstering CNN with Self-Attention for Sentiment Analysis of Code Mixed Text

    Authors: Ayush Kumar, Harsh Agarwal, Keshav Bansal, Ashutosh Modi

    Abstract: Sentiment Analysis of code-mixed text has diversified applications in opinion mining ranging from tagging user reviews to identifying social or political sentiments of a sub-population. In this paper, we present an ensemble architecture of convolutional neural net (CNN) and self-attention based LSTM for sentiment analysis of code-mixed tweets. While the CNN component helps in the classification of… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

    Comments: 6 pages, 8 figures, 2 tables. Accepted at Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval-2020)

  8. arXiv:2006.04757  [pdf, other

    cs.LG cs.AI cs.PL stat.ML

    Mathematical Reasoning via Self-supervised Skip-tree Training

    Authors: Markus N. Rabe, Dennis Lee, Kshitij Bansal, Christian Szegedy

    Abstract: We examine whether self-supervised language modeling applied to mathematical formulas enables logical reasoning. We suggest several logical reasoning tasks that can be used to evaluate language models trained on formal mathematical statements, such as type inference, suggesting missing assumptions and completing equalities. To train language models for formal mathematics, we propose a novel skip-t… ▽ More

    Submitted 12 August, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  9. arXiv:2004.08450  [pdf, other

    cs.PL

    Reducing Commutativity Verification to Reachability with Differencing Abstractions

    Authors: Eric Koskinen, Kshitij Bansal

    Abstract: Commutativity of data structure methods is of ongoing interest, with roots in the database community. In recent years commutativity has been shown to be a key ingredient to enabling multicore concurrency in contexts such as parallelizing compilers, transactional memory, speculative execution and, more broadly, software scalability. Despite this interest, it remains an open question as to how a dat… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

  10. arXiv:1909.12756  [pdf, other

    cs.IR cs.AI

    On-Device User Intent Prediction for Context and Sequence Aware Recommendation

    Authors: Benu Madhab Changmai, Divija Nagaraju, Debi Prasanna Mohanty, Kriti Singh, Kunal Bansal, Sukumar Moharana

    Abstract: The pursuit of improved accuracy in recommender systems has led to the incorporation of user context. Context-aware recommender systems typically handle large amounts of data which must be uploaded and stored on the cloud, putting the user's personal information at risk. While there have been previous studies on privacy-sensitive and context-aware recommender systems, there has not been a full-fle… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

  11. arXiv:1909.11851  [pdf, other

    cs.LG cs.AI stat.ML

    Mathematical Reasoning in Latent Space

    Authors: Dennis Lee, Christian Szegedy, Markus N. Rabe, Sarah M. Loos, Kshitij Bansal

    Abstract: We design and conduct a simple experiment to study whether neural networks can perform several steps of approximate reasoning in a fixed dimensional latent space. The set of rewrites (i.e. transformations) that can be successfully performed on a statement represents essential semantic features of the statement. We can compress this information by embedding the formula in a vector space, such that… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

  12. arXiv:1905.10501  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Learning to Reason in Large Theories without Imitation

    Authors: Kshitij Bansal, Christian Szegedy, Markus N. Rabe, Sarah M. Loos, Viktor Toman

    Abstract: In this paper, we demonstrate how to do automated theorem proving in the presence of a large knowledge base of potential premises without learning from human proofs. We suggest an exploration mechanism that mixes in additional premises selected by a tf-idf (term frequency-inverse document frequency) based lookup in a deep reinforcement learning scenario. This helps with exploring and learning whic… ▽ More

    Submitted 11 June, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Major revision

  13. arXiv:1905.10006  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Graph Representations for Higher-Order Logic and Theorem Proving

    Authors: Aditya Paliwal, Sarah Loos, Markus Rabe, Kshitij Bansal, Christian Szegedy

    Abstract: This paper presents the first use of graph neural networks (GNNs) for higher-order proof search and demonstrates that GNNs can improve upon state-of-the-art results in this domain. Interactive, higher-order theorem provers allow for the formalization of most mathematical theories and have been shown to pose a significant challenge for deep learning. Higher-order logic is highly expressive and, eve… ▽ More

    Submitted 12 September, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

  14. arXiv:1905.01537  [pdf, other

    cs.LG cs.AI

    Hierarchical Policy Learning is Sensitive to Goal Space Design

    Authors: Zach Dwiel, Madhavun Candadai, Mariano Phielipp, Arjun K. Bansal

    Abstract: Hierarchy in reinforcement learning agents allows for control at multiple time scales yielding improved sample efficiency, the ability to deal with long time horizons and transferability of sub-policies to tasks outside the training distribution. It is often implemented as a master policy providing goals to a sub-policy. Ideally, we would like the goal-spaces to be learned, however, properties of… ▽ More

    Submitted 25 June, 2019; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: Accepted to be presented at Task-Agnostic Reinforcement Learning (TARL) workshop at ICLR'19

  15. arXiv:1904.03241  [pdf, other

    cs.LO cs.AI cs.LG

    HOList: An Environment for Machine Learning of Higher-Order Theorem Proving

    Authors: Kshitij Bansal, Sarah M. Loos, Markus N. Rabe, Christian Szegedy, Stewart Wilcox

    Abstract: We present an environment, benchmark, and deep learning driven automated theorem prover for higher-order logic. Higher-order interactive theorem provers enable the formalization of arbitrary mathematical theories and thereby present an interesting, open-ended challenge for deep learning. We provide an open-source framework based on the HOL Light theorem prover that can be used as a reinforcement l… ▽ More

    Submitted 1 November, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted at ICML 2019

  16. arXiv:1806.05154  [pdf, other

    cs.CV

    Automated Performance Assessment in Transoesophageal Echocardiography with Convolutional Neural Networks

    Authors: Evangelos B. Mazomenos, Kamakshi Bansal, Bruce Martin, Andrew Smith, Susan Wright, Danail Stoyanov

    Abstract: Transoesophageal echocardiography (TEE) is a valuable diagnostic and monitoring imaging modality. Proper image acquisition is essential for diagnosis, yet current assessment techniques are solely based on manual expert review. This paper presents a supervised deep learn ing framework for automatically evaluating and grading the quality of TEE images. To obtain the necessary dataset, 38 participant… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: to be presented in MICCAI 2018, Granada, Spain, 16-20 Sep 2018

  17. arXiv:1802.08748  [pdf, ps, other

    cs.PL

    Automatic Generation of Precise and Useful Commutativity Conditions (Extended Version)

    Authors: Kshitij Bansal, Eric Koskinen, Omer Tripp

    Abstract: Reasoning about commutativity between data-structure operations is an important problem with applications including parallelizing compilers, optimistic parallelization and, more recently, Ethereum smart contracts. There have been research results on automatic generation of commutativity conditions, yet we are unaware of any fully automated technique to generate conditions that are both sound and e… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: Note: This is an extended version of our paper, which appears in TACAS 2018

  18. arXiv:1801.08058  [pdf, other

    cs.DC cs.LG

    Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning

    Authors: Scott Cyphers, Arjun K. Bansal, Anahita Bhiwandiwalla, Jayaram Bobba, Matthew Brookhart, Avijit Chakraborty, Will Constable, Christian Convey, Leona Cook, Omar Kanawi, Robert Kimball, Jason Knight, Nikolay Korovaiko, Varun Kumar, Yixing Lao, Christopher R. Lishka, Jaikrishnan Menon, Jennifer Myers, Sandeep Aswath Narayana, Adam Procter, Tristan J. Webb

    Abstract: The Deep Learning (DL) community sees many novel topologies published each year. Achieving high performance on each new topology remains challenging, as each requires some level of manual effort. This issue is compounded by the proliferation of frameworks and hardware platforms. The current approach, which we call "direct optimization", requires deep changes within each framework to improve the tr… ▽ More

    Submitted 29 January, 2018; v1 submitted 24 January, 2018; originally announced January 2018.

  19. arXiv:1711.02213  [pdf, other

    cs.LG math.NA stat.ML

    Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks

    Authors: Urs Köster, Tristan J. Webb, Xin Wang, Marcel Nassar, Arjun K. Bansal, William H. Constable, Oğuz H. Elibol, Scott Gray, Stewart Hall, Luke Hornof, Amir Khosrowshahi, Carey Kloss, Ruby J. Pai, Naveen Rao

    Abstract: Deep neural networks are commonly developed and trained in 32-bit floating point format. Significant gains in performance and energy efficiency could be realized by training and inference in numerical formats optimized for deep learning. Despite advances in limited precision inference in recent years, training of neural networks in low bit-width remains a challenging problem. Here we present the F… ▽ More

    Submitted 2 December, 2017; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 14 pages, 5 figures, accepted in Neural Information Processing Systems 2017

  20. Reasoning with Finite Sets and Cardinality Constraints in SMT

    Authors: Kshitij Bansal, Clark Barrett, Andrew Reynolds, Cesare Tinelli

    Abstract: We consider the problem of deciding the satisfiability of quantifier-free formulas in the theory of finite sets with cardinality constraints. Sets are a common high-level data structure used in programming; thus, such a theory is useful for modeling program constructs directly. More importantly, sets are a basic construct of mathematics and thus natural to use when formalizing the properties of co… ▽ More

    Submitted 31 October, 2018; v1 submitted 20 February, 2017; originally announced February 2017.

    Journal ref: Logical Methods in Computer Science, Volume 14, Issue 4 (November 1, 2018) lmcs:3155

  21. arXiv:1606.00822  [pdf

    cs.CV cs.HC

    Unifying Geometric Features and Facial Action Units for Improved Performance of Facial Expression Analysis

    Authors: Mehdi Ghayoumi, Arvind K Bansal

    Abstract: Previous approaches to model and analyze facial expression analysis use three different techniques: facial action units, geometric features and graph based modelling. However, previous approaches have treated these technique separately. There is an interrelationship between these techniques. The facial expression analysis is significantly improved by utilizing these map**s between major geometri… ▽ More

    Submitted 2 June, 2016; originally announced June 2016.

    Comments: 8 pages, ISBN: 978-1-61804-285-9

  22. On Deciding Local Theory Extensions via E-matching

    Authors: Kshitij Bansal, Andrew Reynolds, Tim King, Clark Barrett, Thomas Wies

    Abstract: Satisfiability Modulo Theories (SMT) solvers incorporate decision procedures for theories of data types that commonly occur in software. This makes them important tools for automating verification problems. A limitation frequently encountered is that verification problems are often not fully expressible in the theories supported natively by the solvers. Many solvers allow the specification of appl… ▽ More

    Submitted 27 August, 2015; originally announced August 2015.

  23. arXiv:1412.6583  [pdf, other

    cs.LG cs.CV cs.NE

    Discovering Hidden Factors of Variation in Deep Networks

    Authors: Brian Cheung, Jesse A. Livezey, Arjun K. Bansal, Bruno A. Olshausen

    Abstract: Deep learning has enjoyed a great deal of success because of its ability to learn useful features for tasks such as classification. But there has been less exploration in learning the factors of variation apart from the classification signal. By augmenting autoencoders with simple regularization terms during training, we demonstrate that standard deep architectures can discover and explicitly repr… ▽ More

    Submitted 17 June, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: Presented at International Conference on Learning Representations 2015 Workshop

  24. On Match Lengths, Zero Entropy and Large Deviations - with Application to Sliding Window Lempel-Ziv Algorithm

    Authors: Siddharth Jain, R. K. Bansal

    Abstract: The Sliding Window Lempel-Ziv (SWLZ) algorithm that makes use of recurrence times and match lengths has been studied from various perspectives in information theory literature. In this paper, we undertake a finer study of these quantities under two different scenarios, i) \emph{zero entropy} sources that are characterized by strong long-term memory, and ii) the processes with weak memory as descri… ▽ More

    Submitted 6 November, 2014; v1 submitted 5 November, 2014; originally announced November 2014.

    Comments: accepted to appear in IEEE Transactions on Information Theory

  25. K-Algorithm A Modified Technique for Noise Removal in Handwritten Documents

    Authors: Kanika Bansal, Rajiv Kumar

    Abstract: OCR has been an active research area since last few decades. OCR performs the recognition of the text in the scanned document image and converts it into editable form. The OCR process can have several stages like pre-processing, segmentation, recognition and post processing. The pre-processing stage is a crucial stage for the success of OCR, which mainly deals with noise removal. In the present pa… ▽ More

    Submitted 6 June, 2013; originally announced June 2013.

    Journal ref: International Journal of Information Sciences and Techniques, May 2013, Volume 3, Number 3

  26. arXiv:1303.1098  [pdf, ps, other

    cs.IT

    On Match Lengths and the Asymptotic Behavior of Sliding Window Lempel-Ziv Algorithm for Zero Entropy Sequences

    Authors: Siddharth Jain, Rakesh Kumar Bansal

    Abstract: The Sliding Window Lempel-Ziv (SWLZ) algorithm has been studied from various perspectives in information theory literature. In this paper, we provide a general law which defines the asymptotics of match length for stationary and ergodic zero entropy processes. Moreover, we use this law to choose the match length $L_o$ in the almost sure optimality proof of Fixed Shift Variant of Lempel-Ziv (FSLZ)… ▽ More

    Submitted 17 May, 2013; v1 submitted 5 March, 2013; originally announced March 2013.

    Comments: 5 pages, International Symposium on Information Theory, 2013

  27. arXiv:1303.1093  [pdf, ps, other

    cs.IT

    On Large Deviation Property of Recurrence Times

    Authors: Siddharth Jain, Rakesh Kumar Bansal

    Abstract: We extend the study by Ornstein and Weiss on the asymptotic behavior of the normalized version of recurrence times and establish the large deviation property for a certain class of mixing processes. Further, an estimator for entropy based on recurrence times is proposed for which large deviation behavior is proved for stationary and ergodic sources satisfying similar mixing conditions.

    Submitted 17 May, 2013; v1 submitted 5 March, 2013; originally announced March 2013.

    Comments: 5 pages, International Symposium on Information Theory 2013

  28. arXiv:1212.1485  [pdf, other

    cs.LO cs.FL

    A Note on the Complexity of Model-Checking Bounded Multi-Pushdown Systems

    Authors: Kshitij Bansal, Stéphane Demri

    Abstract: In this note, we provide complexity characterizations of model checking multi-pushdown systems. Multi-pushdown systems model recursive concurrent programs in which any sequential process has a finite control. We consider three standard notions for boundedness: context boundedness, phase boundedness and stack ordering. The logical formalism is a linear-time temporal logic extending well-known logic… ▽ More

    Submitted 6 December, 2012; originally announced December 2012.

    Report number: NYU TR2012-949 ACM Class: F.4.1; D.2.4

  29. arXiv:0803.3515  [pdf

    cs.OH

    Geographic Information Systems in Evaluation and Visualization of Construction Schedule

    Authors: V. K. Bansal, Mahesh Pal

    Abstract: Commercially available scheduling tools such as Primavera and Microsoft Project fail to provide information pertaining to the spatial aspects of construction project. A methodology using geographical information systems (GIS) is developed to represent spatial aspects of the construction progress graphically by synchronizing it with construction schedule. The spatial aspects are depicted by 3D mo… ▽ More

    Submitted 25 March, 2008; originally announced March 2008.

    Comments: Presented in Second ESRI Asia-Pacific User Conference New Delhi, 2007

  30. arXiv:cs/0607029   

    cs.IT

    A Coding Theorem Characterizing Renyi's Entropy through Variable-to-Fixed Length Codes

    Authors: Vaneet Aggarwal, R. K. Bansal

    Abstract: This paper has been withdrawn

    Submitted 28 October, 2006; v1 submitted 8 July, 2006; originally announced July 2006.

    Comments: This paper has been withdrawn