Skip to main content

Showing 1–13 of 13 results for author: Tambe, T

.
  1. arXiv:2310.07854  [pdf, other

    cs.RO

    VaPr: Variable-Precision Tensors to Accelerate Robot Motion Planning

    Authors: Yu-Shun Hsiao, Siva Kumar Sastry Hari, Balakumar Sundaralingam, Jason Yik, Thierry Tambe, Charbel Sakr, Stephen W. Keckler, Vijay Janapa Reddi

    Abstract: High-dimensional motion generation requires numerical precision for smooth, collision-free solutions. Typically, double-precision or single-precision floating-point (FP) formats are utilized. Using these for big tensors imposes a strain on the memory bandwidth provided by the devices and alters the memory footprint, hence limiting their applicability to low-power edge devices needed for mobile rob… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 7 pages, 5 figures, 8 tables, to be published in 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  2. arXiv:2305.03148  [pdf, other

    cs.AR cs.LG cs.NE

    CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning

    Authors: Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David Brooks

    Abstract: On-device learning allows AI models to adapt to user data, thereby enhancing service quality on edge platforms. However, training AI on resource-limited devices poses significant challenges due to the demanding computing workload and the substantial memory consumption and data access required by deep neural networks (DNNs). To address these issues, we propose utilizing embedded dynamic random-acce… ▽ More

    Submitted 22 December, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  3. arXiv:2203.00218  [pdf, other

    cs.AR cs.PL

    Application-Level Validation of Accelerator Designs Using a Formal Software/Hardware Interface

    Authors: Bo-Yuan Huang, Steven Lyubomirsky, Yi Li, Mike He, Gus Henry Smith, Thierry Tambe, Akash Gaonkar, Vishal Canumalla, Andrew Cheung, Gu-Yeon Wei, Aarti Gupta, Zachary Tatlock, Sharad Malik

    Abstract: Ideally, accelerator development should be as easy as software development. Several recent design languages/tools are working toward this goal, but actually testing early designs on real applications end-to-end remains prohibitively difficult due to the costs of building specialized compiler and simulator support. We propose a new first-in-class, mostly automated methodology termed "3LA" to enable… ▽ More

    Submitted 22 August, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

  4. arXiv:2109.05683  [pdf, ps, other

    cs.RO cs.AR

    AutoSoC: Automating Algorithm-SOC Co-design for Aerial Robots

    Authors: Srivatsan Krishnan, Thierry Tambe, Zishen Wan, Vijay Janapa Reddi

    Abstract: Aerial autonomous machines (Drones) has a plethora of promising applications and use cases. While the popularity of these autonomous machines continues to grow, there are many challenges, such as endurance and agility, that could hinder the practical deployment of these machines. The closed-loop control frequency must be high to achieve high agility. However, given the resource-constrained nature… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: Class Project for CS249r: Special Topics on Edge Computing (Autonomous Machines)

  5. arXiv:2105.01134  [pdf, other

    eess.AS cs.SD

    Quantifying and Maximizing the Benefits of Back-End Noise Adaption on Attention-Based Speech Recognition Models

    Authors: Coleman Hooper, Thierry Tambe, Gu-Yeon Wei

    Abstract: This work analyzes how attention-based Bidirectional Long Short-Term Memory (BLSTM) models adapt to noise-augmented speech. We identify crucial components for noise adaptation in BLSTM models by freezing model components during fine-tuning. We first freeze larger model subnetworks and then pursue a fine-grained freezing approach in the encoder after identifying its importance for noise adaptation.… ▽ More

    Submitted 23 September, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: Submitted to ENLSP 2021

  6. arXiv:2011.14203  [pdf, other

    cs.AR cs.CL

    EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference

    Authors: Thierry Tambe, Coleman Hooper, Lillian Pentecost, Tianyu Jia, En-Yu Yang, Marco Donato, Victor Sanh, Paul N. Whatmough, Alexander M. Rush, David Brooks, Gu-Yeon Wei

    Abstract: Transformer-based language models such as BERT provide significant accuracy improvement for a multitude of natural language processing (NLP) tasks. However, their hefty computational and memory demands make them challenging to deploy to resource-constrained edge platforms with strict latency requirements. We present EdgeBERT, an in-depth algorithm-hardware co-design for latency-aware energy optimi… ▽ More

    Submitted 5 September, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: 12 pages plus references. Paper to appear at the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO 2021)

  7. arXiv:1909.13271  [pdf, other

    cs.LG cs.AR stat.ML

    AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

    Authors: Thierry Tambe, En-Yu Yang, Zishen Wan, Yuntian Deng, Vijay Janapa Reddi, Alexander Rush, David Brooks, Gu-Yeon Wei

    Abstract: Conventional hardware-friendly quantization methods, such as fixed-point or integer, tend to perform poorly at very low word sizes as their shrinking dynamic ranges cannot adequately capture the wide data distributions commonly seen in sequence transduction models. We present AdaptivFloat, a floating-point inspired number representation format for deep learning that dynamically maximizes and optim… ▽ More

    Submitted 11 February, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: 10 pages

  8. arXiv:1908.08976  [pdf, other

    eess.SP

    MASR: A Modular Accelerator for Sparse RNNs

    Authors: Udit Gupta, Brandon Reagen, Lillian Pentecost, Marco Donato, Thierry Tambe, Alexander M. Rush, Gu-Yeon Wei, David Brooks

    Abstract: Recurrent neural networks (RNNs) are becoming the de facto solution for speech recognition. RNNs exploit long-term temporal relationships in data by applying repeated, learned transformations. Unlike fully-connected (FC) layers with single vector matrix operations, RNN layers consist of hundreds of such operations chained over time. This poses challenges unique to RNNs that are not found in convol… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

  9. First-principles calculations of step formation energies and step interactions on TiN(001)

    Authors: C. V. Ciobanu, D. T. Tambe, V. B. Shenoy

    Abstract: We study the formation energies and repulsive interactions of monatomic steps on the TiN(001) surface, using density functional total-energy calculations. The calculated formation energy of [100] oriented steps agree well with recently reported experimental values; these steps are shown to have a rumpled structure, with the Ti atoms undergoing larger displacements than the N atoms. For steps tha… ▽ More

    Submitted 24 October, 2004; v1 submitted 30 September, 2004; originally announced October 2004.

    Journal ref: Surface Science 582, 145-150 (2005).

  10. Influence of step-edge barriers on the morphological relaxation of nanoscale ripples on crystal surfaces

    Authors: V. B. Shenoy, A. Ramasubramaniam, H. Ramanarayan, D. T. Tambe, W-L. Chan, E. Chason

    Abstract: We show that the decay of sinusoidal ripples on crystal surfaces, where mass transport is limited by the attachment and detachment of atoms at the step-edges, is remarkably different from the decay behavior that has been reported until now. Unlike the decreasing or at most constant rate of amplitude decay of sinusoidal profiles observed in earlier work, we find that the decay rate increases with… ▽ More

    Submitted 29 April, 2004; originally announced April 2004.

    Comments: To appear in Phys. Rev. Lett

  11. arXiv:cond-mat/0404592  [pdf, ps, other

    cond-mat.mtrl-sci

    On the energetic origin of self-limiting trenches formed around Ge/Si quantum dots

    Authors: D. T. Tambe, V. B. Shenoy

    Abstract: At high growth temperatures, the misfit strain at the boundary of Ge quantum dots on Si(001) is relieved by formation of trenches around the base of the islands. The depth of the trenches has been observed to saturate at a level that depends on the base-width of the islands. Using finite element simulations, we show that the self-limiting nature of trench depth is due to a competition between th… ▽ More

    Submitted 26 April, 2004; originally announced April 2004.

  12. Comparative study of dimer vacancies and dimer-vacancy lines on Si(001) and Ge(001)

    Authors: C. V. Ciobanu, D. T. Tambe, V. B. Shenoy

    Abstract: Although the clean Si(001) and Ge(001) surfaces are very similar, experiments to date have shown that dimer-vacancy (DV) defects self-organize into vacancy lines (VLs) on Si(001), but not on Ge(001). In this paper, we perform empirical-potential calculations aimed at understanding the differences between the vacancies on Si(001) and Ge(001). We identify three energetic parameters that characteri… ▽ More

    Submitted 18 March, 2004; v1 submitted 30 October, 2003; originally announced October 2003.

    Comments: 3 tables, 4 figures, to appear in Surface Science

    Journal ref: Surface Science, 556, 171 (2004).

  13. Atomic-scale perspective on the origin of attractive step interactions on Si(113)

    Authors: C. V. Ciobanu, D. T. Tambe, V. B. Shenoy, C. Z. Wang, K. M. Ho

    Abstract: Recent experiments have shown that steps on Si(113) surfaces self-organize into bunches due to a competition between long-range repulsive and short-range attractive interactions. Using empirical and tight-binding interatomic potentials, we investigate the physical origin of the short-range attraction, and report the formation and interaction energies of steps. We find that the short-range attrac… ▽ More

    Submitted 30 October, 2003; v1 submitted 24 April, 2003; originally announced April 2003.

    Comments: 4 pages, 3 figures, to appear in Phys. Rev B, Rapid Communications

    Journal ref: Physical Review B, 68, 201302(R) (2003)