
Showing 1–33 of 33 results for author: Hashemi, H

Searching in archive cs.
  1. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2401.03302  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG stat.ML

    Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT

    Authors: Seyed Mohammad Hossein Hashemi, Leila Safari, Amirhossein Dadashzade Taromi

    Abstract: In the field of medical sciences, reliable detection and classification of brain tumors from images remains a formidable challenge due to the rarity of tumors within the population of patients. Therefore, the ability to detect tumors in anomaly scenarios is paramount for ensuring timely interventions and improved patient outcomes. This study addresses the issue by leveraging deep learning (DL) tec…

    Submitted 10 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2307.11749  [pdf, other]

    cs.LG cs.CR

    Differentially Private Heavy Hitter Detection using Federated Analytics

    Authors: Karan Chadha, Junye Chen, John Duchi, Vitaly Feldman, Hanieh Hashemi, Omid Javidbakht, Audra McMillan, Kunal Talwar

    Abstract: In this work, we study practical heuristics to improve the performance of prefix-tree based algorithms for differentially private heavy hitter detection. Our model assumes each user has multiple data points and the goal is to learn as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy. We propose an adaptive hyperparameter tuning…

    Submitted 21 July, 2023; originally announced July 2023.

  5. arXiv:2307.05925  [pdf, other]

    cs.IT eess.SP

    A Tractable Statistical Representation of IFTR Fading with Applications

    Authors: Maryam Olyaee, Hadi Hashemi, Juan M. Romero-Jerez

    Abstract: The recently introduced independent fluctuating two-ray (IFTR) fading model, consisting of two specular components fluctuating independently plus a diffuse component, has proven to provide an excellent fit to different wireless environments, including the millimeter-wave band. However, the original formulations of the probability density function (PDF) and cumulative distribution function (CDF) of…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: This work was submitted to the IEEE for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  6. arXiv:2307.02740  [pdf, other]

    cs.IR cs.CL

    Dense Retrieval Adaptation using Target Domain Description

    Authors: Helia Hashemi, Yong Zhuang, Sachith Sri Ram Kothur, Srivas Prasad, Edgar Meij, W. Bruce Croft

    Abstract: In information retrieval (IR), domain adaptation is the process of adapting a retrieval model to a new domain whose data distribution is different from the source domain. Existing methods in this area focus on unsupervised domain adaptation where they have access to the target document collection or supervised (often few-shot) domain adaptation where they additionally have access to (limited) labe…

    Submitted 5 July, 2023; originally announced July 2023.

  7. arXiv:2305.10403  [pdf, other]

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego, et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on…

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  8. arXiv:2302.04163  [pdf, ps, other]

    eess.SY cs.RO

    Task Space Control of Robot Manipulators based on Visual SLAM

    Authors: Seyed Hamed Hashemi, Jouni Mattila

    Abstract: This paper aims to address the open problem of designing a globally stable vision-based controller for robot manipulators. Accordingly, based on a hybrid mechanism, this paper proposes a novel task-space control law attained by taking the gradient of a potential function in SE(3). The key idea is to employ the Visual Simultaneous Localization and Mapping (VSLAM) algorithm to estimate a robot pose.…

    Submitted 8 February, 2023; originally announced February 2023.

  9. arXiv:2212.06712  [pdf, other]

    cs.IT

    Analysis of the Outage Probability of Ground-Based Relaying for Satellite Systems

    Authors: Hadi Hashemi, Beatriz Soret, M. Carmen Aguayo-Torres

    Abstract: This paper investigates the theoretical basis for using ground relaying in multi-antenna satellites exposed to blocking situations. Inactive and unobstructed User Equipments (UEs) located on ground are the relaying nodes of UEs that are not in the field of view of the satellite. Exact closed-form relationships of the Signal-to-Noise Ratio (SNR) and the outage probability are obtained for the case…

    Submitted 13 December, 2022; originally announced December 2022.

  10. arXiv:2212.06264  [pdf, other]

    cs.CE cs.CR cs.DC cs.LG

    Data Leakage via Access Patterns of Sparse Features in Deep Learning-based Recommendation Systems

    Authors: Hanieh Hashemi, Wenjie Xiong, Liu Ke, Kiwan Maeng, Murali Annavaram, G. Edward Suh, Hsien-Hsin S. Lee

    Abstract: Online personalized recommendation services are generally hosted in the cloud where users query the cloud-based model to receive recommended input such as merchandise of interest or news feed. State-of-the-art recommendation models rely on sparse and dense features to represent users' profile information and the items they interact with. Although sparse features account for 99% of the total model…

    Submitted 12 December, 2022; originally announced December 2022.

  11. arXiv:2207.00083  [pdf, other]

    cs.CR cs.AR cs.LG

    DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware

    Authors: Hanieh Hashemi, Yongqin Wang, Murali Annavaram

    Abstract: Privacy and security-related concerns are growing as machine learning reaches diverse application domains. The data holders want to train or infer with private data while exploiting accelerators, such as GPUs, that are hosted in the cloud. Cloud systems are vulnerable to attackers that compromise the privacy of data and integrity of computations. Tackling such a challenge requires unifying theoret…

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture. arXiv admin note: text overlap with arXiv:2105.00334

  12. arXiv:2112.13416  [pdf, other]

    cs.CR cs.LG cs.MM

    Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings

    Authors: Tiantian Feng, Hanieh Hashemi, Rajat Hebbar, Murali Annavaram, Shrikanth S. Narayanan

    Abstract: Speech emotion recognition (SER) processes speech signals to detect and characterize expressed perceived emotions. Many SER application systems often acquire and transmit speech data collected at the client-side to remote cloud platforms for inference and decision making. However, speech data carry rich information not only about emotions conveyed in vocal expressions, but also other sensitive dem…

    Submitted 22 December, 2022; v1 submitted 26 December, 2021; originally announced December 2021.

  13. arXiv:2107.12958  [pdf, other]

    cs.DC cs.CR cs.IT cs.LG

    Adaptive Verifiable Coded Computing: Towards Fast, Secure and Private Distributed Machine Learning

    Authors: Tingting Tang, Ramy E. Ali, Hanieh Hashemi, Tynan Gangwani, Salman Avestimehr, Murali Annavaram

    Abstract: Stragglers, Byzantine workers, and data privacy are the main bottlenecks in distributed cloud computing. Some prior works proposed coded computing strategies to jointly address all three challenges. They require either a large number of workers, a significant communication cost or a significant computational complexity to tolerate Byzantine workers. Much of the overhead in prior schemes comes from…

    Submitted 22 March, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

  14. arXiv:2106.15085  [pdf, other]

    cs.CL

    Automatic Construction of Enterprise Knowledge Base

    Authors: Junyi Chai, Yujie He, Homa Hashemi, Bing Li, Daraksha Parveen, Ranganath Kondapally, Wenjin Xu

    Abstract: In this paper, we present an automatic knowledge base construction system from large scale enterprise documents with minimal efforts of human intervention. In the design and deployment of such a knowledge mining system for enterprise, we faced several challenges including data distributional shift, performance evaluation, compliance requirements and other practical issues. We leveraged state-of-th…

    Submitted 29 June, 2021; originally announced June 2021.

  15. arXiv:2106.09227  [pdf, other]

    cs.IR

    Current Challenges and Future Directions in Podcast Information Access

    Authors: Rosie Jones, Hamed Zamani, Markus Schedl, Ching-Wei Chen, Sravana Reddy, Ann Clifton, Jussi Karlgren, Helia Hashemi, Aasish Pappu, Zahra Nazari, Longqi Yang, Oguz Semerci, Hugues Bouchard, Ben Carterette

    Abstract: Podcasts are spoken documents across a wide-range of genres and styles, with growing listenership across the world, and a rapidly lowering barrier to entry for both listeners and creators. The great strides in search and recommendation in research and industry have yet to see impact in the podcast space, where recommendations are still largely driven by word of mouth. In this perspective paper, we…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: SIGIR 2021

  16. arXiv:2105.02295  [pdf, other]

    cs.CR cs.AR cs.LG

    Byzantine-Robust and Privacy-Preserving Framework for FedML

    Authors: Hanieh Hashemi, Yongqin Wang, Chuan Guo, Murali Annavaram

    Abstract: Federated learning has emerged as a popular paradigm for collaboratively training a model from data distributed among a set of clients. This learning setting presents, among others, two unique challenges: how to protect privacy of the clients' data during training, and how to ensure integrity of the trained model. We propose a two-pronged solution that aims to address both challenges under a singl…

    Submitted 5 May, 2021; originally announced May 2021.

    Journal ref: Security and Safety in Machine Learning Systems Workshop in ICLR 2021

  17. arXiv:2105.00334  [pdf, other]

    cs.CR cs.AR cs.LG

    Privacy and Integrity Preserving Training Using Trusted Hardware

    Authors: Hanieh Hashemi, Yongqin Wang, Murali Annavaram

    Abstract: Privacy and security-related concerns are growing as machine learning reaches diverse application domains. The data holders want to train with private data while exploiting accelerators, such as GPUs, that are hosted in the cloud. However, Cloud systems are vulnerable to attackers that compromise the privacy of data and integrity of computations. This work presents DarKnight, a framework for large…

    Submitted 1 May, 2021; originally announced May 2021.

    Journal ref: Distributed and Private Machine Learning ICLR 2021 Workshop

  18. arXiv:2103.03221  [pdf, ps, other]

    cs.LG q-bio.QM

    GenoML: Automated Machine Learning for Genomics

    Authors: Mary B. Makarious, Hampton L. Leonard, Dan Vitale, Hirotaka Iwaki, David Saffo, Lana Sargent, Anant Dadu, Eduardo Salmerón Castaño, John F. Carter, Melina Maleknia, Juan A. Botia, Cornelis Blauwendraat, Roy H. Campbell, Sayed Hadi Hashemi, Andrew B. Singleton, Mike A. Nalls, Faraz Faghri

    Abstract: GenoML is a Python package automating machine learning workflows for genomics (genetics and multi-omics) with an open science philosophy. Genomics data require significant domain expertise to clean, pre-process, harmonize and perform quality control of the data. Furthermore, tuning, validation, and interpretation involve taking into account the biology and possibly the limitations of the underlyin…

    Submitted 4 March, 2021; originally announced March 2021.

  19. arXiv:2010.07541  [pdf, other]

    cs.DC

    Secure and Fault Tolerant Decentralized Learning

    Authors: Saurav Prakash, Hanieh Hashemi, Yongqin Wang, Murali Annavaram, Salman Avestimehr

    Abstract: Federated learning (FL) is a promising paradigm for training a global model over data distributed across multiple data owners without centralizing clients' raw data. However, sharing of local model updates can also reveal information of clients' local datasets. Trusted execution environments (TEEs) within the FL server have been recently deployed by companies like Meta for secure aggregation. Howe…

    Submitted 13 September, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

  20. arXiv:2006.07548  [pdf, other]

    cs.IR cs.CL cs.LG

    Guided Transformer: Leveraging Multiple External Sources for Representation Learning in Conversational Search

    Authors: Helia Hashemi, Hamed Zamani, W. Bruce Croft

    Abstract: Asking clarifying questions in response to ambiguous or faceted queries has been recognized as a useful technique for various information retrieval systems, especially conversational search systems with limited bandwidth interfaces. Analyzing and generating clarifying questions have been studied recently but the accurate utilization of user responses to clarifying questions has been relatively les…

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: To appear in the Proceedings of ACM SIGIR 2020. 10 pages

  21. arXiv:2006.01300  [pdf, other]

    cs.CR

    DarKnight: A Data Privacy Scheme for Training and Inference of Deep Neural Networks

    Authors: Hanieh Hashemi, Yongqin Wang, Murali Annavaram

    Abstract: Protecting the privacy of input data is of growing importance as machine learning methods reach new application domains. In this paper, we provide a unified training and inference framework for large DNNs while protecting input privacy and computation integrity. Our approach called DarKnight uses a novel data blinding strategy using matrix masking to create input obfuscation within a trusted execu…

    Submitted 15 October, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

  22. arXiv:2004.14020  [pdf, other]

    cs.NI cs.DC cs.LG

    Caramel: Accelerating Decentralized Distributed Deep Learning with Computation Scheduling

    Authors: Sayed Hadi Hashemi, Sangeetha Abdu Jyothi, Brighten Godfrey, Roy Campbell

    Abstract: The method of choice for parameter aggregation in Deep Neural Network (DNN) training, a network-intensive task, is shifting from the Parameter Server model to decentralized aggregation schemes (AllReduce) inspired by theoretical guarantees of better performance. However, current implementations of AllReduce overlook the interdependence of communication and computation, resulting in significant per…

    Submitted 29 April, 2020; originally announced April 2020.

  23. arXiv:1905.08957  [pdf, other]

    cs.IR cs.CL

    ANTIQUE: A Non-Factoid Question Answering Benchmark

    Authors: Helia Hashemi, Mohammad Aliannejadi, Hamed Zamani, W. Bruce Croft

    Abstract: Considering the widespread use of mobile and voice search, answer passage retrieval for non-factoid questions plays a critical role in modern information retrieval systems. Despite the importance of the task, the community still feels the significant lack of large-scale non-factoid question answering collections with real questions and comprehensive relevance judgments. In this paper, we develop a…

    Submitted 19 August, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

  24. arXiv:1905.04264  [pdf, other]

    cs.DC

    PartitionedVC: Partitioned External Memory Graph Analytics Framework for SSDs

    Authors: Kiran Kumar Matam, Hanieh Hashemi, Murali Annavaram

    Abstract: Graph analytics are at the heart of a broad range of applications such as drug discovery, page ranking, and recommendation systems. When graph size exceeds memory size, out-of-core graph processing is needed. For the widely used external memory graph processing systems, accessing storage becomes the bottleneck. We make the observation that nearly all graph algorithms have a dynamically varying num…

    Submitted 11 February, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: 13 pages

  25. arXiv:1904.06578  [pdf]

    physics.comp-ph cs.LG

    Deep-learning PDEs with unlabeled data and hardwiring physics laws

    Authors: S. Mohammad H. Hashemi, Demetri Psaltis

    Abstract: Providing fast and accurate solutions to partial differential equations is a problem of continuous interest to the fields of applied mathematics and physics. With the recent advances in machine learning, the adoption of learning techniques in this domain is being eagerly pursued. We build upon earlier works on linear and homogeneous PDEs, and develop convolutional deep neural networks that can accura…

    Submitted 13 April, 2019; originally announced April 2019.

  26. arXiv:1812.04401  [pdf, other]

    cs.CC

    Output-Oblivious Stochastic Chemical Reaction Networks

    Authors: Ben Chugg, Anne Condon, Hooman Hashemi

    Abstract: We classify the functions $f:\mathbb{N}^2 \rightarrow \mathbb{N}$ which are stably computable by output-oblivious Stochastic Chemical Reaction Networks (CRNs), i.e., systems of reactions in which output species are never reactants. While it is known that precisely the semilinear functions are stably computable by CRNs, such CRNs sometimes rely on initially producing too many output species and the…

    Submitted 30 August, 2022; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: Published in OPODIS 2018. Latest version adds appendix containing all proofs

  27. arXiv:1803.03288  [pdf, other]

    cs.DC cs.LG cs.PF

    TicTac: Accelerating Distributed Deep Learning with Communication Scheduling

    Authors: Sayed Hadi Hashemi, Sangeetha Abdu Jyothi, Roy H. Campbell

    Abstract: State-of-the-art deep learning systems rely on iterative distributed training to tackle the increasing complexity of models and input data. The iteration time in these communication-heavy systems depends on the computation time, communication time and the extent of overlap of computation and communication. In this work, we identify a shortcoming in systems with graph representation for computati…

    Submitted 3 October, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

  28. arXiv:1710.00112  [pdf]

    cs.DC cs.LG stat.ML

    Toward Scalable Machine Learning and Data Mining: the Bioinformatics Case

    Authors: Faraz Faghri, Sayed Hadi Hashemi, Mohammad Babaeizadeh, Mike A. Nalls, Saurabh Sinha, Roy H. Campbell

    Abstract: In an effort to overcome the data deluge in computational biology and bioinformatics and to facilitate bioinformatics research in the era of big data, we identify some of the most influential algorithms that have been widely used in the bioinformatics community. These top data mining and machine learning algorithms cover classification, clustering, regression, graphical model-based learning, and d…

    Submitted 29 September, 2017; originally announced October 2017.

  29. arXiv:1710.00110  [pdf, other]

    cs.CR

    Decentralized User-Centric Access Control using PubSub over Blockchain

    Authors: Sayed Hadi Hashemi, Faraz Faghri, Roy H Campbell

    Abstract: We present a mechanism that puts users in the center of control and empowers them to dictate the access to their collections of data. Revisiting the fundamental mechanisms in security for providing protection, our solution uses capabilities, access lists, and access rights following well-understood formal notions for reasoning about access. This contribution presents a practical, correct, auditabl…

    Submitted 29 September, 2017; originally announced October 2017.

  30. arXiv:1612.00521  [pdf, other]

    cs.DC

    Performance Modeling of Distributed Deep Neural Networks

    Authors: Sayed Hadi Hashemi, Shadi A. Noghabi, William Gropp, Roy H Campbell

    Abstract: During the past decade, machine learning has become extremely popular and can be found in many aspects of our everyday life. Nowadays, with the explosion of data and the rapid growth of computation capacity, Distributed Deep Neural Networks (DDNNs), which can improve their performance linearly with more computation resources, have become hot and trending. However, there has not been an in depth study o…

    Submitted 14 December, 2016; v1 submitted 1 December, 2016; originally announced December 2016.

  31. arXiv:1607.04768  [pdf, other]

    math.CO cs.DM

    Hoffmann-Ostenhof's conjecture for traceable cubic graphs

    Authors: F. Abdolhosseini, S. Akbari, H. Hashemi, M. S. Moradian

    Abstract: It was conjectured by Hoffmann-Ostenhof that the edge set of every connected cubic graph can be decomposed into a spanning tree, a matching and a family of cycles. In this paper, we show that this conjecture holds for traceable cubic graphs.

    Submitted 16 July, 2016; originally announced July 2016.

    MSC Class: 05C45; 05C70 (Primary)

  32. arXiv:1409.7637  [pdf]

    cs.NI

    Experimental Demonstration of Nanosecond Accuracy Wireless Network Synchronization

    Authors: Marcelo Segura, S. Niranjayan, Hossein Hashemi, Andreas F. Molisch

    Abstract: Accurate wireless timing synchronization has been an extremely important topic in wireless sensor networks, required in applications ranging from distributed beam forming to precision localization and navigation. However, it is very challenging to realize, in particular when the required accuracy should be better than the runtime between the nodes. This work presents, to our knowledge for the firs…

    Submitted 26 September, 2014; originally announced September 2014.

    Comments: Submitted to ICC 2015

  33. An improved genetic algorithm with a local optimization strategy and an extra mutation level for solving traveling salesman problem

    Authors: Keivan Borna, Vahid Haji Hashemi

    Abstract: The Traveling salesman problem (TSP) is proved to be NP-complete in most cases. The genetic algorithm (GA) is one of the most useful algorithms for solving this problem. In this paper a conventional GA is compared with an improved hybrid GA in solving TSP. The improved or hybrid GA consists of conventional GA and two local optimization strategies. The first strategy is extracting all sequential gro…

    Submitted 10 September, 2014; originally announced September 2014.

    Comments: 7 pages, 1 Figure

    Journal ref: International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol. 4, No.4, August 2014, 47-53