Skip to main content

Showing 1–13 of 13 results for author: Gholami, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.12344  [pdf, other

    cs.CV cs.AI cs.LG

    OCT-SelfNet: A Self-Supervised Framework with Multi-Modal Datasets for Generalized and Robust Retinal Disease Detection

    Authors: Fatema-E Jannat, Sina Gholami, Minhaj Nur Alam, Hamed Tabkhi

    Abstract: Despite the revolutionary impact of AI and the development of locally trained algorithms, achieving widespread generalized learning from multi-modal data in medical AI remains a significant challenge. This gap hinders the practical deployment of scalable medical AI solutions. Addressing this challenge, our research contributes a self-supervised robust machine learning framework, OCT-SelfNet, for d… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures, 6 tables

  2. arXiv:2310.07830  [pdf, other

    cs.CL cs.AI cs.LG

    Does Synthetic Data Make Large Language Models More Efficient?

    Authors: Sia Gholami, Marwan Omar

    Abstract: Natural Language Processing (NLP) has undergone transformative changes with the advent of deep learning methodologies. One challenge persistently confronting researchers is the scarcity of high-quality, annotated datasets that drive these models. This paper explores the nuances of synthetic data generation in NLP, with a focal point on template-based question generation. By assessing its advantage… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  3. arXiv:2310.04573  [pdf, other

    cs.LG cs.AI cs.CL

    Can pruning make Large Language Models more efficient?

    Authors: Sia Gholami, Marwan Omar

    Abstract: Transformer models have revolutionized natural language processing with their unparalleled ability to grasp complex contextual relationships. However, the vast number of parameters in these models has raised concerns regarding computational efficiency, environmental impact, and deployability on resource-limited platforms. To address these challenges, this paper investigates the application of weig… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  4. arXiv:2310.02421  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Can a student Large Language Model perform as well as it's teacher?

    Authors: Sia Gholami, Marwan Omar

    Abstract: The burgeoning complexity of contemporary deep learning models, while achieving unparalleled accuracy, has inadvertently introduced deployment challenges in resource-constrained environments. Knowledge distillation, a technique aiming to transfer knowledge from a high-capacity "teacher" model to a streamlined "student" model, emerges as a promising solution to this dilemma. This paper provides a c… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  5. arXiv:2309.06589  [pdf, ps, other

    cs.CL cs.AI

    Do Generative Large Language Models need billions of parameters?

    Authors: Sia Gholami, Marwan Omar

    Abstract: This paper presents novel systems and methodologies for the development of efficient large language models (LLMs). It explores the trade-offs between model size, performance, and computational resources, with the aim of maximizing the efficiency of these AI systems. The research explores novel methods that allow different parts of the model to share parameters, reducing the total number of unique… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  6. arXiv:2212.02450  [pdf, other

    cs.CV cs.AI

    Framework for 2D Ad placements in LinearTV

    Authors: Divya Bhargavi, Karan Sindwani, Sia Gholami

    Abstract: Virtual Product placement(VPP) is the advertising technique of digitally placing a branded object into the scene of a movie or TV show. This type of advertising provides the ability for brands to reach consumers without interrupting the viewing experience with a commercial break, as the products are seen in the background or as props. Despite this being a billion-dollar industry, ad rendering tech… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  7. arXiv:2208.09921  [pdf

    cs.LG

    Alexa, Predict My Flight Delay

    Authors: Sia Gholami, Saba Khashe

    Abstract: Airlines are critical today for carrying people and commodities on time. Any delay in the schedule of these planes can potentially disrupt the business and trade of thousands of employees at any given time. Therefore, precise flight delay prediction is beneficial for the aviation industry and passenger travel. Recent research has focused on using artificial intelligence algorithms to predict the p… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

  8. arXiv:2203.00734  [pdf, other

    cs.CV cs.AI

    Knock, knock. Who's there? -- Identifying football player jersey numbers with synthetic data

    Authors: Divya Bhargavi, Erika Pelaez Coyotl, Sia Gholami

    Abstract: Automatic player identification is an essential and complex task in sports video analysis. Different strategies have been devised over the years, but identification based on jersey numbers is one of the most common approaches given its versatility and relative simplicity. However, automatic detection of jersey numbers is still challenging due to changing camera angles, low video resolution, small… ▽ More

    Submitted 4 April, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

  9. arXiv:2111.11520  [pdf, other

    cs.CL cs.IR cs.LG

    Zero-Shot Open-Book Question Answering

    Authors: Sia Gholami, Mehdi Noori

    Abstract: Open book question answering is a subset of question answering tasks where the system aims to find answers in a given set of documents (open-book) and common knowledge about a topic. This article proposes a solution for answering natural language questions from a corpus of Amazon Web Services (AWS) technical documents with no domain-specific labeled data (zero-shot). These questions can have yes-n… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  10. arXiv:2105.09809  [pdf, other

    cs.HC cs.RO

    Quantitative Physical Ergonomics Assessment of Teleoperation Interfaces

    Authors: Soheil Gholami, Marta Lorenzini, Elena De Momi, Arash Ajoudani

    Abstract: Human factors and ergonomics are the essential constituents of teleoperation interfaces, which can significantly affect the human operator's performance. Thus, a quantitative evaluation of these elements and the ability to establish reliable comparison bases for different teleoperation interfaces are the keys to select the most suitable one for a particular application. However, most of the works… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 10 pages, 9 figures, submitted to IEEE Transactions on Human-Machine Systems

  11. arXiv:2104.11757  [pdf, ps, other

    cs.CY

    Becoming Good at AI for Good

    Authors: Meghana Kshirsagar, Caleb Robinson, Siyu Yang, Shahrzad Gholami, Ivan Klyuzhin, Sumit Mukherjee, Md Nasir, Anthony Ortiz, Felipe Oviedo, Darren Tanner, Anusua Trivedi, Yixi Xu, Ming Zhong, Bistra Dilkina, Rahul Dodhia, Juan M. Lavista Ferres

    Abstract: AI for good (AI4G) projects involve develo** and applying artificial intelligence (AI) based solutions to further goals in areas such as sustainability, health, humanitarian aid, and social justice. Develo** and deploying such solutions must be done in collaboration with partners who are experts in the domain in question and who already have experience in making progress towards such goals. Ba… ▽ More

    Submitted 3 May, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted to AIES-2021

  12. arXiv:1903.06669  [pdf, other

    stat.AP cs.AI cs.LG

    Stay Ahead of Poachers: Illegal Wildlife Poaching Prediction and Patrol Planning Under Uncertainty with Field Test Evaluations

    Authors: Lily Xu, Shahrzad Gholami, Sara Mc Carthy, Bistra Dilkina, Andrew Plumptre, Milind Tambe, Rohit Singh, Mustapha Nsubuga, Joshua Mabonga, Margaret Driciru, Fred Wanyama, Aggrey Rwetsiba, Tom Okello, Eric Enyel

    Abstract: Illegal wildlife poaching threatens ecosystems and drives endangered species toward extinction. However, efforts for wildlife protection are constrained by the limited resources of law enforcement agencies. To help combat poaching, the Protection Assistant for Wildlife Security (PAWS) is a machine learning pipeline that has been developed as a data-driven approach to identify areas at high risk of… ▽ More

    Submitted 5 November, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: 12 pages, 11 figures. Short paper published in ICDE 2020

  13. arXiv:1312.6157  [pdf, other

    cs.LG cs.NE

    Distinction between features extracted using deep belief networks

    Authors: Mohammad Pezeshki, Sajjad Gholami, Ahmad Nickabadi

    Abstract: Data representation is an important pre-processing step in many machine learning algorithms. There are a number of methods used for this task such as Deep Belief Networks (DBNs) and Discrete Fourier Transforms (DFTs). Since some of the features extracted using automated feature extraction methods may not always be related to a specific machine learning task, in this paper we propose two methods in… ▽ More

    Submitted 2 January, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: 4 pages, 4 figures, ICLR 2014 workshop track