Showing 1–29 of 29 results for author: Khademi, M

  1. arXiv:2404.14219  [pdf, other]

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset…

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  2. arXiv:2403.08002  [pdf, other]

    cs.CL cs.CV

    Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation

    Authors: Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Akshay Chaudhari, Serena Yeung-Levy, Curtis P. Langlotz, et al. (2 additional authors not shown)

    Abstract: The scaling laws and extraordinary performance of large foundation models motivate the development and utilization of such models in biomedicine. However, despite early promising results on some biomedical benchmarks, there are still major challenges that need to be addressed before these models can be used in real-world clinics. Frontier general-domain models such as GPT-4V still have significant…

    Submitted 26 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  3. Large Non-Volatile Frequency Tuning of Spin Hall Nano-Oscillators using Circular Memristive Nano-Gates

    Authors: Maha Khademi, Akash Kumar, Mona Rajabali, Saroj P. Dash, Johan Åkerman

    Abstract: Spin Hall nano oscillators (SHNOs) are promising candidates for neuromorphic computing due to their miniaturized dimensions, non-linearity, fast dynamics, and ability to synchronize in long chains and arrays. However, tuning the individual SHNOs in large chains/arrays, which is key to implementing synaptic control, has remained a challenge. Here, we demonstrate circular memristive nano-gates, both…

    Submitted 18 January, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Marie Sklodowska-Curie Actions, H2020-MSCA-ITN-2020; Project Acronym SPEAR; Grant Agreement No. 955671

  4. arXiv:2311.18775  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.SD eess.AS

    CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

    Authors: Zineng Tang, Ziyi Yang, Mahmoud Khademi, Yang Liu, Chenguang Zhu, Mohit Bansal

    Abstract: We present CoDi-2, a versatile and interactive Multimodal Large Language Model (MLLM) that can follow complex multimodal interleaved instructions, conduct in-context learning (ICL), reason, chat, edit, etc., in an any-to-any input-output modality paradigm. By aligning modalities with language for both encoding and generation, CoDi-2 empowers Large Language Models (LLMs) to not only understand comp…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Project Page: https://codi-2.github.io/

  5. arXiv:2308.01078  [pdf, other]

    gr-qc hep-th

    Black hole solutions to Einstein-Bel-Robinson gravity

    Authors: S. N. Sajadi, Robert B. Mann, H. Sheikhahmadi, M. Khademi

    Abstract: In this paper, we study the physical properties of black holes in the framework of the recently proposed Einstein-Bel-Robinson gravity. We show that interestingly the theory propagates a transverse and massive graviton on a maximally symmetric background with positive energy. There is also a single ghost-free branch that returns to the Einstein case when $\beta \to 0$. We find new black hole solutions to…

    Submitted 28 January, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: Major changes, with significant corrections and new solutions presented; figure added

  6. arXiv:2305.13738  [pdf, other]

    cs.CL cs.AI cs.CV

    i-Code Studio: A Configurable and Composable Framework for Integrative AI

    Authors: Yuwei Fang, Mahmoud Khademi, Chenguang Zhu, Ziyi Yang, Reid Pryzant, Yichong Xu, Yao Qian, Takuya Yoshioka, Lu Yuan, Michael Zeng, Xuedong Huang

    Abstract: Artificial General Intelligence (AGI) requires comprehensive understanding and generation capabilities for a variety of tasks spanning different modalities and functionalities. Integrative AI is one important direction to approach AGI, through combining multiple models to tackle complex multimodal tasks. However, there is a lack of a flexible and composable platform to facilitate efficient and eff…

    Submitted 23 May, 2023; originally announced May 2023.

  7. arXiv:2305.12311  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG eess.AS

    i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data

    Authors: Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

    Abstract: The convergence of text, visual, and audio data is a key step towards human-like artificial intelligence, however the current Vision-Language-Speech landscape is dominated by encoder-only models which lack generative abilities. We propose closing this gap with i-Code V2, the first model capable of generating natural language from any combination of Vision, Language, and Speech data. i-Code V2 is a…

    Submitted 20 May, 2023; originally announced May 2023.

  8. arXiv:2210.05063  [pdf, other]

    cs.CV

    Improving Dense Contrastive Learning with Dense Negative Pairs

    Authors: Berk Iskender, Zhenlin Xu, Simon Kornblith, En-Hung Chu, Maryam Khademi

    Abstract: Many contrastive representation learning methods learn a single global representation of an entire image. However, dense contrastive representation learning methods such as DenseCL (Wang et al., 2021) can learn better representations for tasks requiring stronger spatial localization of features, such as multi-label classification, detection, and segmentation. In this work, we study how to improve…

    Submitted 10 January, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  9. arXiv:2207.04186  [pdf, other]

    cs.CV

    A Study on Self-Supervised Object Detection Pretraining

    Authors: Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi

    Abstract: In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and projecting boxes to each augmented view and maximizing the similarity between corresponding box features. We study existing design choices in the literature, such as bo…

    Submitted 10 August, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

  10. Influence of Magnetic Fields on the Gas Rotation in the Galaxy $NGC\;6946$

    Authors: M. Khademi, S. Nasiri, F. S. Tabatabaei

    Abstract: Magnetic fields can play an important role in the energy balance and formation of gas structures in galaxies. However, their dynamical effect on the rotation curve of galaxies is immensely unexplored. We investigate the dynamical effect of the known magnetic arms of $NGC\;6946$ on its circular gas rotation traced in HI, considering two dark matter mass density models, ISO, and the universal NFW pr…

    Submitted 7 February, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 19 pages, 10 captioned figures, Accepted for publication in The Astrophysical Journal (ApJ)

  11. Kinematical asymmetry in the dwarf irregular galaxy WLM and a perturbed halo potential

    Authors: M. Khademi, Y. Yang, F. Hammer, S. Nasiri

    Abstract: WLM is a dwarf irregular that is seen almost edge-on that has prompted a number of kinematical studies investigating its rotation curve and its dark matter content. In this paper, we investigate the origin of the strong asymmetry of the rotation curve, which shows a significant discrepancy between the approaching and the receding side. We first examine whether an $m = 1$ perturbation (lopsidedness…

    Submitted 22 August, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 12 pages, 6 figures, Accepted for publication in Astronomy and Astrophysics (A & A), Preprint version available online

    Journal ref: A&A 654, A7 (2021)

  12. arXiv:2011.11765  [pdf, other]

    cs.CV cs.LG

    Boosting Contrastive Self-Supervised Learning with False Negative Cancellation

    Authors: Tri Huynh, Simon Kornblith, Matthew R. Walter, Michael Maire, Maryam Khademi

    Abstract: Self-supervised representation learning has made significant leaps fueled by progress in contrastive learning, which seeks to learn transformations that embed positive input pairs nearby, while pushing negative pairs far apart. While positive pairs can be generated reliably (e.g., as different views of the same image), it is difficult to accurately establish negative pairs, defined as samples from…

    Submitted 2 January, 2022; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Code is available at https://github.com/google-research/fnc

  13. Physical Properties of a Regular Rotating Black Hole: Thermodynamics, Stability, Quasinormal Modes

    Authors: S. H. Hendi, S. N. Sajadi, Maryam Khademi

    Abstract: Respecting the angular momentum conservation of torque-free systems, it is natural to consider rotating solutions of massive objects. Besides that, motivated by the realistic astrophysical black holes that rotate, we use the Newman-Janis formalism to construct a regular rotating black hole. We start with a nonlinearly charged regular static black hole in the framework of the standard general relat…

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: 19 pages, 14 figures

    Journal ref: Phys. Rev. D 103, 064016 (2021)

  14. arXiv:1903.06827  [pdf, other]

    cs.SI physics.soc-ph

    Does Homophily Make Socialbots More Influential? Exploring Infiltration Strategies

    Authors: Samaneh Hosseini Moghaddam, Mandana Khademi, Maghsoud Abbaspour

    Abstract: Socialbots are intelligent software controlling all the behavior of fake accounts in an online social network. They use artificial intelligence techniques to pass themselves off as human social media users. Socialbots exploit user trust to achieve their malicious goals, such as astroturfing, performing Sybil attacks, spamming, and harvesting private data. The first phase to countermeasure the mali…

    Submitted 15 March, 2019; originally announced March 2019.

  15. arXiv:1711.00740  [pdf, other]

    cs.LG cs.AI cs.PL cs.SE

    Learning to Represent Programs with Graphs

    Authors: Miltiadis Allamanis, Marc Brockschmidt, Mahmoud Khademi

    Abstract: Learning tasks on source code (i.e., formal languages) have been considered recently, but most work has tried to transfer natural language methods and does not capitalize on the unique opportunities offered by code's known syntax. For example, long-range dependencies induced by using the same variable or function in distant locations are often not considered. We propose to use graphs to represent…

    Submitted 4 May, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: Published in ICLR 2018. arXiv admin note: text overlap with arXiv:1705.07867

  16. arXiv:1710.10994  [pdf]

    cs.CL cs.IR

    Conceptual Text Summarizer: A new model in continuous vector space

    Authors: Mohammad Ebrahim Khademi, Mohammad Fakhredanesh, Seyed Mojtaba Hoseini

    Abstract: Traditional methods of summarization are not cost-effective and possible today. Extractive summarization is a process that helps to extract the most important sentences from a text automatically and generates a short informative summary. In this work, we propose an unsupervised method to summarize Persian texts. This method is a novel hybrid approach that clusters the concepts of the text using de…

    Submitted 1 September, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: The experimental results completed

  17. arXiv:1506.06408  [pdf, ps, other]

    math.OC

    A dynamic Stackelberg game for green supply chain management

    Authors: Mehrnoosh Khademi, Massimiliano Ferrara, Mehdi Salimi, Somayeh Sharifi

    Abstract: In this paper, we establish a dynamic game to allocate CSR (Corporate Social Responsibility) to the members of a supply chain. We propose a model of a three-tier supply chain in a decentralized state which includes a supplier, a manufacturer and a retailer. For analyzing supply chain performance in decentralized state and the relationships between the members of the supply chain, we use a Stackelb…

    Submitted 21 June, 2015; originally announced June 2015.

    Comments: arXiv admin note: text overlap with arXiv:1503.04772

  18. arXiv:1503.04772  [pdf, ps, other]

    math.OC econ.GN

    A dynamic game on Green Supply Chain Management

    Authors: Mehrnoosh Khademi, Massimiliano Ferrara, Bruno Pansera, Mehdi Salimi

    Abstract: In this paper, we establish a dynamic game to allocate CSR (Corporate Social Responsibility) to the members of a supply chain. We propose a model of three-tier supply chain in decentralized state that is including supplier, manufacturer and retailer. For analyzing supply chain performance in decentralized state and the relationships between the members of supply chain, we use Stackelberg game and…

    Submitted 12 March, 2015; originally announced March 2015.

  19. arXiv:1412.0065  [pdf, other]

    cs.CV

    3D Hand Pose Detection in Egocentric RGB-D Images

    Authors: Gregory Rogez, James S. Supancic III, Maryam Khademi, Jose Maria Martinez Montiel, Deva Ramanan

    Abstract: We focus on the task of everyday hand pose estimation from egocentric viewpoints. For this task, we show that depth sensors are particularly informative for extracting near-field interactions of the camera wearer with his/her environment. Despite the recent advances in full-body pose estimation using Kinect-like sensors, reliable monocular hand pose estimation in RGB-D images is still an unsolved…

    Submitted 28 November, 2014; originally announced December 2014.

    Comments: 14 pages, 15 figures, extended version of the corresponding ECCV workshop paper, submitted to International Journal of Computer Vision

  20. arXiv:1405.0085  [pdf]

    cs.CV

    Relative Facial Action Unit Detection

    Authors: Mahmoud Khademi, Louis-Philippe Morency

    Abstract: This paper presents a subject-independent facial action unit (AU) detection method by introducing the concept of relative AU detection, for scenarios where the neutral face is not provided. We propose a new classification objective function which analyzes the temporal neighborhood of the current frame to decide if the expression recently increased, decreased or showed no change. This approach is a…

    Submitted 30 April, 2014; originally announced May 2014.

    Comments: Accepted at IEEE Winter Conference on Applications of Computer Vision, Steamboat Springs Colorado, USA, 2014

  21. arXiv:1401.0864  [pdf]

    cs.IR

    Predicting a Business Star in Yelp from Its Reviews Text Alone

    Authors: Mingming Fan, Maryam Khademi

    Abstract: Yelp online reviews are invaluable source of information for users to choose where to visit or what to eat among numerous available options. But due to overwhelming number of reviews, it is almost impossible for users to go through all reviews and find the information they are looking for. To provide a business overview, one solution is to give the business a 1-5 star(s). This rating can be subjec…

    Submitted 4 January, 2014; originally announced January 2014.

    Comments: 5 pages, 6 figures, 2 tables

  22. arXiv:1011.2512  [pdf]

    cs.AI cs.LG

    Extended Active Learning Method

    Authors: Ali Akbar Kiaei, Saeed Bagheri Shouraki, Seyed Hossein Khasteh, Mahmoud Khademi, Alireza Ghatreh Samani

    Abstract: Active Learning Method (ALM) is a soft computing method which is used for modeling and control, based on fuzzy logic. Although ALM has shown that it acts well in dynamic environments, its operators cannot support it very well in complex situations due to losing data. Thus ALM can find better membership functions if more appropriate operators be chosen for it. This paper substituted two new operato…

    Submitted 17 January, 2011; v1 submitted 10 November, 2010; originally announced November 2010.

    Comments: 18 pages, 26 figures, 2 tables, submitted to the control engineering practice of Elsevier

  23. arXiv:1010.4951   

    cs.CV cs.LG

    Local Component Analysis for Nonparametric Bayes Classifier

    Authors: Mahmoud Khademi, Mohammad T. Manzuri-Shalmani, Mehran Safayani

    Abstract: The decision boundaries of Bayes classifier are optimal because they lead to maximum probability of correct decision. It means if we knew the prior probabilities and the class-conditional densities, we could design a classifier which gives the lowest probability of error. However, in classification based on nonparametric density estimation methods such as Parzen windows, the decision regions depen…

    Submitted 19 July, 2012; v1 submitted 24 October, 2010; originally announced October 2010.

    Comments: This paper has been withdrawn by the author due to an error in experimental results

  24. arXiv:1010.4561  [pdf]

    cs.AI

    New S-norm and T-norm Operators for Active Learning Method

    Authors: Ali Akbar Kiaei, Saeed Bagheri Shouraki, Seyed Hossein Khasteh, Mahmoud Khademi, Ali Reza Ghatreh Samani

    Abstract: Active Learning Method (ALM) is a soft computing method used for modeling and control based on fuzzy logic. All operators defined for fuzzy sets must serve as either fuzzy S-norm or fuzzy T-norm. Despite being a powerful modeling method, ALM does not possess operators which serve as S-norms and T-norms which deprive it of a profound analytical expression/form. This paper introduces two new operato…

    Submitted 6 February, 2011; v1 submitted 21 October, 2010; originally announced October 2010.

    Comments: 11 pages, 20 figures, under review of SPRINGER (Fuzzy Optimization and Decision Making)

    ACM Class: I.5.1; F.4.1; H.2.1

  25. arXiv:1004.0755  [pdf]

    cs.CV cs.LG

    Extended Two-Dimensional PCA for Efficient Face Representation and Recognition

    Authors: Mehran Safayani, Mohammad T. Manzuri-Shalmani, Mahmoud Khademi

    Abstract: In this paper a novel method called Extended Two-Dimensional PCA (E2DPCA) is proposed which is an extension to the original 2DPCA. We state that the covariance matrix of 2DPCA is equivalent to the average of the main diagonal of the covariance matrix of PCA. This implies that 2DPCA eliminates some covariance information that can be useful for recognition. E2DPCA instead of just using the main diag…

    Submitted 5 April, 2010; originally announced April 2010.

    Comments: Proc. of 4th International Conference on Intelligent Computer Communication and Processing (ICCP), Cluj-Napoca, Romania, pp. 295--298, 2008.

  26. arXiv:1004.0517  [pdf]

    cs.CV cs.LG

    Multilinear Biased Discriminant Analysis: A Novel Method for Facial Action Unit Representation

    Authors: Mahmoud Khademi, Mehran Safayani, Mohammad T. Manzuri-Shalmani

    Abstract: In this paper a novel efficient method for representation of facial action units by encoding an image sequence as a fourth-order tensor is presented. The multilinear tensor-based extension of the biased discriminant analysis (BDA) algorithm, called multilinear biased discriminant analysis (MBDA), is first proposed. Then, we apply the MBDA and two-dimensional BDA (2DBDA) algorithms, as the dimensio…

    Submitted 4 April, 2010; originally announced April 2010.

    Comments: Proc. of 16th Korea-Japan Joint Workshop on Frontiers of Computer Vision, Hiroshima, Japan, 2010.

  27. arXiv:1004.0515  [pdf]

    cs.CV cs.LG

    Recognizing Combinations of Facial Action Units with Different Intensity Using a Mixture of Hidden Markov Models and Neural Network

    Authors: Mahmoud Khademi, Mohammad T. Manzuri-Shalmani, Mohammad H. Kiapour, Ali A. Kiaei

    Abstract: Facial Action Coding System consists of 44 action units (AUs) and more than 7000 combinations. Hidden Markov models (HMMs) classifier has been used successfully to recognize facial action units (AUs) and expressions due to its ability to deal with AU dynamics. However, a separate HMM is necessary for each single AU and each AU combination. Since combinations of AU numbering in thousands, a more ef…

    Submitted 4 April, 2010; originally announced April 2010.

    Journal ref: LNCS vol. 5997, pp. 304--313, Springer, Heidelberg (Proc. of 9th IAPR Workshop on Multiple Classifier Systems), 2010.

  28. arXiv:1004.0512  [pdf]

    cs.CV

    Analysis, Interpretation, and Recognition of Facial Action Units and Expressions Using Neuro-Fuzzy Modeling

    Authors: Mahmoud Khademi, Mohammad Hadi Kiapour, Mohammad T. Manzuri-Shalmani, Ali A. Kiaei

    Abstract: In this paper an accurate real-time sequence-based system for representation, recognition, interpretation, and analysis of the facial action units (AUs) and expressions is presented. Our system has the following characteristics: 1) employing adaptive-network-based fuzzy inference systems (ANFIS) and temporal information, we developed a classification scheme based on neuro-fuzzy modeling of the AU…

    Submitted 4 April, 2010; originally announced April 2010.

    Journal ref: LNAI vol. 5998, pp. 161--172, Springer, Heidelberg (Proc. of 4th IAPR Workshop on Artificial Neural Networks in Pattern Recognition), 2010.

  29. arXiv:1004.0378   

    cs.CV cs.LG

    Facial Expression Representation and Recognition Using 2DHLDA, Gabor Wavelets, and Ensemble Learning

    Authors: Mahmoud Khademi, Mohammad H. Kiapour, Mehran Safayani, Mohammad T. Manzuri, M. Shojaei

    Abstract: In this paper, a novel method for representation and recognition of the facial expressions in two-dimensional image sequences is presented. We apply a variation of two-dimensional heteroscedastic linear discriminant analysis (2DHLDA) algorithm, as an efficient dimensionality reduction technique, to Gabor representation of the input sequence. 2DHLDA is an extension of the two-dimensional linear dis…

    Submitted 19 July, 2012; v1 submitted 2 April, 2010; originally announced April 2010.

    Comments: This paper has been withdrawn by the author due to an error in experimental results

    ACM Class: I.5