-
MM-VID: Advancing Video Understanding with GPT-4V(ision)
Authors:
Kevin Lin,
Faisal Ahmed,
Linjie Li,
Chung-Ching Lin,
Ehsan Azarnasab,
Zhengyuan Yang,
Jianfeng Wang,
Lin Liang,
Zicheng Liu,
Yumao Lu,
Ce Liu,
Lijuan Wang
Abstract:
We present MM-VID, an integrated system that harnesses the capabilities of GPT-4V, combined with specialized tools in vision, audio, and speech, to facilitate advanced video understanding. MM-VID is designed to address the challenges posed by long-form videos and intricate tasks such as reasoning within hour-long content and gras** storylines spanning multiple episodes. MM-VID uses a video-to-sc…
▽ More
We present MM-VID, an integrated system that harnesses the capabilities of GPT-4V, combined with specialized tools in vision, audio, and speech, to facilitate advanced video understanding. MM-VID is designed to address the challenges posed by long-form videos and intricate tasks such as reasoning within hour-long content and gras** storylines spanning multiple episodes. MM-VID uses a video-to-script generation with GPT-4V to transcribe multimodal elements into a long textual script. The generated script details character movements, actions, expressions, and dialogues, paving the way for large language models (LLMs) to achieve video understanding. This enables advanced capabilities, including audio description, character identification, and multimodal high-level comprehension. Experimental results demonstrate the effectiveness of MM-VID in handling distinct video genres with various video lengths. Additionally, we showcase its potential when applied to interactive environments, such as video games and graphic user interfaces.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Domain-specific optimization and diverse evaluation of self-supervised models for histopathology
Authors:
Jeremy Lai,
Faruk Ahmed,
Supriya Vijay,
Tiam Jaroensri,
Jessica Loo,
Saurabh Vyawahare,
Saloni Agarwal,
Fayaz Jamil,
Yossi Matias,
Greg S. Corrado,
Dale R. Webster,
Jonathan Krause,
Yun Liu,
Po-Hsuan Cameron Chen,
Ellery Wulczyn,
David F. Steiner
Abstract:
Task-specific deep learning models in histopathology offer promising opportunities for improving diagnosis, clinical research, and precision medicine. However, development of such models is often limited by availability of high-quality data. Foundation models in histopathology that learn general representations across a wide range of tissue types, diagnoses, and magnifications offer the potential…
▽ More
Task-specific deep learning models in histopathology offer promising opportunities for improving diagnosis, clinical research, and precision medicine. However, development of such models is often limited by availability of high-quality data. Foundation models in histopathology that learn general representations across a wide range of tissue types, diagnoses, and magnifications offer the potential to reduce the data, compute, and technical expertise necessary to develop task-specific deep learning models with the required level of model performance. In this work, we describe the development and evaluation of foundation models for histopathology via self-supervised learning (SSL). We first establish a diverse set of benchmark tasks involving 17 unique tissue types and 12 unique cancer types and spanning different optimal magnifications and task types. Next, we use this benchmark to explore and evaluate histopathology-specific SSL methods followed by further evaluation on held out patch-level and weakly supervised tasks. We found that standard SSL methods thoughtfully applied to histopathology images are performant across our benchmark tasks and that domain-specific methodological improvements can further increase performance. Our findings reinforce the value of using domain-specific SSL methods in pathology, and establish a set of high quality foundation models to enable further research across diverse applications.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Entropy Based Multi-robot Active SLAM
Authors:
Muhammad Farhan Ahmed,
Matteo Maragliano,
Vincent Frémont,
Carmine Tommaso Recchiuto
Abstract:
In this article, we present an efficient multi-robot active SLAM framework that involves a frontier-sharing method for maximum exploration of an unknown environment. It encourages the robots to spread into the environment while weighting the goal frontiers with the pose graph SLAM uncertainly and path entropy. Our approach works on a limited number of frontier points and weights the goal frontiers…
▽ More
In this article, we present an efficient multi-robot active SLAM framework that involves a frontier-sharing method for maximum exploration of an unknown environment. It encourages the robots to spread into the environment while weighting the goal frontiers with the pose graph SLAM uncertainly and path entropy. Our approach works on a limited number of frontier points and weights the goal frontiers with a utility function that encapsulates both the SLAM and map uncertainties, thus providing an efficient and not computationally expensive solution. Our approach has been tested on publicly available simulation environments and on real robots. An accumulative 31% more coverage than similar state-of-the-art approaches has been obtained, proving the capability of our approach for efficient environment exploration.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Impact of global monopole on heavy mesons in hot-dense medium
Authors:
M. Abu-Shady,
Faizuddin Ahmed
Abstract:
This research study is primarily focus on investigating how the topological effects influence the eigenvalue solutions in the presence of a hot-dense medium. To accomplish this, we employ the non-relativistic Schrödinger wave equation, taking into consideration both the quantum flux field and an interaction potential. Through this approach, we determine the energy eigenvalues and their correspondi…
▽ More
This research study is primarily focus on investigating how the topological effects influence the eigenvalue solutions in the presence of a hot-dense medium. To accomplish this, we employ the non-relativistic Schrödinger wave equation, taking into consideration both the quantum flux field and an interaction potential. Through this approach, we determine the energy eigenvalues and their corresponding wave functions using the Nikiforov-Uvarov method. Our findings indicate that when we consider both the topological effects and the magnetic flux, $Φ$, there is a noticeable reduction in the binding energy within the hot-dense medium. Additionally, we analyze the role of the baryonic potential in sha** the binding energy within the $(T, u_b)$ plane. Interestingly, it is evident that the influence of the baryonic potential becomes more pronounced as its values decrease
△ Less
Submitted 17 May, 2024; v1 submitted 29 September, 2023;
originally announced October 2023.
-
Efficient Frontier Management for Collaborative Active SLAM
Authors:
Muhammad Farhan Ahmed,
Matteo Maragliano,
Vincent FremontCarmine,
Tommaso Recchiuto,
Antonio Sgorbissa
Abstract:
In autonomous robotics, a critical challenge lies in develo** robust solutions for Active Collaborative SLAM, wherein multiple robots collaboratively explore and map an unknown environment while intelligently coordinating their movements and sensor data acquisitions. In this article, we present an efficient centralized frontier sharing approach that maximizes exploration by taking into account i…
▽ More
In autonomous robotics, a critical challenge lies in develo** robust solutions for Active Collaborative SLAM, wherein multiple robots collaboratively explore and map an unknown environment while intelligently coordinating their movements and sensor data acquisitions. In this article, we present an efficient centralized frontier sharing approach that maximizes exploration by taking into account information gain in the merged map, distance, and reward computation among frontier candidates and encourages the spread of agents into the environment. Eventually, our method efficiently spreads the robots for maximum exploration while kee** SLAM uncertainty low. Additionally, we also present two coordination approaches, synchronous and asynchronous to prioritize robot goal assignments by the central server. The proposed method is implemented in ROS and evaluated through simulation and experiments on publicly available datasets and similar methods, rendering promising results.
△ Less
Submitted 15 May, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Active SLAM Utility Function Exploiting Path Entropy
Authors:
Muhammad Farhan Ahmed,
Vincent Fremont,
Isabelle Fantoni
Abstract:
In this article we present a utility function for Active SLAM (A-SLAM) which utilizes map entropy along with D-Optimality criterion metrices for weighting goal frontier candidates. We propose a utility function for frontier goal selection that exploits the occupancy grid map by utilizing the path entropy and favors unknown map locations for maximum area coverage while maintaining a low localizatio…
▽ More
In this article we present a utility function for Active SLAM (A-SLAM) which utilizes map entropy along with D-Optimality criterion metrices for weighting goal frontier candidates. We propose a utility function for frontier goal selection that exploits the occupancy grid map by utilizing the path entropy and favors unknown map locations for maximum area coverage while maintaining a low localization and map** uncertainties. We quantify the efficiency of our method using various graph connectivity matrices and map efficiency indexes for an environment exploration task. Using simulation and experimental results against similar approaches we achieve an average of 32% more coverage using publicly available data sets.
△ Less
Submitted 16 November, 2023; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Noise-Crypt: Image Encryption with Non-linear Noise, Hybrid Chaotic Maps, and Hashing
Authors:
Laiba Asghar,
Fawad Ahmed,
Muhammad Shahbaz Khan,
Arshad Arshad,
Jawad Ahmad
Abstract:
To secure the digital images over insecure transmission channels, a new image encryption algorithm Noise-Crypt is proposed in this paper. Noise-Crypt integrates non-linear random noise, hybrid chaotic maps, and SHA-256 hashing algorithm. The utilized hybrid chaotic maps are the logistic-tent and the logistic-sine-cosine map. The hybrid chaotic maps enhance the pseudorandom sequence generation and…
▽ More
To secure the digital images over insecure transmission channels, a new image encryption algorithm Noise-Crypt is proposed in this paper. Noise-Crypt integrates non-linear random noise, hybrid chaotic maps, and SHA-256 hashing algorithm. The utilized hybrid chaotic maps are the logistic-tent and the logistic-sine-cosine map. The hybrid chaotic maps enhance the pseudorandom sequence generation and selection of substitution boxes, while the logistic-sine-cosine map induces non-linearity in the algorithm through random noise. This deliberate inclusion of noise contributes to increased resistance against cryptanalysis. The proposed scheme has been evaluated for several security parameters, such as differential attacks, entropy, correlation, etc. Extensive evaluation demonstrates the efficacy of the proposed scheme, with almost ideal values of entropy of 7.99 and correlation of -0.0040. Results of the security analysis validate the potency of the proposed scheme in achieving robust image encryption.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images
Authors:
Fahad Ahmed,
Reem Abdel-Salam,
Leon Hamnett,
Mary Adewunmi,
Temitope Ayano
Abstract:
Breast cancer is one of the leading causes of death for women worldwide. Early screening is essential for early identification, but the chance of survival declines as the cancer progresses into advanced stages. For this study, the most recent BRACS dataset of histological (H\&E) stained images was used to classify breast cancer tumours, which contains both the whole-slide images (WSI) and region-o…
▽ More
Breast cancer is one of the leading causes of death for women worldwide. Early screening is essential for early identification, but the chance of survival declines as the cancer progresses into advanced stages. For this study, the most recent BRACS dataset of histological (H\&E) stained images was used to classify breast cancer tumours, which contains both the whole-slide images (WSI) and region-of-interest (ROI) images, however, for our study we have considered ROI images. We have experimented using different pre-trained deep learning models, such as Xception, EfficientNet, ResNet50, and InceptionResNet, pre-trained on the ImageNet weights. We pre-processed the BRACS ROI along with image augmentation, upsampling, and dataset split strategies. For the default dataset split, the best results were obtained by ResNet50 achieving 66% f1-score. For the custom dataset split, the best results were obtained by performing upsampling and image augmentation which results in 96.2% f1-score. Our second approach also reduced the number of false positive and false negative classifications to less than 3% for each class. We believe that our study significantly impacts the early diagnosis and identification of breast cancer tumors and their subtypes, especially atypical and malignant tumors, thus improving patient outcomes and reducing patient mortality rates. Overall, this study has primarily focused on identifying seven (7) breast cancer tumor subtypes, and we believe that the experimental models can be fine-tuned further to generalize over previous breast cancer histology datasets as well.
△ Less
Submitted 24 November, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
The Power of Internet of Things (IoT): Connecting the Dots with Cloud, Edge, and Fog Computing
Authors:
Shams Forruque Ahmed,
Shanjana Shuravi,
Shaila Afrin,
Sabiha Jannat Rafa,
Mahfara Hoque,
Amir H. Gandomi
Abstract:
The Internet of Things (IoT) is regarded as an improved communication system that has revolutionized traditional lifestyles. To function successfully, IoT requires a combination of cloud, fog, and edge computing architectures. Few studies have addressed cloud, fog, and edge computing simultaneously, comparing them and their issues, although several studies have looked into ways of integrating IoT…
▽ More
The Internet of Things (IoT) is regarded as an improved communication system that has revolutionized traditional lifestyles. To function successfully, IoT requires a combination of cloud, fog, and edge computing architectures. Few studies have addressed cloud, fog, and edge computing simultaneously, comparing them and their issues, although several studies have looked into ways of integrating IoT with either one or two computing systems. Thus, this review provides a thorough understanding of IoT integration with these three computing architectures, as well as their respective applications and limitations. It also highlights the advantages, unresolved issues, future opportunities and directions of IoT integration with the computing systems to advance the IoT. IoT can use the Cloud's almost limitless resources to overcome technology restrictions, such as data processing, storage, and transmission. While edge computing can outperform cloud computing in many circumstances, IoT and edge computing become increasingly integrated as IoT devices increase. Cloud computing also poses a few issues, including managing time-sensitive IoT applications like video gaming, simulation, and streaming, which can be addressed by fog computing integrated with IoT. Due to the proximity of fog computing resources to the edge, data transfers and communication delays to the cloud can be reduced as a result of combining the two. The integration of IoT with cloud, fog, and edge computing will create new business prototypes and opportunities. Since IoT has the potential to greatly enhance connectivity infrastructure as an inevitable component of the future internet, further study is needed before it can be fully integrated.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Unveiling the frontiers of deep learning: innovations sha** diverse domains
Authors:
Shams Forruque Ahmed,
Md. Sakib Bin Alam,
Maliha Kabir,
Shaila Afrin,
Sabiha Jannat Rafa,
Aanushka Mehjabin,
Amir H. Gandomi
Abstract:
Deep learning (DL) enables the development of computer models that are capable of learning, visualizing, optimizing, refining, and predicting data. In recent years, DL has been applied in a range of fields, including audio-visual data processing, agriculture, transportation prediction, natural language, biomedicine, disaster management, bioinformatics, drug design, genomics, face recognition, and…
▽ More
Deep learning (DL) enables the development of computer models that are capable of learning, visualizing, optimizing, refining, and predicting data. In recent years, DL has been applied in a range of fields, including audio-visual data processing, agriculture, transportation prediction, natural language, biomedicine, disaster management, bioinformatics, drug design, genomics, face recognition, and ecology. To explore the current state of deep learning, it is necessary to investigate the latest developments and applications of deep learning in these disciplines. However, the literature is lacking in exploring the applications of deep learning in all potential sectors. This paper thus extensively investigates the potential applications of deep learning across all major fields of study as well as the associated benefits and challenges. As evidenced in the literature, DL exhibits accuracy in prediction and analysis, makes it a powerful computational tool, and has the ability to articulate itself and optimize, making it effective in processing data with no prior training. Given its independence from training data, deep learning necessitates massive amounts of data for effective analysis and processing, much like data volume. To handle the challenge of compiling huge amounts of medical, scientific, healthcare, and environmental data for use in deep learning, gated architectures like LSTMs and GRUs can be utilized. For multimodal learning, shared neurons in the neural network for all activities and specialized neurons for particular tasks are necessary.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Navigating the IoT landscape: Unraveling forensics, security issues, applications, research challenges, and future
Authors:
Shams Forruque Ahmed,
Shanjana Shuravi,
Afsana Bhuyian,
Shaila Afrin,
Aanushka Mehjabin,
Sweety Angela Kuldeep,
Md. Sakib Bin Alam,
Amir H. Gandomi
Abstract:
Given the exponential expansion of the internet, the possibilities of security attacks and cybercrimes have increased accordingly. However, poorly implemented security mechanisms in the Internet of Things (IoT) devices make them susceptible to cyberattacks, which can directly affect users. IoT forensics is thus needed for investigating and mitigating such attacks. While many works have examined Io…
▽ More
Given the exponential expansion of the internet, the possibilities of security attacks and cybercrimes have increased accordingly. However, poorly implemented security mechanisms in the Internet of Things (IoT) devices make them susceptible to cyberattacks, which can directly affect users. IoT forensics is thus needed for investigating and mitigating such attacks. While many works have examined IoT applications and challenges, only a few have focused on both the forensic and security issues in IoT. Therefore, this paper reviews forensic and security issues associated with IoT in different fields. Future prospects and challenges in IoT research and development are also highlighted. As demonstrated in the literature, most IoT devices are vulnerable to attacks due to a lack of standardized security measures. Unauthorized users could get access, compromise data, and even benefit from control of critical infrastructure. To fulfil the security-conscious needs of consumers, IoT can be used to develop a smart home system by designing a FLIP-based system that is highly scalable and adaptable. Utilizing a blockchain-based authentication mechanism with a multi-chain structure can provide additional security protection between different trust domains. Deep learning can be utilized to develop a network forensics framework with a high-performing system for detecting and tracking cyberattack incidents. Moreover, researchers should consider limiting the amount of data created and delivered when using big data to develop IoT-based smart systems. The findings of this review will stimulate academics to seek potential solutions for the identified issues, thereby advancing the IoT field.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Machine learning assisted analysis of visible spectroscopy in pulsed-power-driven plasmas
Authors:
Rishabh Datta,
Faez Ahmed,
Jack D Hare
Abstract:
We use machine learning models to predict ion density and electron temperature from visible emission spectra, in a high energy density pulsed-power-driven aluminum plasma, generated by an exploding wire array. Radiation transport simulations, which use spectral emissivity and opacity values generated using the collisional-radiative code PrismSPECT, are used to determine the spectral intensity gene…
▽ More
We use machine learning models to predict ion density and electron temperature from visible emission spectra, in a high energy density pulsed-power-driven aluminum plasma, generated by an exploding wire array. Radiation transport simulations, which use spectral emissivity and opacity values generated using the collisional-radiative code PrismSPECT, are used to determine the spectral intensity generated by the plasma along the spectrometer's line of sight. The spectra exhibit Al-II and Al-III lines, whose line ratios and line widths vary with the density and temperature of the plasma. These calculations provide a 2500-size synthetic dataset of 400-dimensional intensity spectra, which is used to train and compare the performance of multiple machine learning models on a 3-variable regression task. The AutoGluon model performs best, with an R2-score of roughly 98% for density and temperature predictions. Simpler models (random forest, k-nearest neighbor, and deep neural network) also exhibit high R2-scores (>90%) for density and temperature predictions. These results demonstrate the potential of machine learning in providing rapid or real-time analysis of emission spectroscopy data in pulsed-power-driven plasmas.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
A new five-dimensional vacuum-defect wormhole space-time
Authors:
Faizuddin Ahmed
Abstract:
We introduce a novel extension to the Klinkahmer-vacuum defect model by incorporating a fifth spatial coordinate, resulting in a comprehensive five-dimensional wormhole space-time. This extension preserves its status as a vacuum solution to the field equations in five-dimensions. We delve into the behavior of geodesic equations in the proximity of this wormhole, shedding light on its intriguing pr…
▽ More
We introduce a novel extension to the Klinkahmer-vacuum defect model by incorporating a fifth spatial coordinate, resulting in a comprehensive five-dimensional wormhole space-time. This extension preserves its status as a vacuum solution to the field equations in five-dimensions. We delve into the behavior of geodesic equations in the proximity of this wormhole, shedding light on its intriguing properties.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Conformal Predictions Enhanced Expert-guided Meshing with Graph Neural Networks
Authors:
Amin Heyrani Nobari,
Justin Rey,
Suhas Kodali,
Matthew Jones,
Faez Ahmed
Abstract:
Computational Fluid Dynamics (CFD) is widely used in different engineering fields, but accurate simulations are dependent upon proper meshing of the simulation domain. While highly refined meshes may ensure precision, they come with high computational costs. Similarly, adaptive remeshing techniques require multiple simulations and come at a great computational cost. This means that the meshing pro…
▽ More
Computational Fluid Dynamics (CFD) is widely used in different engineering fields, but accurate simulations are dependent upon proper meshing of the simulation domain. While highly refined meshes may ensure precision, they come with high computational costs. Similarly, adaptive remeshing techniques require multiple simulations and come at a great computational cost. This means that the meshing process is reliant upon expert knowledge and years of experience. Automating mesh generation can save significant time and effort and lead to a faster and more efficient design process. This paper presents a machine learning-based scheme that utilizes Graph Neural Networks (GNN) and expert guidance to automatically generate CFD meshes for aircraft models. In this work, we introduce a new 3D segmentation algorithm that outperforms two state-of-the-art models, PointNet++ and PointMLP, for surface classification. We also present a novel approach to project predictions from 3D mesh segmentation models to CAD surfaces using the conformal predictions method, which provides marginal statistical guarantees and robust uncertainty quantification and handling. We demonstrate that the addition of conformal predictions effectively enables the model to avoid under-refinement, hence failure, in CFD meshing even for weak and less accurate models. Finally, we demonstrate the efficacy of our approach through a real-world case study that demonstrates that our automatically generated mesh is comparable in quality to expert-generated meshes and enables the solver to converge and produce accurate results. Furthermore, we compare our approach to the alternative of adaptive remeshing in the same case study and find that our method is 5 times faster in the overall process of simulation. The code and data for this project are made publicly available at https://github.com/ahnobari/AutoSurf.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
Topologically Charged Rotating Wormhole
Authors:
Faizuddin Ahmed
Abstract:
In this article, we present a stationary metric ansatz to describe a rotating traversable wormhole in the presence of the topological defect produced by a global monopole charge. This particular rotating space-time is referred to as the topologically charged rotating Schwarzschild-Klinkahmer wormhole. Our study involves the analysis of geodesic motion for test particles and photon rays in the cont…
▽ More
In this article, we present a stationary metric ansatz to describe a rotating traversable wormhole in the presence of the topological defect produced by a global monopole charge. This particular rotating space-time is referred to as the topologically charged rotating Schwarzschild-Klinkahmer wormhole. Our study involves the analysis of geodesic motion for test particles and photon rays in the context of this topologically charged rotating traversable wormhole. We aim to analyze the effects of global monopole charge and other parameters on the outcomes of this investigation. Additionally, we explore the matter-energy distribution within this rotating wormhole, considering it as a non-vacuum solution of Einstein's field equation. Notably, we demonstrate that the energy density of the matter content satisfies the criteria of the weak energy condition.
△ Less
Submitted 15 December, 2023; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Explainable Cost-Sensitive Deep Neural Networks for Brain Tumor Detection from Brain MRI Images considering Data Imbalance
Authors:
Md Tanvir Rouf Shawon,
G. M. Shahariar Shibli,
Farzad Ahmed,
Sajib Kumar Saha Joy
Abstract:
This paper presents a research study on the use of Convolutional Neural Network (CNN), ResNet50, InceptionV3, EfficientNetB0 and NASNetMobile models to efficiently detect brain tumors in order to reduce the time required for manual review of the report and create an automated system for classifying brain tumors. An automated pipeline is proposed, which encompasses five models: CNN, ResNet50, Incep…
▽ More
This paper presents a research study on the use of Convolutional Neural Network (CNN), ResNet50, InceptionV3, EfficientNetB0 and NASNetMobile models to efficiently detect brain tumors in order to reduce the time required for manual review of the report and create an automated system for classifying brain tumors. An automated pipeline is proposed, which encompasses five models: CNN, ResNet50, InceptionV3, EfficientNetB0 and NASNetMobile. The performance of the proposed architecture is evaluated on a balanced dataset and found to yield an accuracy of 99.33% for fine-tuned InceptionV3 model. Furthermore, Explainable AI approaches are incorporated to visualize the model's latent behavior in order to understand its black box behavior. To further optimize the training process, a cost-sensitive neural network approach has been proposed in order to work with imbalanced datasets which has achieved almost 4% more accuracy than the conventional models used in our experiments. The cost-sensitive InceptionV3 (CS-InceptionV3) and CNN (CS-CNN) show a promising accuracy of 92.31% and a recall value of 1.00 respectively on an imbalanced dataset. The proposed models have shown great potential in improving tumor detection accuracy and must be further developed for application in practical solutions. We have provided the datasets and made our implementations publicly available at - https://github.com/shahariar-shibli/Explainable-Cost-Sensitive-Deep-Neural-Networks-for-Brain-Tumor-Detection-from-Brain-MRI-Images
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
A topologically charged four-dimensional wormhole and the energy conditions
Authors:
Faizuddin Ahmed
Abstract:
In this research work, our primary focus revolves around the examination of a specific category of traversable wormholes known as topologically charged generalized Schwarzschild-Simpson-Visser-type wormhole, $ds^2=-\Big(1-\frac{2\,M}{\sqrt{x^2+b^2}}\Big)\,dt^2+\Big(1-\frac{2\,M}{\sqrt{x^2+b^2}}\Big)^{-1}\,\Big(\frac{dx^2}{α^2}\Big)+(x^2+a^2)\,(dθ^2+\sin^2 θ\,dφ^2)$. This wormhole is uniquely defin…
▽ More
In this research work, our primary focus revolves around the examination of a specific category of traversable wormholes known as topologically charged generalized Schwarzschild-Simpson-Visser-type wormhole, $ds^2=-\Big(1-\frac{2\,M}{\sqrt{x^2+b^2}}\Big)\,dt^2+\Big(1-\frac{2\,M}{\sqrt{x^2+b^2}}\Big)^{-1}\,\Big(\frac{dx^2}{α^2}\Big)+(x^2+a^2)\,(dθ^2+\sin^2 θ\,dφ^2)$. This wormhole is uniquely defined by a pair of key parameters (length scales $a$ and $b$), together with the global monopole charge $α$. A noteworthy outcome of our investigation is the observation that the energy-momentum tensor associated with this wormhole complies with both the weak energy condition (WEC) and the null energy condition (NEC). Furthermore, incorporation of global monopole charge introduces a substantial influence on the curvature properties of wormhole space-time and various associated physical quantities derived from this geometry.
△ Less
Submitted 22 November, 2023; v1 submitted 30 July, 2023;
originally announced August 2023.
-
Mining Reddit Data to Elicit Students' Requirements During COVID-19 Pandemic
Authors:
Shadikur Rahman,
Faiz Ahmed,
Maleknaz Nayebi
Abstract:
Data-driven requirements engineering leverages the abundance of openly accessible and crowdsourced information on the web. By incorporating user feedback provided about a software product, such as reviews in mobile app stores, these approaches facilitate the identification of issues, bug fixes, and implementation of change requests. However, relying solely on user feedback about a software product…
▽ More
Data-driven requirements engineering leverages the abundance of openly accessible and crowdsourced information on the web. By incorporating user feedback provided about a software product, such as reviews in mobile app stores, these approaches facilitate the identification of issues, bug fixes, and implementation of change requests. However, relying solely on user feedback about a software product limits the possibility of eliciting all requirements, as users may not always have a clear understanding of their exact needs from the software, despite their wealth of experience with the problem, event, or challenges they encounter and use the software to assist them. In this study, we propose a shift in requirements elicitation, focusing on gathering feedback related to the problem itself rather than relying solely on feedback about the software product. We conducted a case study on student requirements during the COVID-19 pandemic in a higher education institution. We gathered their communications from Reddit during the pandemic and employed multiple machine-learning and natural language processing techniques to identify requirement sentences. We achieved the F-score of 0.79 using Naive Bayes with TF-IDF when benchmarking multiple techniques. The results lead us to believe that mining requirements from communication about a problem are feasible. While we present the preliminary results, we envision a future where these requirements complement conventionally elicited requirements and help to close the requirements gap.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
A note on "The Klein-Gordon oscillator in (1+2)-dimensions Gürses space-time backgrounds (Ann. Phys. (N. Y.) 404 (2019) 1)"
Authors:
Omar. Mustafa,
Faizuddin Ahmed
Abstract:
We revisit and discuss the KG-oscillator in the $(1+2)$-dimensional Gürses space-time studied in (F. Ahmed, Ann. Phys. (N. Y.) 404 (2019) 1). The modified oscillator frequency {\it i.e.,} $\tildeω^{2}=(Ω^{2}\,E^{2}+η^{2})$ appeared in the eigenvalue equation is an energy-dependent parameter, and consequently, the results are accurately reported here. Moreover, we show some interesting spectroscopi…
▽ More
We revisit and discuss the KG-oscillator in the $(1+2)$-dimensional Gürses space-time studied in (F. Ahmed, Ann. Phys. (N. Y.) 404 (2019) 1). The modified oscillator frequency {\it i.e.,} $\tildeω^{2}=(Ω^{2}\,E^{2}+η^{2})$ appeared in the eigenvalue equation is an energy-dependent parameter, and consequently, the results are accurately reported here. Moreover, we show some interesting spectroscopic features indulged within the very nature of Gürses space-time for the KG-Gürses oscillators.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Geodesics motion of test particles around Schwarzschild-Klinkhamer wormhole with topological defects and gravitational lensing
Authors:
Faizuddin Ahmed
Abstract:
This study investigates the geodesic motion of test particles, both massless and massive, within a Schwarzschild-Klinkhamer (SK) wormhole space-time. We specifically consider the influence of cosmic strings on the system and analyze the effective potential, and observing that the presence of a cosmic string parameter alters it for null and time-like geodesics. Moreover, we calculate the deflection…
▽ More
This study investigates the geodesic motion of test particles, both massless and massive, within a Schwarzschild-Klinkhamer (SK) wormhole space-time. We specifically consider the influence of cosmic strings on the system and analyze the effective potential, and observing that the presence of a cosmic string parameter alters it for null and time-like geodesics. Moreover, we calculate the deflection angle for null geodesics, and demonstrate that the cosmic string modifies this angle and induces a shift in the results. Additionally, we extend our investigation in this SK-wormhole space-time but with a global monopole. We explore the geodesic motion of test particles in this scenario and find that the effective potential is affected by the global monopole. Similarly, we determine the deflection angle for null geodesics and show that the global monopole parameter introduces modifications to this angle. Lastly, we present several known solutions for space-times involving cosmic strings and global monopoles within the framework of this SK-wormhole.
△ Less
Submitted 7 November, 2023; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Towards Zero-power 3D Imaging: VLC-assisted Passive ToF Sensing
Authors:
Faisal Ahmed,
Miguel Heredia Conde,
Paula López Martínez
Abstract:
Passive Time-of-Flight (ToF) imaging can be enabled by optical wireless communication (OWC). The lighting infrastructure is the backbone of emerging light-based wireless communication. To this end, communication sources are used as opportunity illuminators to probe the scene, and an array of time-resolved pixels are exploited to demodulate the return, provided that the ToF camera can be externally…
▽ More
Passive Time-of-Flight (ToF) imaging can be enabled by optical wireless communication (OWC). The lighting infrastructure is the backbone of emerging light-based wireless communication. To this end, communication sources are used as opportunity illuminators to probe the scene, and an array of time-resolved pixels are exploited to demodulate the return, provided that the ToF camera can be externally synchronized. Our work employs a direct line-of-sight path to synchronize the camera externally. Together with the indirect path given by the reflections from the scene, this yields a bistatic configuration. Each Time-of-Flight (ToF) measurement induced a solution space of ellipsoidal shape and redefined the image formation model based on the bistatic configuration. In this demo, we showcase a passive ToF camera capable of delivering intensity and depth information in practice without emitting photons from the ToF camera. Our passive modality can eliminate built-in illumination sources, thus co** with optical power constraints, as is desired in future ToF cameras.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
A Blockchain-Based Framework for Distributed Agile Software Testing Life Cycle
Authors:
Muhammad Shoaib Farooq,
Fatima Ahmed
Abstract:
A blockchain-based framework for distributed agile software testing life cycle is an innovative approach that uses blockchain technology to optimize the software testing process. Previously, various methods were employed to address communication and collaboration challenges in software testing, but they were deficient in aspects such as trust, traceability, and security. Additionally, a significan…
▽ More
A blockchain-based framework for distributed agile software testing life cycle is an innovative approach that uses blockchain technology to optimize the software testing process. Previously, various methods were employed to address communication and collaboration challenges in software testing, but they were deficient in aspects such as trust, traceability, and security. Additionally, a significant cause of project failure was the non-completion of unit testing by developers, leading to delayed testing. This paper integration of blockchain technology in software testing resolves critical concerns related to transparency, trust, coordination, and communication. We have proposed a blockchain based framework named as TestingPlus. TestingPlus framework utilizes blockchain technology to provide a secure and transparent platform for acceptance testing and payment verification. By leveraging smart contracts on a private Ethereum blockchain, TestingPlus can help to ensure that both the testing team and the development team are working towards a common goal and are compensated fairly for their contributions.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Cascaded Model Predictive Control of a Tandem-Rotor Helicopter
Authors:
Faraaz Ahmed,
Ludwik Sobiesiak,
James Richard Forbes
Abstract:
This letter considers cascaded model predictive control (MPC) as a computationally lightweight method for controlling a tandem-rotor helicopter. A traditional single MPC structure is split into separate outer and inner-loops. The outer-loop MPC uses an $SE_2(3)$ error to linearize the translational dynamics about a reference trajectory. The inner-loop MPC uses the optimal angular velocity sequence…
▽ More
This letter considers cascaded model predictive control (MPC) as a computationally lightweight method for controlling a tandem-rotor helicopter. A traditional single MPC structure is split into separate outer and inner-loops. The outer-loop MPC uses an $SE_2(3)$ error to linearize the translational dynamics about a reference trajectory. The inner-loop MPC uses the optimal angular velocity sequence of the outer-loop MPC to linearize the rotational dynamics. The outer-loop MPC is run at a slower rate than the inner-loop allowing for longer prediction time and improved performance. Monte-Carlo simulations demonstrate robustness to model uncertainty and environmental disturbances. The proposed control structure is benchmarked against a single MPC algorithm where it shows significant improvements in position and velocity tracking while using significantly less computational resources.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Learning from Invalid Data: On Constraint Satisfaction in Generative Models
Authors:
Giorgio Giannone,
Lyle Regenwetter,
Akash Srivastava,
Dan Gutfreund,
Faez Ahmed
Abstract:
Generative models have demonstrated impressive results in vision, language, and speech. However, even with massive datasets, they struggle with precision, generating physically invalid or factually incorrect data. This is particularly problematic when the generated data must satisfy constraints, for example, to meet product specifications in engineering design or to adhere to the laws of physics i…
▽ More
Generative models have demonstrated impressive results in vision, language, and speech. However, even with massive datasets, they struggle with precision, generating physically invalid or factually incorrect data. This is particularly problematic when the generated data must satisfy constraints, for example, to meet product specifications in engineering design or to adhere to the laws of physics in a natural scene. To improve precision while preserving diversity and fidelity, we propose a novel training mechanism that leverages datasets of constraint-violating data points, which we consider invalid. Our approach minimizes the divergence between the generative distribution and the valid prior while maximizing the divergence with the invalid distribution. We demonstrate how generative models like GANs and DDPMs that we augment to train with invalid data vastly outperform their standard counterparts which solely train on valid data points. For example, our training procedure generates up to 98 % fewer invalid samples on 2D densities, improves connectivity and stability four-fold on a stacking block problem, and improves constraint satisfaction by 15 % on a structural topology optimization benchmark in engineering design. We also analyze how the quality of the invalid data affects the learning procedure and the generalization properties of models. Finally, we demonstrate significant improvements in sample efficiency, showing that a tenfold increase in valid samples leads to a negligible difference in constraint satisfaction, while less than 10 % invalid samples lead to a tenfold improvement. Our proposed mechanism offers a promising solution for improving precision in generative models while preserving diversity and fidelity, particularly in domains where constraint satisfaction is critical and data is limited, such as engineering design, robotics, and medicine.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Gravitational lensing in a space-time with cosmic string within the Eddington-inspired Born-Infeld gravity
Authors:
Faizuddin Ahmed
Abstract:
This study explores the deflection angle of photon rays or light-like geodesics within the framework of Eddington-inspired Born-Infeld (EiBI) gravity background space-time, taking into account the influence of cosmic strings. The primary focus lies in deriving the effective potential of the system applicable to both null and time-like geodesics, as well as determining the angle of deflection for l…
▽ More
This study explores the deflection angle of photon rays or light-like geodesics within the framework of Eddington-inspired Born-Infeld (EiBI) gravity background space-time, taking into account the influence of cosmic strings. The primary focus lies in deriving the effective potential of the system applicable to both null and time-like geodesics, as well as determining the angle of deflection for light-like geodesics. Our analysis shows that the presence of cosmic strings induces modifications in these physical quantities, leading to shifts in their respective values.
△ Less
Submitted 29 March, 2024; v1 submitted 21 June, 2023;
originally announced June 2023.
-
ClimSim: A large multi-scale dataset for hybrid physics-ML climate emulation
Authors:
Sungduk Yu,
Walter Hannah,
Liran Peng,
Jerry Lin,
Mohamed Aziz Bhouri,
Ritwik Gupta,
Björn Lütjens,
Justus Christopher Will,
Gunnar Behrens,
Julius Busecke,
Nora Loose,
Charles I Stern,
Tom Beucler,
Bryce Harrop,
Benjamin R Hillman,
Andrea Jenney,
Savannah Ferretti,
Nana Liu,
Anima Anandkumar,
Noah D Brenowitz,
Veronika Eyring,
Nicholas Geneva,
Pierre Gentine,
Stephan Mandt,
Jaideep Pathak
, et al. (31 additional authors not shown)
Abstract:
Modern climate projections lack adequate spatial and temporal resolution due to computational constraints. A consequence is inaccurate and imprecise predictions of critical processes such as storms. Hybrid methods that combine physics with machine learning (ML) have introduced a new generation of higher fidelity climate simulators that can sidestep Moore's Law by outsourcing compute-hungry, short,…
▽ More
Modern climate projections lack adequate spatial and temporal resolution due to computational constraints. A consequence is inaccurate and imprecise predictions of critical processes such as storms. Hybrid methods that combine physics with machine learning (ML) have introduced a new generation of higher fidelity climate simulators that can sidestep Moore's Law by outsourcing compute-hungry, short, high-resolution simulations to ML emulators. However, this hybrid ML-physics simulation approach requires domain-specific treatment and has been inaccessible to ML experts because of lack of training data and relevant, easy-to-use workflows. We present ClimSim, the largest-ever dataset designed for hybrid ML-physics research. It comprises multi-scale climate simulations, developed by a consortium of climate scientists and ML researchers. It consists of 5.7 billion pairs of multivariate input and output vectors that isolate the influence of locally-nested, high-resolution, high-fidelity physics on a host climate simulator's macro-scale physical state.
The dataset is global in coverage, spans multiple years at high sampling frequency, and is designed such that resulting emulators are compatible with downstream coupling into operational climate simulators. We implement a range of deterministic and stochastic regression baselines to highlight the ML challenges and their scoring. The data (https://huggingface.co/datasets/LEAP/ClimSim_high-res) and code (https://leap-stc.github.io/ClimSim) are released openly to support the development of hybrid ML-physics and high-fidelity climate simulations for the benefit of science and society.
△ Less
Submitted 6 February, 2024; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Surrogate Modeling of Car Drag Coefficient with Depth and Normal Renderings
Authors:
Binyang Song,
Chenyang Yuan,
Frank Permenter,
Nikos Arechiga,
Faez Ahmed
Abstract:
Generative AI models have made significant progress in automating the creation of 3D shapes, which has the potential to transform car design. In engineering design and optimization, evaluating engineering metrics is crucial. To make generative models performance-aware and enable them to create high-performing designs, surrogate modeling of these metrics is necessary. However, the currently used re…
▽ More
Generative AI models have made significant progress in automating the creation of 3D shapes, which has the potential to transform car design. In engineering design and optimization, evaluating engineering metrics is crucial. To make generative models performance-aware and enable them to create high-performing designs, surrogate modeling of these metrics is necessary. However, the currently used representations of three-dimensional (3D) shapes either require extensive computational resources to learn or suffer from significant information loss, which impairs their effectiveness in surrogate modeling. To address this issue, we propose a new two-dimensional (2D) representation of 3D shapes. We develop a surrogate drag model based on this representation to verify its effectiveness in predicting 3D car drag. We construct a diverse dataset of 9,070 high-quality 3D car meshes labeled by drag coefficients computed from computational fluid dynamics (CFD) simulations to train our model. Our experiments demonstrate that our model can accurately and efficiently evaluate drag coefficients with an $R^2$ value above 0.84 for various car categories. Moreover, the proposed representation method can be generalized to many other product categories beyond cars. Our model is implemented using deep neural networks, making it compatible with recent AI image generation tools (such as Stable Diffusion) and a significant step towards the automatic generation of drag-optimized car designs. We have made the dataset and code publicly available at https://decode.mit.edu/projects/dragprediction/.
△ Less
Submitted 26 May, 2023;
originally announced June 2023.
-
A Reference Framework for Variability Management of Software Product Lines
Authors:
Saiqa Aleem,
Luiz Fernando Capretz,
Faheem Ahmed
Abstract:
Variability management (VM) in software product line engineering (SPLE) is introduced as an abstraction that enables the reuse and customization of assets. VM is a complex task involving the identification, representation, and instantiation of variability for specific products, as well as the evolution of variability itself. This work presents a comparison and contrast between existing VM approach…
▽ More
Variability management (VM) in software product line engineering (SPLE) is introduced as an abstraction that enables the reuse and customization of assets. VM is a complex task involving the identification, representation, and instantiation of variability for specific products, as well as the evolution of variability itself. This work presents a comparison and contrast between existing VM approaches using qualitative meta-synthesis to determine the underlying perspectives, metaphors, and concepts of existing methods. A common frame of reference for the VM was proposed as the result of this analysis. Putting metaphors in the context of the dimensions in which variability occurs and identifying its key concepts provides a better understanding of its management and enables several analyses and evaluation opportunities. Finally, the proposed framework was evaluated using a qualitative study approach. The results of the evaluation phase suggest that the organizations in practice only focus on one dimension. The presented frame of reference will help the organization to cover this gap in practice.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation
Authors:
Giorgio Giannone,
Akash Srivastava,
Ole Winther,
Faez Ahmed
Abstract:
Generative models have had a profound impact on vision and language, paving the way for a new era of multimodal generative applications. While these successes have inspired researchers to explore using generative models in science and engineering to accelerate the design process and reduce the reliance on iterative optimization, challenges remain. Specifically, engineering optimization methods bas…
▽ More
Generative models have had a profound impact on vision and language, paving the way for a new era of multimodal generative applications. While these successes have inspired researchers to explore using generative models in science and engineering to accelerate the design process and reduce the reliance on iterative optimization, challenges remain. Specifically, engineering optimization methods based on physics still outperform generative models when dealing with constrained environments where data is scarce and precision is paramount. To address these challenges, we introduce Diffusion Optimization Models (DOM) and Trajectory Alignment (TA), a learning framework that demonstrates the efficacy of aligning the sampling trajectory of diffusion models with the optimization trajectory derived from traditional physics-based methods. This alignment ensures that the sampling process remains grounded in the underlying physical principles. Our method allows for generating feasible and high-performance designs in as few as two steps without the need for expensive preprocessing, external surrogate models, or additional labeled data. We apply our framework to structural topology optimization, a fundamental problem in mechanical design, evaluating its performance on in- and out-of-distribution configurations. Our results demonstrate that TA outperforms state-of-the-art deep generative models on in-distribution configurations and halves the inference computational cost. When coupled with a few steps of optimization, it also improves manufacturability for out-of-distribution conditions. By significantly improving performance and inference efficiency, DOM enables us to generate high-quality designs in just a few steps and guide them toward regions of high performance and manufacturability, paving the way for the widespread application of generative models in large-scale data-driven design.
△ Less
Submitted 29 May, 2023;
originally announced May 2023.
-
Multi-modal Machine Learning for Vehicle Rating Predictions Using Image, Text, and Parametric Data
Authors:
Hanqi Su,
Binyang Song,
Faez Ahmed
Abstract:
Accurate vehicle rating prediction can facilitate designing and configuring good vehicles. This prediction allows vehicle designers and manufacturers to optimize and improve their designs in a timely manner, enhance their product performance, and effectively attract consumers. However, most of the existing data-driven methods rely on data from a single mode, e.g., text, image, or parametric data,…
▽ More
Accurate vehicle rating prediction can facilitate designing and configuring good vehicles. This prediction allows vehicle designers and manufacturers to optimize and improve their designs in a timely manner, enhance their product performance, and effectively attract consumers. However, most of the existing data-driven methods rely on data from a single mode, e.g., text, image, or parametric data, which results in a limited and incomplete exploration of the available information. These methods lack comprehensive analyses and exploration of data from multiple modes, which probably leads to inaccurate conclusions and hinders progress in this field. To overcome this limitation, we propose a multi-modal learning model for more comprehensive and accurate vehicle rating predictions. Specifically, the model simultaneously learns features from the parametric specifications, text descriptions, and images of vehicles to predict five vehicle rating scores, including the total score, critics score, performance score, safety score, and interior score. We compare the multi-modal learning model to the corresponding unimodal models and find that the multi-modal model's explanatory power is 4% - 12% higher than that of the unimodal models. On this basis, we conduct sensitivity analyses using SHAP to interpret our model and provide design and optimization directions to designers and manufacturers. Our study underscores the importance of the data-driven multi-modal learning approach for vehicle design, evaluation, and optimization. We have made the code publicly available at http://decode.mit.edu/projects/vehicleratings/.
△ Less
Submitted 27 May, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Psychological Metrics for Dialog System Evaluation
Authors:
Salvatore Giorgi,
Shreya Havaldar,
Farhan Ahmed,
Zuhaib Akhtar,
Shalaka Vaidya,
Gary Pan,
Lyle H. Ungar,
H. Andrew Schwartz,
Joao Sedoc
Abstract:
We present metrics for evaluating dialog systems through a psychologically-grounded "human" lens in which conversational agents express a diversity of both states (e.g., emotion) and traits (e.g., personality), just as people do. We present five interpretable metrics from established psychology that are fundamental to human communication and relationships: emotional entropy, linguistic style and e…
▽ More
We present metrics for evaluating dialog systems through a psychologically-grounded "human" lens in which conversational agents express a diversity of both states (e.g., emotion) and traits (e.g., personality), just as people do. We present five interpretable metrics from established psychology that are fundamental to human communication and relationships: emotional entropy, linguistic style and emotion matching, agreeableness, and empathy. These metrics can be applied (1) across dialogs and (2) on turns within dialogs. The psychological metrics are compared against seven state-of-the-art traditional metrics (e.g., BARTScore and BLEURT) on seven standard dialog system data sets. We also introduce a novel data set, the Three Bot Dialog Evaluation Corpus, which consists of annotated conversations from ChatGPT, GPT-3, and BlenderBot. We demonstrate that our proposed metrics offer novel information; they are uncorrelated with traditional metrics, can be used to meaningfully compare dialog systems, and lead to increased accuracy (beyond existing traditional metrics) in predicting crowd-sourced dialog judgements. The interpretability and unique signal of our psychological metrics make them a valuable tool for evaluating and improving dialog systems.
△ Less
Submitted 15 September, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
MCD: A Model-Agnostic Counterfactual Search Method For Multi-modal Design Modifications
Authors:
Lyle Regenwetter,
Yazan Abu Obaideh,
Faez Ahmed
Abstract:
Designers may often ask themselves how to adjust their design concepts to achieve demanding functional goals. To answer such questions, designers must often consider counterfactuals, weighing design alternatives and their projected performance. This paper introduces Multi-objective Counterfactuals for Design (MCD), a computational tool that automates and streamlines the counterfactual search proce…
▽ More
Designers may often ask themselves how to adjust their design concepts to achieve demanding functional goals. To answer such questions, designers must often consider counterfactuals, weighing design alternatives and their projected performance. This paper introduces Multi-objective Counterfactuals for Design (MCD), a computational tool that automates and streamlines the counterfactual search process and recommends targeted design modifications that meet designers' unique requirements. MCD improves upon existing counterfactual search methods by supporting multi-objective requirements, which are crucial in design problems, and by decoupling the counterfactual search and sampling processes, thus enhancing efficiency and facilitating objective trade-off visualization. The paper showcases MCD's capabilities in complex engineering tasks using three demonstrative bicycle design challenges. In the first, MCD effectively identifies design modifications that quantifiably enhance functional performance, strengthening the bike frame and saving weight. In the second, MCD modifies parametric bike models in a cross-modal fashion to resemble subjective text prompts or reference images. In a final multidisciplinary case study, MCD tackles all the quantitative and subjective design requirements introduced in the first two problems, while simultaneously customizing a bike design to an individual rider's biomechanical attributes. By exploring hypothetical design alterations and their impact on multiple design objectives, MCD recommends effective design modifications for practitioners seeking to make targeted enhancements to their designs. The code, test problems, and datasets used in the paper are available to the public at decode.mit.edu/projects/counterfactuals/.
△ Less
Submitted 31 May, 2024; v1 submitted 18 May, 2023;
originally announced May 2023.
-
Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation
Authors:
Syed Samiul Alam,
Samiul Based Shuvo,
Shams Nafisa Ali,
Fardeen Ahmed,
Arbil Chakma,
Yeong Min Jang
Abstract:
Ocular Toxoplasmosis (OT), is a common eye infection caused by T. gondii that can cause vision problems. Diagnosis is typically done through a clinical examination and imaging, but these methods can be complicated and costly, requiring trained personnel. To address this issue, we have created a benchmark study that evaluates the effectiveness of existing pre-trained networks using transfer learnin…
▽ More
Ocular Toxoplasmosis (OT), is a common eye infection caused by T. gondii that can cause vision problems. Diagnosis is typically done through a clinical examination and imaging, but these methods can be complicated and costly, requiring trained personnel. To address this issue, we have created a benchmark study that evaluates the effectiveness of existing pre-trained networks using transfer learning techniques to detect OT from fundus images. Furthermore, we have also analysed the performance of transfer-learning based segmentation networks to segment lesions in the images. This research seeks to provide a guide for future researchers looking to utilise DL techniques and develop a cheap, automated, easy-to-use, and accurate diagnostic method. We have performed in-depth analysis of different feature extraction techniques in order to find the most optimal one for OT classification and segmentation of lesions. For classification tasks, we have evaluated pre-trained models such as VGG16, MobileNetV2, InceptionV3, ResNet50, and DenseNet121 models. Among them, MobileNetV2 outperformed all other models in terms of Accuracy (Acc), Recall, and F1 Score outperforming the second-best model, InceptionV3 by 0.7% higher Acc. However, DenseNet121 achieved the best result in terms of Precision, which was 0.1% higher than MobileNetv2. For the segmentation task, this work has exploited U-Net architecture. In order to utilize transfer learning the encoder block of the traditional U-Net was replaced by MobileNetV2, InceptionV3, ResNet34, and VGG16 to evaluate different architectures moreover two different two different loss functions (Dice loss and Jaccard loss) were exploited in order to find the most optimal one. The MobileNetV2/U-Net outperformed ResNet34 by 0.5% and 2.1% in terms of Acc and Dice Score, respectively when Jaccard loss function is employed during the training.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
DATED: Guidelines for Creating Synthetic Datasets for Engineering Design Applications
Authors:
Cyril Picard,
Jürg Schiffmann,
Faez Ahmed
Abstract:
Exploiting the recent advancements in artificial intelligence, showcased by ChatGPT and DALL-E, in real-world applications necessitates vast, domain-specific, and publicly accessible datasets. Unfortunately, the scarcity of such datasets poses a significant challenge for researchers aiming to apply these breakthroughs in engineering design. Synthetic datasets emerge as a viable alternative. Howeve…
▽ More
Exploiting the recent advancements in artificial intelligence, showcased by ChatGPT and DALL-E, in real-world applications necessitates vast, domain-specific, and publicly accessible datasets. Unfortunately, the scarcity of such datasets poses a significant challenge for researchers aiming to apply these breakthroughs in engineering design. Synthetic datasets emerge as a viable alternative. However, practitioners are often uncertain about generating high-quality datasets that accurately represent real-world data and are suitable for the intended downstream applications. This study aims to fill this knowledge gap by proposing comprehensive guidelines for generating, annotating, and validating synthetic datasets. The trade-offs and methods associated with each of these aspects are elaborated upon. Further, the practical implications of these guidelines are illustrated through the creation of a turbo-compressors dataset. The study underscores the importance of thoughtful sampling methods to ensure the appropriate size, diversity, utility, and realism of a dataset. It also highlights that design diversity does not equate to performance diversity or realism. By employing test sets that represent uniform, real, or task-specific samples, the influence of sample size and sampling strategy is scrutinized. Overall, this paper offers valuable insights for researchers intending to create and publish synthetic datasets for engineering design, thereby paving the way for more effective applications of AI advancements in the field. The code and data for the dataset and methods are made publicly accessible at https://github.com/cyrilpic/radcomp .
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Ship-D: Ship Hull Dataset for Design Optimization using Machine Learning
Authors:
Noah J. Bagazinski,
Faez Ahmed
Abstract:
Machine learning has recently made significant strides in reducing design cycle time for complex products. Ship design, which currently involves years long cycles and small batch production, could greatly benefit from these advancements. By develo** a machine learning tool for ship design that learns from the design of many different types of ships, tradeoffs in ship design could be identified a…
▽ More
Machine learning has recently made significant strides in reducing design cycle time for complex products. Ship design, which currently involves years long cycles and small batch production, could greatly benefit from these advancements. By develo** a machine learning tool for ship design that learns from the design of many different types of ships, tradeoffs in ship design could be identified and optimized. However, the lack of publicly available ship design datasets currently limits the potential for leveraging machine learning in generalized ship design. To address this gap, this paper presents a large dataset of thirty thousand ship hulls, each with design and functional performance information, including parameterization, mesh, point cloud, and image representations, as well as thirty two hydrodynamic drag measures under different operating conditions. The dataset is structured to allow human input and is also designed for computational methods. Additionally, the paper introduces a set of twelve ship hulls from publicly available CAD repositories to showcase the proposed parameterizations ability to accurately reconstruct existing hulls. A surrogate model was developed to predict the thirty two wave drag coefficients, which was then implemented in a genetic algorithm case study to reduce the total drag of a hull by sixty percent while maintaining the shape of the hulls cross section and the length of the parallel midbody. Our work provides a comprehensive dataset and application examples for other researchers to use in advancing data driven ship design.
△ Less
Submitted 16 May, 2023; v1 submitted 14 May, 2023;
originally announced May 2023.
-
Topological Effects With Inverse Quadratic Yukawa Plus Inverse Square Potential on Eigenvalue Solutions
Authors:
Faizuddin Ahmed
Abstract:
In this analysis, we study the non-relativistic Schrodinger wave equation under the influence of quantum flux field with interactions potential in the background of a point-like global monopole (PGM). In fact, we consider an inverse quadratic Yukawa plus inverse square potential and derive the radial equation employing the Greene-Aldrich approximation scheme in the centrifugal term. We determine t…
▽ More
In this analysis, we study the non-relativistic Schrodinger wave equation under the influence of quantum flux field with interactions potential in the background of a point-like global monopole (PGM). In fact, we consider an inverse quadratic Yukawa plus inverse square potential and derive the radial equation employing the Greene-Aldrich approximation scheme in the centrifugal term. We determine the approximate eigenvalue solution using the parametric Nikiforov-Uvarov method and analyze the result. Afterwards, we derive the radial wave equation using the same potential employing a power series expansion method in the exponential potential and solve it analytically. We show that the energy eigenvalues are shifted by the topological defects of a point-like global monopole compared to the flat space result. In addition, we see that the energy eigenvalues depend on the quantum flux field that shows an analogue to the Aharonov-Bohm effect
△ Less
Submitted 25 April, 2023;
originally announced May 2023.
-
ADVISE: AI-accelerated Design of Evidence Synthesis for Global Development
Authors:
Kristen M. Edwards,
Binyang Song,
Jaron Porciello,
Mark Engelbert,
Carolyn Huang,
Faez Ahmed
Abstract:
When designing evidence-based policies and programs, decision-makers must distill key information from a vast and rapidly growing literature base. Identifying relevant literature from raw search results is time and resource intensive, and is often done by manual screening. In this study, we develop an AI agent based on a bidirectional encoder representations from transformers (BERT) model and inco…
▽ More
When designing evidence-based policies and programs, decision-makers must distill key information from a vast and rapidly growing literature base. Identifying relevant literature from raw search results is time and resource intensive, and is often done by manual screening. In this study, we develop an AI agent based on a bidirectional encoder representations from transformers (BERT) model and incorporate it into a human team designing an evidence synthesis product for global development. We explore the effectiveness of the human-AI hybrid team in accelerating the evidence synthesis process. To further improve team efficiency, we enhance the human-AI hybrid team through active learning (AL). Specifically, we explore different sampling strategies, including random sampling, least confidence (LC) sampling, and highest priority (HP) sampling, to study their influence on the collaborative screening process. Results show that incorporating the BERT-based AI agent into the human team can reduce the human screening effort by 68.5% compared to the case of no AI assistance and by 16.8% compared to the case of using a support vector machine (SVM)-based AI agent for identifying 80% of all relevant documents. When we apply the HP sampling strategy for AL, the human screening effort can be reduced even more: by 78.3% for identifying 80% of all relevant documents compared to no AI assistance. We apply the AL-enhanced human-AI hybrid teaming workflow in the design process of three evidence gap maps (EGMs) for USAID and find it to be highly effective. These findings demonstrate how AI can accelerate the development of evidence synthesis products and promote timely evidence-based decision making in global development in a human-AI hybrid teaming context.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
Feshbach-Villars oscillator (FVO) in Kaluza-Klein Theory (KKT)
Authors:
Abdelmalek Bouzenada,
Abdelmalek Boumali,
R. L. L . Vitoria,
Faizuddin Ahmed,
Marwan Al-Raeei
Abstract:
This research investigates the relativistic quantum dynamics of spin-0 scalar massive charged particles via the relativistic Feshbach-Villars oscillator in the background of the Kaluza-Klein Theory. We solve the Feshbach-Villars equation in the abckground of a cosmic string spec-time in the context of the Kaluza-Klein and presented the eigenvalue solution. Afterward, we rewrite this system in the…
▽ More
This research investigates the relativistic quantum dynamics of spin-0 scalar massive charged particles via the relativistic Feshbach-Villars oscillator in the background of the Kaluza-Klein Theory. We solve the Feshbach-Villars equation in the abckground of a cosmic string spec-time in the context of the Kaluza-Klein and presented the eigenvalue solution. Afterward, we rewrite this system in the case of the Feshbach-Villars quantum oscillator and obtain the eigenvalue analytically. Finally, we study the interaction of the Feshbach-Villars equation and oscillator in a cosmic dislocation in the Som-Raychaudhuri in the context of the Kaluza-Klein Theory and solve the wave equation analytically. We analyze the influence of topological defect in the quantification of energy and wave function of the Feshbach-Villars oscillator and with the external fields in the last one
△ Less
Submitted 6 July, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Authors:
Zhengyuan Yang,
Linjie Li,
Jianfeng Wang,
Kevin Lin,
Ehsan Azarnasab,
Faisal Ahmed,
Zicheng Liu,
Ce Liu,
Michael Zeng,
Lijuan Wang
Abstract:
We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision experts to achieve multimodal reasoning and action. In this paper, we define and explore a comprehensive list of advanced vision tasks that are intriguing to solve, but may exceed the capabilities of existing vision and vision-language models. To achieve such advanced visual intelligence, MM-REACT introduces a tex…
▽ More
We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision experts to achieve multimodal reasoning and action. In this paper, we define and explore a comprehensive list of advanced vision tasks that are intriguing to solve, but may exceed the capabilities of existing vision and vision-language models. To achieve such advanced visual intelligence, MM-REACT introduces a textual prompt design that can represent text descriptions, textualized spatial coordinates, and aligned file names for dense visual signals such as images and videos. MM-REACT's prompt design allows language models to accept, associate, and process multimodal information, thereby facilitating the synergetic combination of ChatGPT and various vision experts. Zero-shot experiments demonstrate MM-REACT's effectiveness in addressing the specified capabilities of interests and its wide application in different scenarios that require advanced visual understanding. Furthermore, we discuss and compare MM-REACT's system paradigm with an alternative approach that extends language models for multimodal scenarios through joint finetuning. Code, demo, video, and visualization are available at https://multimodal-react.github.io/
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Diffusing the Optimal Topology: A Generative Optimization Approach
Authors:
Giorgio Giannone,
Faez Ahmed
Abstract:
Topology Optimization seeks to find the best design that satisfies a set of constraints while maximizing system performance. Traditional iterative optimization methods like SIMP can be computationally expensive and get stuck in local minima, limiting their applicability to complex or large-scale problems. Learning-based approaches have been developed to accelerate the topology optimization process…
▽ More
Topology Optimization seeks to find the best design that satisfies a set of constraints while maximizing system performance. Traditional iterative optimization methods like SIMP can be computationally expensive and get stuck in local minima, limiting their applicability to complex or large-scale problems. Learning-based approaches have been developed to accelerate the topology optimization process, but these methods can generate designs with floating material and low performance when challenged with out-of-distribution constraint configurations. Recently, deep generative models, such as Generative Adversarial Networks and Diffusion Models, conditioned on constraints and physics fields have shown promise, but they require extensive pre-processing and surrogate models for improving performance. To address these issues, we propose a Generative Optimization method that integrates classic optimization like SIMP as a refining mechanism for the topology generated by a deep generative model. We also remove the need for conditioning on physical fields using a computationally inexpensive approximation inspired by classic ODE solutions and reduce the number of steps needed to generate a feasible and performant topology. Our method allows us to efficiently generate good topologies and explicitly guide them to regions with high manufacturability and high performance, without the need for external auxiliary models or additional labeled data. We believe that our method can lead to significant advancements in the design and optimization of structures in engineering applications, and can be applied to a broader spectrum of performance-aware engineering design problems.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Rotational and inverse square potential effects on harmonic oscillator confined by flux field in a space-time with screw dislocation
Authors:
Faizuddin Ahmed,
Houcine Aounallah,
Prabir Rudra
Abstract:
This research paper delves into the study of a non-relativistic quantum system, considering the interplay of non-inertial effects induced by a rotating frame and confinement by the Aharonov-Bohm (AB) flux field with potential in the backdrop of topological defects, specifically a screw dislocation. We first focus on the harmonic oscillator problem, incorporating an inverse-square repulsive potenti…
▽ More
This research paper delves into the study of a non-relativistic quantum system, considering the interplay of non-inertial effects induced by a rotating frame and confinement by the Aharonov-Bohm (AB) flux field with potential in the backdrop of topological defects, specifically a screw dislocation. We first focus on the harmonic oscillator problem, incorporating an inverse-square repulsive potential. Notably, it becomes evident that the energy eigenvalues and wave functions are intricately influenced by multiple factors: the topological defect parameter $β$ (representing the screw dislocation), the presence of a rotating frame engaged in constant angular motion with speed $Ω$, and the external potential. Then we study the quantum behavior of non-relativistic particles, engaging in interactions governed by an inverse square potential, all while taking into account the effects of the rotating frame. In both scenarios, a significant observation is made: the quantum flux field's existence brings about a shift in the energy spectrum. This phenomenon bears a resemblance to the electromagnetic Aharonov-Bohm effect.
△ Less
Submitted 29 August, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Holistic IJTAG-based External and Internal Fault Monitoring in UAVs
Authors:
Foisal Ahmed,
Maksim Jenihhin
Abstract:
Cyber-Physical Systems (CPSs), such as Unmanned Aerial Vehicles (UAVs), use System-on-Chip (SoC) based computing platforms to perform multiple complex tasks in safety-critical applications that require a highly dependable operation. Due to continuous technological manufacturing miniaturization SoCs face a wide spectrum of chip-level reliability issues such as aging, soft and hard errors during the…
▽ More
Cyber-Physical Systems (CPSs), such as Unmanned Aerial Vehicles (UAVs), use System-on-Chip (SoC) based computing platforms to perform multiple complex tasks in safety-critical applications that require a highly dependable operation. Due to continuous technological manufacturing miniaturization SoCs face a wide spectrum of chip-level reliability issues such as aging, soft and hard errors during the operational lifetime of a UAV. In addition, external (off-chip) faults in the sensors, actuators, and motors are another cause of UAV failures. While existing works examine either on-chip faults (internal) or sensors/actuators faults (external) separately, this research proposes a UAV health monitoring infrastructure considering both external and internal faults holistically. The proposed method relies on the IEEE 1687 standard (IJTAG) and employs on-chip embedded instruments as health monitors to instantly access external and internal sensor data. Experimental results for functional simulation of a real-life case-study design demonstrate both types of fault detection by serving only three clock cycles and the localization process using 16 and 30 clock cycles for the case of single and double faults, respectively.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Unsupervised Recycled FPGA Detection Using Symmetry Analysis
Authors:
Tanvir Ahmad Tarique,
Foisal Ahmed,
Maksim Jenihhin,
Liakot Ali
Abstract:
Recently, recycled field-programmable gate arrays (FPGAs) pose a significant hardware security problem due to the proliferation of the semiconductor supply chain. Ring oscillator (RO) based frequency analyzing technique is one of the popular methods, where most studies used the known fresh FPGAs (KFFs) in machine learning-based detection, which is not a realistic approach. In this paper, we presen…
▽ More
Recently, recycled field-programmable gate arrays (FPGAs) pose a significant hardware security problem due to the proliferation of the semiconductor supply chain. Ring oscillator (RO) based frequency analyzing technique is one of the popular methods, where most studies used the known fresh FPGAs (KFFs) in machine learning-based detection, which is not a realistic approach. In this paper, we present a novel recycled FPGA detection method by examining the symmetry information of the RO frequency using unsupervised anomaly detection method. Due to the symmetrical array structure of the FPGA, some adjacent logic blocks on an FPGA have comparable RO frequencies, hence our method simply analyzes the RO frequencies of those blocks to determine how similar they are. The proposed approach efficiently categorizes recycled FPGAs by utilizing direct density ratio estimation through outliers detection. Experiments using Xilinx Artix-7 FPGAs demonstrate that the proposed method accurately classifies recycled FPGAs from 10 fresh FPGAs by x fewer computations compared with the conventional method.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Evolution of MAC Protocols in the Machine Learning Decade: A Comprehensive Survey
Authors:
Mostafa Hussien,
Islam A. T. F. Taj-Eddin,
Mohammed F. A. Ahmed,
Ali Ranjha,
Kim Khoa Nguyen,
Mohamed Cheriet
Abstract:
The last decade, (2012 - 2022), saw an unprecedented advance in machine learning (ML) techniques, particularly deep learning (DL). As a result of the proven capabilities of DL, a large amount of work has been presented and studied in almost every field. Since 2012, when the convolution neural networks have been reintroduced in the context of \textit{ImagNet} competition, DL continued to achieve su…
▽ More
The last decade, (2012 - 2022), saw an unprecedented advance in machine learning (ML) techniques, particularly deep learning (DL). As a result of the proven capabilities of DL, a large amount of work has been presented and studied in almost every field. Since 2012, when the convolution neural networks have been reintroduced in the context of \textit{ImagNet} competition, DL continued to achieve superior performance in many challenging tasks and problems. Wireless communications, in general, and medium access control (MAC) techniques, in particular, were among the fields that were heavily affected by this improvement. MAC protocols play a critical role in defining the performance of wireless communication systems. At the same time, the community lacks a comprehensive survey that collects, analyses, and categorizes the recent work in ML-inspired MAC techniques. In this work, we fill this gap by surveying a long line of work in this era. We solidify the impact of machine learning on wireless MAC protocols. We provide a comprehensive background to the widely adopted MAC techniques, their design issues, and their taxonomy, in connection with the famous application domains. Furthermore, we provide an overview of the ML techniques that have been considered in this context. Finally, we augment our work by proposing some promising future research directions and open research questions that are worth further investigation.
△ Less
Submitted 23 January, 2023;
originally announced February 2023.
-
Random projection tree similarity metric for SpectralNet
Authors:
Mashaan Alshammari,
John Stavrakakis,
Adel F. Ahmed,
Masahiro Takatsuka
Abstract:
SpectralNet is a graph clustering method that uses neural network to find an embedding that separates the data. So far it was only used with $k$-nn graphs, which are usually constructed using a distance metric (e.g., Euclidean distance). $k$-nn graphs restrict the points to have a fixed number of neighbors regardless of the local statistics around them. We proposed a new SpectralNet similarity met…
▽ More
SpectralNet is a graph clustering method that uses neural network to find an embedding that separates the data. So far it was only used with $k$-nn graphs, which are usually constructed using a distance metric (e.g., Euclidean distance). $k$-nn graphs restrict the points to have a fixed number of neighbors regardless of the local statistics around them. We proposed a new SpectralNet similarity metric based on random projection trees (rpTrees). Our experiments revealed that SpectralNet produces better clustering accuracy using rpTree similarity metric compared to $k$-nn graph with a distance metric. Also, we found out that rpTree parameters do not affect the clustering accuracy. These parameters include the leaf size and the selection of projection direction. It is computationally efficient to keep the leaf size in order of $\log(n)$, and project the points onto a random direction instead of trying to find the direction with the maximum dispersion.
△ Less
Submitted 25 February, 2023;
originally announced February 2023.
-
The Effect of Points Dispersion on the $k$-nn Search in Random Projection Forests
Authors:
Mashaan Alshammari,
John Stavrakakis,
Adel F. Ahmed,
Masahiro Takatsuka
Abstract:
Partitioning trees are efficient data structures for $k$-nearest neighbor search. Machine learning libraries commonly use a special type of partitioning trees called $k$d-trees to perform $k$-nn search. Unfortunately, $k$d-trees can be ineffective in high dimensions because they need more tree levels to decrease the vector quantization (VQ) error. Random projection trees rpTrees solve this scalabi…
▽ More
Partitioning trees are efficient data structures for $k$-nearest neighbor search. Machine learning libraries commonly use a special type of partitioning trees called $k$d-trees to perform $k$-nn search. Unfortunately, $k$d-trees can be ineffective in high dimensions because they need more tree levels to decrease the vector quantization (VQ) error. Random projection trees rpTrees solve this scalability problem by using random directions to split the data. A collection of rpTrees is called rpForest. $k$-nn search in an rpForest is influenced by two factors: 1) the dispersion of points along the random direction and 2) the number of rpTrees in the rpForest. In this study, we investigate how these two factors affect the $k$-nn search with varying $k$ values and different datasets. We found that with larger number of trees, the dispersion of points has a very limited effect on the $k$-nn search. One should use the original rpTree algorithm by picking a random direction regardless of the dispersion of points.
△ Less
Submitted 25 February, 2023;
originally announced February 2023.
-
Robust language-based mental health assessments in time and space through social media
Authors:
Siddharth Mangalik,
Johannes C. Eichstaedt,
Salvatore Giorgi,
Jihu Mun,
Farhan Ahmed,
Gilvir Gill,
Adithya V. Ganesan,
Shashanka Subrahmanya,
Nikita Soni,
Sean A. P. Clouston,
H. Andrew Schwartz
Abstract:
Compared to physical health, population mental health measurement in the U.S. is very coarse-grained. Currently, in the largest population surveys, such as those carried out by the Centers for Disease Control or Gallup, mental health is only broadly captured through "mentally unhealthy days" or "sadness", and limited to relatively infrequent state or metropolitan estimates. Through the large scale…
▽ More
Compared to physical health, population mental health measurement in the U.S. is very coarse-grained. Currently, in the largest population surveys, such as those carried out by the Centers for Disease Control or Gallup, mental health is only broadly captured through "mentally unhealthy days" or "sadness", and limited to relatively infrequent state or metropolitan estimates. Through the large scale analysis of social media data, robust estimation of population mental health is feasible at much higher resolutions, up to weekly estimates for counties. In the present work, we validate a pipeline that uses a sample of 1.2 billion Tweets from 2 million geo-located users to estimate mental health changes for the two leading mental health conditions, depression and anxiety. We find moderate to large associations between the language-based mental health assessments and survey scores from Gallup for multiple levels of granularity, down to the county-week (fixed effects $β= .25$ to $1.58$; $p<.001$). Language-based assessment allows for the cost-effective and scalable monitoring of population mental health at weekly time scales. Such spatially fine-grained time series are well suited to monitor effects of societal events and policies as well as enable quasi-experimental study designs in population health and other disciplines. Beyond mental health in the U.S., this method generalizes to a broad set of psychological outcomes and allows for community measurement in under-resourced settings where no traditional survey measures - but social media data - are available.
△ Less
Submitted 24 February, 2023;
originally announced February 2023.
-
Random Projection Forest Initialization for Graph Convolutional Networks
Authors:
Mashaan Alshammari,
John Stavrakakis,
Adel F. Ahmed,
Masahiro Takatsuka
Abstract:
Graph convolutional networks (GCNs) were a great step towards extending deep learning to unstructured data such as graphs. But GCNs still need a constructed graph to work with. To solve this problem, classical graphs such as $k$-nearest neighbor are usually used to initialize the GCN. Although it is computationally efficient to construct $k$-nn graphs, the constructed graph might not be very usefu…
▽ More
Graph convolutional networks (GCNs) were a great step towards extending deep learning to unstructured data such as graphs. But GCNs still need a constructed graph to work with. To solve this problem, classical graphs such as $k$-nearest neighbor are usually used to initialize the GCN. Although it is computationally efficient to construct $k$-nn graphs, the constructed graph might not be very useful for learning. In a $k$-nn graph, points are restricted to have a fixed number of edges, and all edges in the graph have equal weights. We present a new way to construct the graph and initialize the GCN. It is based on random projection forest (rpForest). rpForest enables us to assign varying weights on edges indicating varying importance, which enhanced the learning. The number of trees is a hyperparameter in rpForest. We performed spectral analysis to help us setting this parameter in the right range. In the experiments, initializing the GCN using rpForest provides better results compared to $k$-nn initialization.
△ Less
Submitted 22 October, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Graph Construction using Principal Axis Trees for Simple Graph Convolution
Authors:
Mashaan Alshammari,
John Stavrakakis,
Adel F. Ahmed,
Masahiro Takatsuka
Abstract:
Graph Neural Networks (GNNs) are increasingly becoming the favorite method for graph learning. They exploit the semi-supervised nature of deep learning, and they bypass computational bottlenecks associated with traditional graph learning methods. In addition to the feature matrix $X$, GNNs need an adjacency matrix $A$ to perform feature propagation. In many cases, the adjacency matrix $A$ is missi…
▽ More
Graph Neural Networks (GNNs) are increasingly becoming the favorite method for graph learning. They exploit the semi-supervised nature of deep learning, and they bypass computational bottlenecks associated with traditional graph learning methods. In addition to the feature matrix $X$, GNNs need an adjacency matrix $A$ to perform feature propagation. In many cases, the adjacency matrix $A$ is missing. We introduce a graph construction scheme that constructs the adjacency matrix $A$ using unsupervised and supervised information. Unsupervised information characterizes the neighborhood around points. We used Principal Axis trees (PA-trees) as a source for unsupervised information, where we create edges between points falling onto the same leaf node. For supervised information, we used the concept of penalty and intrinsic graphs. A penalty graph connects points with different class labels, whereas an intrinsic graph connects points with the same class labels. We used the penalty and intrinsic graphs to remove or add edges to the graph constructed via PA-tree. We tested this graph construction scheme on two well-known GNNs: 1) Graph Convolutional Network (GCN) and 2) Simple Graph Convolution (SGC). The experiments show that it is better to use SGC because it is faster and delivers better or the same results as GCN. We also test the effect of oversmoothing on both GCN and SGC. We found out that the level of smoothing has to be carefully selected for SGC to avoid oversmoothing.
△ Less
Submitted 7 November, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Authors:
Binyang Song,
Rui Zhou,
Faez Ahmed
Abstract:
In the rapidly advancing field of multi-modal machine learning (MMML), the convergence of multiple data modalities has the potential to reshape various applications. This paper presents a comprehensive overview of the current state, advancements, and challenges of MMML within the sphere of engineering design. The review begins with a deep dive into five fundamental concepts of MMML:multi-modal inf…
▽ More
In the rapidly advancing field of multi-modal machine learning (MMML), the convergence of multiple data modalities has the potential to reshape various applications. This paper presents a comprehensive overview of the current state, advancements, and challenges of MMML within the sphere of engineering design. The review begins with a deep dive into five fundamental concepts of MMML:multi-modal information representation, fusion, alignment, translation, and co-learning. Following this, we explore the cutting-edge applications of MMML, placing a particular emphasis on tasks pertinent to engineering design, such as cross-modal synthesis, multi-modal prediction, and cross-modal information retrieval. Through this comprehensive overview, we highlight the inherent challenges in adopting MMML in engineering design, and proffer potential directions for future research. To spur on the continued evolution of MMML in engineering design, we advocate for concentrated efforts to construct extensive multi-modal design datasets, develop effective data-driven MMML techniques tailored to design applications, and enhance the scalability and interpretability of MMML models. MMML models, as the next generation of intelligent design tools, hold a promising future to impact how products are designed.
△ Less
Submitted 28 July, 2023; v1 submitted 13 February, 2023;
originally announced February 2023.