-
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
Authors:
Akshara Prabhakar,
Thomas L. Griffiths,
R. Thomas McCoy
Abstract:
Chain-of-Thought (CoT) prompting has been shown to enhance the multi-step reasoning capabilities of Large Language Models (LLMs). However, debates persist about whether LLMs exhibit abstract generalization or rely on shallow heuristics when given CoT prompts. To understand the factors influencing CoT reasoning we provide a detailed case study of the symbolic reasoning task of decoding shift cipher…
▽ More
Chain-of-Thought (CoT) prompting has been shown to enhance the multi-step reasoning capabilities of Large Language Models (LLMs). However, debates persist about whether LLMs exhibit abstract generalization or rely on shallow heuristics when given CoT prompts. To understand the factors influencing CoT reasoning we provide a detailed case study of the symbolic reasoning task of decoding shift ciphers, where letters are shifted forward some number of steps in the alphabet. GPT-4 achieves zero accuracy on most shift ciphers with standard prompting, but with CoT its accuracy improves to an average of 32%. By focusing on a single relatively simple task, we are able to identify three factors that systematically affect CoT performance: the probability of the task's expected output (probability), what the model has implicitly learned during pre-training (memorization), and the number of intermediate operations involved in reasoning (noisy reasoning). We show that these factors can drastically influence the task accuracy; e.g., varying the output's probability of occurrence can shift accuracy from 26% to 70%. We also demonstrate that it is essential for the model to explicitly produce intermediate steps as output that can be conditioned on to increase the probability of the correct answer. Our experiments indicate that as long as the model does so, the validity of the demonstrations in the prompt does not matter. Overall, we conclude that CoT prompting performance reflects both memorization and a probabilistic version of genuine reasoning.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
An Initial Study Review of Designing a Technology Solution for Women in Technologically Deprived Areas or Low Resource Constraint Communities
Authors:
Jones Yeboah,
Sophia Bampoh,
Annu Sible Prabhakar
Abstract:
In the West African country of Ghana, depression is a significant issue affecting a large number of women. Despite its importance, the issue received insufficient attention during the COVID-19 pandemic. In developed countries, mobile phones serve as a convenient medium for accessing health information and providers. However, in Ghana, women's access to mobile phones is limited by cultural, social,…
▽ More
In the West African country of Ghana, depression is a significant issue affecting a large number of women. Despite its importance, the issue received insufficient attention during the COVID-19 pandemic. In developed countries, mobile phones serve as a convenient medium for accessing health information and providers. However, in Ghana, women's access to mobile phones is limited by cultural, social, and financial constraints, hindering their ability to seek mental health information and support. While some women in deprived areas can afford feature phones, such as the Nokia 3310, the lack of advanced smartphone features further restricts their access to necessary health information. This paper reviews the potential of Unstructured Supplementary Service Data (USSD) technology to address these challenges. Unlike Short Messaging Service (SMS), USSD can facilitate data collection, complex transactions, and provide information access without the need for internet connectivity. This research proposes studying the use of USSD to improve access to mental health resources for resource-deprived women in Ghana.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Active Exploration for Real-Time Haptic Training
Authors:
Jake Ketchum,
Ahalya Prabhakar,
Todd D. Murphey
Abstract:
Tactile perception is important for robotic systems that interact with the world through touch. Touch is an active sense in which tactile measurements depend on the contact properties of an interaction--e.g., velocity, force, acceleration--as well as properties of the sensor and object under test. These dependencies make training tactile perceptual models challenging. Additionally, the effects of…
▽ More
Tactile perception is important for robotic systems that interact with the world through touch. Touch is an active sense in which tactile measurements depend on the contact properties of an interaction--e.g., velocity, force, acceleration--as well as properties of the sensor and object under test. These dependencies make training tactile perceptual models challenging. Additionally, the effects of limited sensor life and the near-field nature of tactile sensors preclude the practical collection of exhaustive data sets even for fairly simple objects. Active learning provides a mechanism for focusing on only the most informative aspects of an object during data collection. Here we employ an active learning approach that uses a data-driven model's entropy as an uncertainty measure and explore relative to that entropy conditioned on the sensor state variables. Using a coverage-based ergodic controller, we train perceptual models in near-real time. We demonstrate our approach using a biomimentic sensor, exploring "tactile scenes" composed of shapes, textures, and objects. Each learned representation provides a perceptual sensor model for a particular tactile scene. Models trained on actively collected data outperform their randomly collected counterparts in real-time training tests. Additionally, we find that the resulting network entropy maps can be used to identify high salience portions of a tactile scene.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Language Models as Science Tutors
Authors:
Alexis Chevalier,
Jiayi Geng,
Alexander Wettig,
Howard Chen,
Sebastian Mizera,
Toni Annala,
Max Jameson Aragon,
Arturo Rodríguez Fanlo,
Simon Frieder,
Simon Machado,
Akshara Prabhakar,
Ellie Thieu,
Jiachen T. Wang,
Zirui Wang,
Xindi Wu,
Mengzhou Xia,
Wenhan Jia,
Jiatong Yu,
Jun-Jie Zhu,
Zhiyong Jason Ren,
Sanjeev Arora,
Danqi Chen
Abstract:
NLP has recently made exciting progress toward training language models (LMs) with strong scientific problem-solving skills. However, model development has not focused on real-life use-cases of LMs for science, including applications in education that require processing long scientific documents. To address this, we introduce TutorEval and TutorChat. TutorEval is a diverse question-answering bench…
▽ More
NLP has recently made exciting progress toward training language models (LMs) with strong scientific problem-solving skills. However, model development has not focused on real-life use-cases of LMs for science, including applications in education that require processing long scientific documents. To address this, we introduce TutorEval and TutorChat. TutorEval is a diverse question-answering benchmark consisting of questions about long chapters from STEM textbooks, written by experts. TutorEval helps measure real-life usability of LMs as scientific assistants, and it is the first benchmark combining long contexts, free-form generation, and multi-disciplinary scientific knowledge. Moreover, we show that fine-tuning base models with existing dialogue datasets leads to poor performance on TutorEval. Therefore, we create TutorChat, a dataset of 80,000 long synthetic dialogues about textbooks. We use TutorChat to fine-tune Llemma models with 7B and 34B parameters. These LM tutors specialized in math have a 32K-token context window, and they excel at TutorEval while performing strongly on GSM8K and MATH. Our datasets build on open-source materials, and we release our models, data, and evaluations.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Single and double quantum transitions in spin-mixed states under photo-excitation
Authors:
Anand Patel,
Zainab Chowdhry,
Anil Prabhakar,
A. Rathi,
V. P. Bhallamudi
Abstract:
Electronic spins associated with the Nitrogen-Vacancy (NV) center in diamond offer an opportunity to study spin-related phenomena with extremely high sensitivity owing to their high degree of optical polarization. Here, we study both single- and double-quantum transitions (SQT and DQT) in NV centers between spin-mixed states, which arise from magnetic fields that are non-collinear to the NV axis.…
▽ More
Electronic spins associated with the Nitrogen-Vacancy (NV) center in diamond offer an opportunity to study spin-related phenomena with extremely high sensitivity owing to their high degree of optical polarization. Here, we study both single- and double-quantum transitions (SQT and DQT) in NV centers between spin-mixed states, which arise from magnetic fields that are non-collinear to the NV axis. We demonstrate the amplification of the ESR signal from both these types of transition under laser illumination. We obtain hyperfine-resolved X-band ESR signal as a function of both excitation laser power and misalignment of static magnetic field with the NV axis. This combined with our analysis using a seven-level model that incorporates thermal polarization and double quantum relaxation allows us to comprehensively analyze the polarization of NV spins under off-axis fields. Such detailed understanding of spin-mixed states in NV centers under photo-excitation can help greatly in realizing NV-diamond platform's potential in sensing correlated magnets and biological samples, as well as other emerging applications, such as masing and nuclear hyperpolarization.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
Authors:
John Yang,
Akshara Prabhakar,
Karthik Narasimhan,
Shunyu Yao
Abstract:
Humans write code in a fundamentally interactive manner and rely on constant execution feedback to correct errors, resolve ambiguities, and decompose tasks. While LLMs have recently exhibited promising coding capabilities, current coding benchmarks mostly consider a static instruction-to-code sequence transduction process, which has the potential for error propagation and a disconnect between the…
▽ More
Humans write code in a fundamentally interactive manner and rely on constant execution feedback to correct errors, resolve ambiguities, and decompose tasks. While LLMs have recently exhibited promising coding capabilities, current coding benchmarks mostly consider a static instruction-to-code sequence transduction process, which has the potential for error propagation and a disconnect between the generated code and its final execution environment. To address this gap, we introduce InterCode, a lightweight, flexible, and easy-to-use framework of interactive coding as a standard reinforcement learning (RL) environment, with code as actions and execution feedback as observations. Our framework is language and platform agnostic, uses self-contained Docker environments to provide safe and reproducible execution, and is compatible out-of-the-box with traditional seq2seq coding methods, while enabling the development of new methods for interactive code generation. We use InterCode to create three interactive code environments with Bash, SQL, and Python as action spaces, leveraging data from the static NL2Bash, Spider, and MBPP datasets. We demonstrate InterCode's viability as a testbed by evaluating multiple state-of-the-art LLMs configured with different prompting strategies such as ReAct and Plan & Solve. Our results showcase the benefits of interactive code generation and demonstrate that InterCode can serve as a challenging benchmark for advancing code understanding and generation capabilities. InterCode is designed to be easily extensible and can even be used to create new tasks such as Capture the Flag, a popular coding puzzle that is inherently multi-step and involves multiple programming languages. Project site with code and data: https://intercode-benchmark.github.io
△ Less
Submitted 30 October, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Security of differential phase shift QKD against explicit individual attacks
Authors:
Valliamai Ramanathan,
Anil Prabhakar,
Prabha Mandayam
Abstract:
Quantum key distribution (QKD) is known to be unconditionally secure in principle, but quantifying the security of QKD protocols from a practical standpoint continues to remain an important challenge. Here, we focus on phase-based QKD protocols and characterize the security of the 3 and n-pulse Differential Phase Shift Quantum Key Distribution (DPS QKD) protocols against individual attacks. In par…
▽ More
Quantum key distribution (QKD) is known to be unconditionally secure in principle, but quantifying the security of QKD protocols from a practical standpoint continues to remain an important challenge. Here, we focus on phase-based QKD protocols and characterize the security of the 3 and n-pulse Differential Phase Shift Quantum Key Distribution (DPS QKD) protocols against individual attacks. In particular, we focus on the minimum error discrimination (MED) and cloning attacks and obtain the corresponding shrinking factor by which the sifted key needs to be shrunk in order to get a secure key. We compare the secure key rates thus obtained with the known lower bounds under a general individual attack. In a departure from the theoretical lower bounds, which have no explicit attack strategies, our work provides a practical assessment of the security of phase-based protocols based on attacks with known implementations.
△ Less
Submitted 12 March, 2024; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Scale-Invariant Specifications for Human-Swarm Systems
Authors:
Joel Meyer,
Ahalya Prabhakar,
Allison Pinosky,
Ian Abraham,
Annalisa Taylor,
Millicent Schlafly,
Katarina Popovic,
Giovani Diniz,
Brendan Teich,
Borislava Simidchieva,
Shane Clark,
Todd Murphey
Abstract:
We present a method for controlling a swarm using its spectral decomposition -- that is, by describing the set of trajectories of a swarm in terms of a spatial distribution throughout the operational domain -- guaranteeing scale invariance with respect to the number of agents both for computation and for the operator tasked with controlling the swarm. We use ergodic control, decentralized across t…
▽ More
We present a method for controlling a swarm using its spectral decomposition -- that is, by describing the set of trajectories of a swarm in terms of a spatial distribution throughout the operational domain -- guaranteeing scale invariance with respect to the number of agents both for computation and for the operator tasked with controlling the swarm. We use ergodic control, decentralized across the network, for implementation. In the DARPA OFFSET program field setting, we test this interface design for the operator using the STOMP interface -- the same interface used by Raytheon BBN throughout the duration of the OFFSET program. In these tests, we demonstrate that our approach is scale-invariant -- the user specification does not depend on the number of agents; it is persistent -- the specification remains active until the user specifies a new command; and it is real-time -- the user can interact with and interrupt the swarm at any time. Moreover, we show that the spectral/ergodic specification of swarm behavior degrades gracefully as the number of agents goes down, enabling the operator to maintain the same approach as agents become disabled or are added to the network. We demonstrate the scale-invariance and dynamic response of our system in a field relevant simulator on a variety of tactical scenarios with up to 50 agents. We also demonstrate the dynamic response of our system in the field with a smaller team of agents. Lastly, we make the code for our system available.
△ Less
Submitted 12 December, 2022; v1 submitted 6 December, 2022;
originally announced December 2022.
-
User-specific, Adaptable Safety Controllers Facilitate User Adoption in Human-Robot Collaboration
Authors:
Ahalya Prabhakar,
Aude Billard
Abstract:
As assistive and collaborative robots become more ubiquitous in the real-world, we need to develop interfaces and controllers that are safe for users to build trust and encourage adoption. In this Blue Sky paper, we discuss the need for co-evolving task and user-specific safety controllers that can accommodate people's safety preferences. We argue that while most adaptive controllers focus on beha…
▽ More
As assistive and collaborative robots become more ubiquitous in the real-world, we need to develop interfaces and controllers that are safe for users to build trust and encourage adoption. In this Blue Sky paper, we discuss the need for co-evolving task and user-specific safety controllers that can accommodate people's safety preferences. We argue that while most adaptive controllers focus on behavioral adaptation, safety adaptation is also a major consideration for building trust in collaborative systems. Furthermore, we highlight the need for adaptation over time, to account for user's changes in preferences as experience and trust builds. We provide a general formulation for what these interfaces should look like and what features are necessary for making them feasible and successful. In this formulation, users provide demonstrations and labelled safety ratings from which a safety value function is learned. These value functions can be updated by updating the safety labels on demonstrations to learn an updated function. We discuss how this can be implemented at a high-level, as well as some promising approaches and techniques for enabling this.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
QKD in the NISQ era: enhancing secure key rates via quantum error correction
Authors:
Shashank Kumar Ranu,
Anil Prabhakar,
Prabha Mandayam
Abstract:
Error mitigation is one of the key challenges in realising the full potential of quantum cryptographic protocols. Consequently, there is a lot of interest in adapting techniques from quantum error correction (QEC) to improve the robustness of quantum cryptographic protocols. In this work, we benchmark the performance of different QKD protocols on noisy quantum devices, with and without error corre…
▽ More
Error mitigation is one of the key challenges in realising the full potential of quantum cryptographic protocols. Consequently, there is a lot of interest in adapting techniques from quantum error correction (QEC) to improve the robustness of quantum cryptographic protocols. In this work, we benchmark the performance of different QKD protocols on noisy quantum devices, with and without error correction. We obtain the secure key rates of BB84, B92 and BBM92 QKD protocols over a quantum channel that is subject to amplitude-dam** noise. We demonstrate, theoretically and via implementations on the IBM quantum processors, that B92 is the optimal protocol under amplitude-dam** and generalized amplitude-dam** noise. We then show that the security of the noisy BBM92 protocol crucially depends on the type and the mode of distribution of an entangled pair. Finally, we implement an error-corrected BB84 protocol using dual-rail encoding on a noisy quantum processor, and show that the dual-rail BB84 implementation outperforms the conventional BB84 in the presence of noise. Our secure key rate calculation also takes into account the effects of CNOT imperfections on the error rates of the protocols.
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
Proceedings of the AI-HRI Symposium at AAAI-FSS 2022
Authors:
Zhao Han,
Emmanuel Senft,
Muneeb I. Ahmad,
Shelly Bagchi,
Amir Yazdani,
Jason R. Wilson,
Boyoung Kim,
Ruchen Wen,
Justin W. Hart,
Daniel Hernández García,
Matteo Leonetti,
Ross Mead,
Reuth Mirsky,
Ahalya Prabhakar,
Megan L. Zimmerman
Abstract:
The Artificial Intelligence (AI) for Human-Robot Interaction (HRI) Symposium has been a successful venue of discussion and collaboration on AI theory and methods aimed at HRI since 2014. This year, after a review of the achievements of the AI-HRI community over the last decade in 2021, we are focusing on a visionary theme: exploring the future of AI-HRI. Accordingly, we added a Blue Sky Ideas trac…
▽ More
The Artificial Intelligence (AI) for Human-Robot Interaction (HRI) Symposium has been a successful venue of discussion and collaboration on AI theory and methods aimed at HRI since 2014. This year, after a review of the achievements of the AI-HRI community over the last decade in 2021, we are focusing on a visionary theme: exploring the future of AI-HRI. Accordingly, we added a Blue Sky Ideas track to foster a forward-thinking discussion on future research at the intersection of AI and HRI. As always, we appreciate all contributions related to any topic on AI/HRI and welcome new researchers who wish to take part in this growing community.
With the success of past symposia, AI-HRI impacts a variety of communities and problems, and has pioneered the discussions in recent trends and interests. This year's AI-HRI Fall Symposium aims to bring together researchers and practitioners from around the globe, representing a number of university, government, and industry laboratories. In doing so, we hope to accelerate research in the field, support technology transition and user adoption, and determine future directions for our group and our research.
△ Less
Submitted 28 November, 2022; v1 submitted 28 September, 2022;
originally announced September 2022.
-
Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation
Authors:
Deeksha Varshney,
Akshara Prabhakar,
Asif Ekbal
Abstract:
Grounding dialogue on external knowledge and interpreting linguistic patterns in dialogue history context, such as ellipsis, anaphora, and co-references is critical for dialogue comprehension and generation. In this paper, we present a novel open-domain dialogue generation model which effectively utilizes the large-scale commonsense and named entity based knowledge in addition to the unstructured…
▽ More
Grounding dialogue on external knowledge and interpreting linguistic patterns in dialogue history context, such as ellipsis, anaphora, and co-references is critical for dialogue comprehension and generation. In this paper, we present a novel open-domain dialogue generation model which effectively utilizes the large-scale commonsense and named entity based knowledge in addition to the unstructured topic-specific knowledge associated with each utterance. We enhance the commonsense knowledge with named entity-aware structures using co-references. Our proposed model utilizes a multi-hop attention layer to preserve the most accurate and critical parts of the dialogue history and the associated knowledge. In addition, we employ a Commonsense and Named Entity Enhanced Attention Module, which starts with the extracted triples from various sources and gradually finds the relevant supporting set of triples using multi-hop attention with the query vector obtained from the interactive dialogue-knowledge module. Empirical results on two benchmark dataset demonstrate that our model significantly outperforms the state-of-the-art methods in terms of both automatic evaluation metrics and human judgment. Our code is publicly available at \href{https://github.com/deekshaVarshney/CNTF}{https://github.com/deekshaVarshney/CNTF}; \href{https://www.iitp.ac.in/~ai-nlp-ml/resources/codes/CNTF.zip}{https://www.iitp.ac.in/-ai-nlp-ml/resources/ codes/CNTF.zip}.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
Audio-Visual Object Classification for Human-Robot Collaboration
Authors:
A. Xompero,
Y. L. Pang,
T. Patten,
A. Prabhakar,
B. Calli,
A. Cavallaro
Abstract:
Human-robot collaboration requires the contactless estimation of the physical properties of containers manipulated by a person, for example while pouring content in a cup or moving a food box. Acoustic and visual signals can be used to estimate the physical properties of such objects, which may vary substantially in shape, material and size, and also be occluded by the hands of the person. To faci…
▽ More
Human-robot collaboration requires the contactless estimation of the physical properties of containers manipulated by a person, for example while pouring content in a cup or moving a food box. Acoustic and visual signals can be used to estimate the physical properties of such objects, which may vary substantially in shape, material and size, and also be occluded by the hands of the person. To facilitate comparisons and stimulate progress in solving this problem, we present the CORSMAL challenge and a dataset to assess the performance of the algorithms through a set of well-defined performance scores. The tasks of the challenge are the estimation of the mass, capacity, and dimensions of the object (container), and the classification of the type and amount of its content. A novel feature of the challenge is our real-to-simulation framework for visualising and assessing the impact of estimation errors in human-to-robot handovers.
△ Less
Submitted 3 March, 2022;
originally announced March 2022.
-
CL-NERIL: A Cross-Lingual Model for NER in Indian Languages
Authors:
Akshara Prabhakar,
Gouri Sankar Majumder,
Ashish Anand
Abstract:
Develo** Named Entity Recognition (NER) systems for Indian languages has been a long-standing challenge, mainly owing to the requirement of a large amount of annotated clean training instances. This paper proposes an end-to-end framework for NER for Indian languages in a low-resource setting by exploiting parallel corpora of English and Indian languages and an English NER dataset. The proposed f…
▽ More
Develo** Named Entity Recognition (NER) systems for Indian languages has been a long-standing challenge, mainly owing to the requirement of a large amount of annotated clean training instances. This paper proposes an end-to-end framework for NER for Indian languages in a low-resource setting by exploiting parallel corpora of English and Indian languages and an English NER dataset. The proposed framework includes an annotation projection method that combines word alignment score and NER tag prediction confidence score on source language (English) data to generate weakly labeled data in a target Indian language. We employ a variant of the Teacher-Student model and optimize it jointly on the pseudo labels of the Teacher model and predictions on the generated weakly labeled data. We also present manually annotated test sets for three Indian languages: Hindi, Bengali, and Gujarati. We evaluate the performance of the proposed framework on the test sets of the three Indian languages. Empirical results show a minimum 10% performance improvement compared to the zero-shot transfer learning model on all languages. This indicates that weakly labeled data generated using the proposed annotation projection method in target Indian languages can complement well-annotated source language data to enhance performance. Our code is publicly available at https://github.com/aksh555/CL-NERIL
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
3G R&D: R&D for the Next Generation of Ground-based Gravitational-wave Detectors
Authors:
David McClelland,
Harald Lueck,
Rana Adhikari,
Masaki Ando,
GariLynn Billingsley,
Geppo Cagnoli,
Matt Evans,
Martin Fejer,
Andreas Freise,
Paul Fulda,
Eric Genin,
Gabriela González,
Jan Harms,
Stefan Hild,
Giovanni Losurdo,
Ian Martin,
Anil Prabhakar,
Stuart Reid,
Fulvio Ricci,
Norna Robertson,
Jo van den Brand,
Benno Willke,
Michael Zucker,
Alessandro Bertolini,
Stefan Danilishin
, et al. (21 additional authors not shown)
Abstract:
To deliver on the promise of next generation gravitational-wave observatories, a sustained and coordinated detector research and development program is required. This report examines in detail the wide range of nearer- and longer-term detector R&D programs needed for next generation GW detectors commensurate with the key science targets presented in "The Next Generation Global Gravitational Wave O…
▽ More
To deliver on the promise of next generation gravitational-wave observatories, a sustained and coordinated detector research and development program is required. This report examines in detail the wide range of nearer- and longer-term detector R&D programs needed for next generation GW detectors commensurate with the key science targets presented in "The Next Generation Global Gravitational Wave Observatory: The Science Book", including considerations of site selection and large-scale vacuum infrastructure. The report makes a series of detailed recommendations on the needed advances in detector technology and the timescales needed to achieve those advances. It also identifies areas where larger-scale globally coordinated R&D efforts will be critical to ensuring success while minimizing costs.
This report is the third in a six part series of reports by the GWIC 3G Subcommittee: i) Expanding the Reach of Gravitational Wave Observatories to the Edge of the Universe, ii) The Next Generation Global Gravitational Wave Observatory: The Science Book, iii) 3G R&D: R&D for the Next Generation of Ground-based Gravitational Wave Detectors (this report), iv) Gravitational Wave Data Analysis: Computing Challenges in the 3G Era, v) Future Ground-based Gravitational-wave Observatories: Synergies with Other Scientific Communities, and vi) An Exploration of Possible Governance Models for the Future Global Gravitational-Wave Observatory Network.
△ Less
Submitted 12 November, 2021;
originally announced November 2021.
-
Multimodal Sensory Learning for Real-time, Adaptive Manipulation
Authors:
Ahalya Prabhakar,
Stanislas Furrer,
Lorenzo Panchetti,
Maxence Perret,
Aude Billard
Abstract:
Adaptive control for real-time manipulation requires quick estimation and prediction of object properties. While robot learning in this area primarily focuses on using vision, many tasks cannot rely on vision due to object occlusion. Here, we formulate a learning framework that uses multimodal sensory fusion of tactile and audio data in order to quickly characterize and predict an object's propert…
▽ More
Adaptive control for real-time manipulation requires quick estimation and prediction of object properties. While robot learning in this area primarily focuses on using vision, many tasks cannot rely on vision due to object occlusion. Here, we formulate a learning framework that uses multimodal sensory fusion of tactile and audio data in order to quickly characterize and predict an object's properties. The predictions are used in a developed reactive controller to adapt the grip on the object to compensate for the predicted inertial forces experienced during motion. Drawing inspiration from how humans interact with objects, we propose an experimental setup from which we can understand how to best utilize different sensory signals and actively interact with and manipulate objects to quickly learn their object properties for safe manipulation.
△ Less
Submitted 9 October, 2021;
originally announced October 2021.
-
Credit Assignment Safety Learning from Human Demonstrations
Authors:
Ahalya Prabhakar,
Aude Billard
Abstract:
A critical need in assistive robotics, such as assistive wheelchairs for navigation, is a need to learn task intent and safety guarantees through user interactions in order to ensure safe task performance. For tasks where the objectives from the user are not easily defined, learning from user demonstrations has been a key step in enabling learning. However, most robot learning from demonstration (…
▽ More
A critical need in assistive robotics, such as assistive wheelchairs for navigation, is a need to learn task intent and safety guarantees through user interactions in order to ensure safe task performance. For tasks where the objectives from the user are not easily defined, learning from user demonstrations has been a key step in enabling learning. However, most robot learning from demonstration (LfD) methods primarily rely on optimal demonstration in order to successfully learn a control policy, which can be challenging to acquire from novice users. Recent work does use suboptimal and failed demonstrations to learn about task intent; few focus on learning safety guarantees to prevent repeat failures experienced, essential for assistive robots. Furthermore, interactive human-robot learning aims to minimize effort from the human user to facilitate deployment in the real-world. As such, requiring users to label the unsafe states or keyframes from the demonstrations should not be a necessary requirement for learning. Here, we propose an algorithm to learn a safety value function from a set of suboptimal and failed demonstrations that is used to generate a real-time safety control filter. Importantly, we develop a credit assignment method that extracts the failure states from the failed demonstrations without requiring human labelling or prespecified knowledge of unsafe regions. Furthermore, we extend our formulation to allow for user-specific safety functions, by incorporating user-defined safety rankings from which we can generate safety level sets according to the users' preferences. By using both suboptimal and failed demonstrations and the developed credit assignment formulation, we enable learning a safety value function with minimal effort needed from the user, making it more feasible for widespread use in human-robot interactive learning tasks.
△ Less
Submitted 9 October, 2021;
originally announced October 2021.
-
A spatial-photonic Ising machine to solve the two-way number-partitioning problem
Authors:
Vikram Ramesh,
Vighnesh Natarajan,
Anil Prabhakar
Abstract:
We evaluate the performance of different algorithms in minimizing the Hamiltonian of a spatial-photonic Ising machine (SPIM). We then encode the number-partitioning problem on the SPIM and adiabatically arrive at good solutions for the problem for over 16000 spins, with a time complexity that only scales linearly with problem size. Finally, we benchmark our machine performance against the classica…
▽ More
We evaluate the performance of different algorithms in minimizing the Hamiltonian of a spatial-photonic Ising machine (SPIM). We then encode the number-partitioning problem on the SPIM and adiabatically arrive at good solutions for the problem for over 16000 spins, with a time complexity that only scales linearly with problem size. Finally, we benchmark our machine performance against the classical solver, Gurobi, and also a D-Wave 5000+ quantum annealer. With just one spatial light modulator, and and adiabatic evolution scheme for the phase, our results surpass current state-of-the-art SPIMs. We reduce hardware costs, and can solve larger problems more efficiently.
△ Less
Submitted 3 October, 2021;
originally announced October 2021.
-
Gated InGaAs Detector Characterization with Sub-Picosecond Weak Coherent Pulses
Authors:
Gautam Kumar Shaw,
Shyam Sridharan,
Anil Prabhakar
Abstract:
We propose and demonstrate a method to characterize a gated InGaAs single-photon detector (SPD). Ultrashort weak coherent pulses, from a mode-locked sub-picosecond pulsed laser, were used to measure photon counts, at varying arrival times relative to the start of the SPD gate voltage. The uneven detection probabilities within the gate window were used to estimate the afterpulse probability with re…
▽ More
We propose and demonstrate a method to characterize a gated InGaAs single-photon detector (SPD). Ultrashort weak coherent pulses, from a mode-locked sub-picosecond pulsed laser, were used to measure photon counts, at varying arrival times relative to the start of the SPD gate voltage. The uneven detection probabilities within the gate window were used to estimate the afterpulse probability with respect to various detector parameters: excess bias, width of gate window and hold-off time. We estimated a lifetime of 2.1 microseconds for the half-life of trapped carriers, using a power-law fit to the decay in afterpulse probability. Finally, we quantify the timing jitter of the SPD using a time to digital converter with a resolution of 55 ps.
△ Less
Submitted 11 July, 2022; v1 submitted 10 June, 2021;
originally announced June 2021.
-
Ergodic imitation: Learning from what to do and what not to do
Authors:
Aleksandra Kalinowska,
Ahalya Prabhakar,
Kathleen Fitzsimons,
Todd Murphey
Abstract:
With growing access to versatile robotics, it is beneficial for end users to be able to teach robots tasks without needing to code a control policy. One possibility is to teach the robot through successful task executions. However, near-optimal demonstrations of a task can be difficult to provide and even successful demonstrations can fail to capture task aspects key to robust skill replication. H…
▽ More
With growing access to versatile robotics, it is beneficial for end users to be able to teach robots tasks without needing to code a control policy. One possibility is to teach the robot through successful task executions. However, near-optimal demonstrations of a task can be difficult to provide and even successful demonstrations can fail to capture task aspects key to robust skill replication. Here, we propose a learning from demonstration (LfD) approach that enables learning of robust task definitions without the need for near-optimal demonstrations. We present a novel algorithmic framework for learning tasks based on the ergodic metric -- a measure of information content in motion. Moreover, we make use of negative demonstrations -- demonstrations of what not to do -- and show that they can help compensate for imperfect demonstrations, reduce the number of demonstrations needed, and highlight crucial task elements improving robot performance. In a proof-of-concept example of cart-pole inversion, we show that negative demonstrations alone can be sufficient to successfully learn and recreate a skill. Through a human subject study with 24 participants, we show that consistently more information about a task can be captured from combined positive and negative (posneg) demonstrations than from the same amount of just positive demonstrations. Finally, we demonstrate our learning approach on simulated tasks of target reaching and table cleaning with a 7-DoF Franka arm. Our results point towards a future with robust, data-efficient LfD for novice users.
△ Less
Submitted 31 March, 2021;
originally announced March 2021.
-
Equivalence of space and time-bins in DPS-QKD
Authors:
Gautam Shaw,
Shyam Sridharan,
Shashank Ranu,
Foram Shingala,
Prabha Mandayam,
Anil Prabhakar
Abstract:
Key generation efficiency, and security, in DPS-QKD improve with an increase in the number of path delays or time-bin superpositions. We demonstrate the implementation of super-position states using time-bins, and establish an equivalence with path-based superposition, thus yielding a simpler implementation of higher-order superposition states for differential phase-shift quantum key distribution…
▽ More
Key generation efficiency, and security, in DPS-QKD improve with an increase in the number of path delays or time-bin superpositions. We demonstrate the implementation of super-position states using time-bins, and establish an equivalence with path-based superposition, thus yielding a simpler implementation of higher-order superposition states for differential phase-shift quantum key distribution (DPS-QKD). We set up DPS-QKD, over 105 km of single mode optical fiber, with a quantum bit error rate of less than 15% at a secure key rate of 2 kbps. With temporal guard bands, the QBER reduced to less than 10%, but with a 20% reduction in the key rate.
△ Less
Submitted 11 July, 2022; v1 submitted 7 August, 2020;
originally announced August 2020.
-
Ergodic Specifications for Flexible Swarm Control: From User Commands to Persistent Adaptation
Authors:
Ahalya Prabhakar,
Ian Abraham,
Annalisa Taylor,
Millicent Schlafly,
Katarina Popovic,
Giovani Diniz,
Brendan Teich,
Borislava Simidchieva,
Shane Clark,
Todd Murphey
Abstract:
This paper presents a formulation for swarm control and high-level task planning that is dynamically responsive to user commands and adaptable to environmental changes. We design an end-to-end pipeline from a tactile tablet interface for user commands to onboard control of robotic agents based on decentralized ergodic coverage. Our approach demonstrates reliable and dynamic control of a swarm coll…
▽ More
This paper presents a formulation for swarm control and high-level task planning that is dynamically responsive to user commands and adaptable to environmental changes. We design an end-to-end pipeline from a tactile tablet interface for user commands to onboard control of robotic agents based on decentralized ergodic coverage. Our approach demonstrates reliable and dynamic control of a swarm collective through the use of ergodic specifications for planning and executing agent trajectories as well as responding to user and external inputs. We validate our approach in a virtual reality simulation environment and in real-world experiments at the DARPA OFFSET Urban Swarm Challenge FX3 field tests with a robotic swarm where user-based control of the swarm and mission-based tasks require a dynamic and flexible response to changing conditions and objectives in real-time.
△ Less
Submitted 10 June, 2020;
originally announced June 2020.
-
An Ergodic Measure for Active Learning From Equilibrium
Authors:
Ian Abraham,
Ahalya Prabhakar,
Todd D. Murphey
Abstract:
This paper develops KL-Ergodic Exploration from Equilibrium ($\text{KL-E}^3$), a method for robotic systems to integrate stability into actively generating informative measurements through ergodic exploration. Ergodic exploration enables robotic systems to indirectly sample from informative spatial distributions globally, avoiding local optima, and without the need to evaluate the derivatives of t…
▽ More
This paper develops KL-Ergodic Exploration from Equilibrium ($\text{KL-E}^3$), a method for robotic systems to integrate stability into actively generating informative measurements through ergodic exploration. Ergodic exploration enables robotic systems to indirectly sample from informative spatial distributions globally, avoiding local optima, and without the need to evaluate the derivatives of the distribution against the robot dynamics. Using hybrid systems theory, we derive a controller that allows a robot to exploit equilibrium policies (i.e., policies that solve a task) while allowing the robot to explore and generate informative data using an ergodic measure that can extend to high-dimensional states. We show that our method is able to maintain Lyapunov attractiveness with respect to the equilibrium task while actively generating data for learning tasks such, as Bayesian optimization, model learning, and off-policy reinforcement learning. In each example, we show that our proposed method is capable of generating an informative distribution of data while synthesizing smooth control signals. We illustrate these examples using simulated systems and provide simplification of our method for real-time online learning in robotic systems.
△ Less
Submitted 7 December, 2020; v1 submitted 5 June, 2020;
originally announced June 2020.
-
Differential phase encoded measurement-device-independent quantum key distribution
Authors:
Shashank Kumar Ranu,
Anil Prabhakar,
Prabha Mandayam
Abstract:
We present a measurement-device-independent quantum key distribution (MDI-QKD) using single photons in a linear superposition of three orthogonal time-bin states, for generating the key. The orthogonal states correspond to three distinct paths in the delay line interferometers used by two (trusted) sources. The key information is decoded based on the measurement outcomes obtained by an untrusted t…
▽ More
We present a measurement-device-independent quantum key distribution (MDI-QKD) using single photons in a linear superposition of three orthogonal time-bin states, for generating the key. The orthogonal states correspond to three distinct paths in the delay line interferometers used by two (trusted) sources. The key information is decoded based on the measurement outcomes obtained by an untrusted third party Charles, who uses a beamsplitter to measure the phase difference between pulses traveling through different paths of the two delay lines. The proposed scheme combines the best of both differential-phase-shift (DPS) QKD and MDI-QKD. It is more robust against phase fluctuations, and also ensures protection against detector side-channel attacks. We prove unconditional security by demonstrating an equivalent protocol involving shared entanglement between the two trusted parties. We show that the secure key rate for our protocol compares well to existing protocols in the asymptotic regime. For the decoy-state variant of our protocol, we evaluate the secure key rate by using a phase-post-selection technique. Finally, we estimate the bit error rate and the phase error rate, in the finite key regime.
△ Less
Submitted 23 February, 2021; v1 submitted 27 May, 2019;
originally announced May 2019.
-
Active Area Coverage from Equilibrium
Authors:
Ian Abraham,
Ahalya Prabhakar,
Todd D. Murphey
Abstract:
This paper develops a method for robots to integrate stability into actively seeking out informative measurements through coverage. We derive a controller using hybrid systems theory that allows us to consider safe equilibrium policies during active data collection. We show that our method is able to maintain Lyapunov attractiveness while still actively seeking out data. Using incremental sparse G…
▽ More
This paper develops a method for robots to integrate stability into actively seeking out informative measurements through coverage. We derive a controller using hybrid systems theory that allows us to consider safe equilibrium policies during active data collection. We show that our method is able to maintain Lyapunov attractiveness while still actively seeking out data. Using incremental sparse Gaussian processes, we define distributions which allow a robot to actively seek out informative measurements. We illustrate our methods for shape estimation using a cart double pendulum, dynamic model learning of a hovering quadrotor, and generating gallo** gaits starting from stationary equilibrium by learning a dynamics model for the half-cheetah system from the Roboschool environment.
△ Less
Submitted 8 February, 2019;
originally announced February 2019.
-
Autonomous Visual Rendering using Physical Motion
Authors:
Ahalya Prabhakar,
Anastasia Mavrommati,
Jarvis Schultz,
Todd Murphey
Abstract:
This paper addresses the problem of enabling a robot to represent and recreate visual information through physical motion, focusing on drawing using pens, brushes, or other tools. This work uses ergodicity as a control objective that translates planar visual input to physical motion without preprocessing (e.g., image processing, motion primitives). % or human-generated training data (i.e., machine…
▽ More
This paper addresses the problem of enabling a robot to represent and recreate visual information through physical motion, focusing on drawing using pens, brushes, or other tools. This work uses ergodicity as a control objective that translates planar visual input to physical motion without preprocessing (e.g., image processing, motion primitives). % or human-generated training data (i.e., machine learning).
We achieve comparable results to existing drawing methods, while reducing the algorithmic complexity of the software. We demonstrate that optimal ergodic control algorithms with different time-horizon characteristics (infinitesimal, finite, and receding horizon) can generate qualitatively and stylistically different motions that render a wide range of visual information (e.g., letters, portraits, landscapes). In addition, we show that ergodic control enables the same software design to apply to multiple robotic systems by incorporating their particular dynamics, thereby reducing the dependence on task-specific robots. Finally, we demonstrate physical drawings with the Baxter robot.
△ Less
Submitted 8 September, 2017;
originally announced September 2017.
-
Ergodic Exploration using Binary Sensing for Non-Parametric Shape Estimation
Authors:
Ian Abraham,
Ahalya Prabhakar,
Mitra J. Z. Hartmann,
Todd D. Murphey
Abstract:
Current methods to estimate object shape---using either vision or touch---generally depend on high-resolution sensing. Here, we exploit ergodic exploration to demonstrate successful shape estimation when using a low-resolution binary contact sensor. The measurement model is posed as a collision-based tactile measurement, and classification methods are used to discriminate between shape boundary re…
▽ More
Current methods to estimate object shape---using either vision or touch---generally depend on high-resolution sensing. Here, we exploit ergodic exploration to demonstrate successful shape estimation when using a low-resolution binary contact sensor. The measurement model is posed as a collision-based tactile measurement, and classification methods are used to discriminate between shape boundary regions in the search space. Posterior likelihood estimates of the measurement model help the system actively seek out regions where the binary sensor is most likely to return informative measurements. Results show successful shape estimation of various objects as well as the ability to identify multiple objects in an environment. Interestingly, it is shown that ergodic exploration utilizes non-contact motion to gather significant information about shape. The algorithm is extended in three dimensions in simulation and we present two dimensional experimental results using the Rethink Baxter robot.
△ Less
Submitted 5 September, 2017;
originally announced September 2017.
-
Magnetization spin dynamics in a (LuBi)3Fe5O12 (BLIG) epitaxial film
Authors:
M. Malathi,
G. Venkat,
A. Arora,
I. I. Syvorotka,
V. Sivasubramanian,
A. Prabhakar
Abstract:
Bismuth substituted lutetium iron garnet (BLIG) films exhibit larger Faraday rotation, and have a higher Curie temperature than yttrium iron garnet. We have observed magnetic stripe domains and measured domain widths of 1.4 μμm using Fourier domain polarization microscopy, Faraday rotation experiments yield a coercive field of 5 Oe. These characterizations form the basis of micromagnetic simulatio…
▽ More
Bismuth substituted lutetium iron garnet (BLIG) films exhibit larger Faraday rotation, and have a higher Curie temperature than yttrium iron garnet. We have observed magnetic stripe domains and measured domain widths of 1.4 μμm using Fourier domain polarization microscopy, Faraday rotation experiments yield a coercive field of 5 Oe. These characterizations form the basis of micromagnetic simulations that allow us to estimate and compare spin wave excitations in BLIG films. We observed that these films support thermal magnons with a precessional frequency of 7 GHz with a line width of 400 MHz. Further, we studied the dependence of precessional frequency on the externally applied magnetic field. Brillouin light scattering experiments and precession frequencies predicted by simulations show similar trend with increasing field.
△ Less
Submitted 11 June, 2017;
originally announced June 2017.
-
Absorbing boundary layers for spin wave micromagnetics
Authors:
G. Venkat,
H. Fangohr,
A. Prabhakar
Abstract:
Micromagnetic simulations are used to investigate the effects of different absorbing boundary layers (ABLs) on spin waves (SWs) reflected from the edges of a magnetic nano-structure. We define the conditions that a suitable ABL must fulfill and compare the performance of abrupt, linear, polynomial and tan hyperbolic dam** profiles in the ABL. We first consider normal incidence in a permalloy str…
▽ More
Micromagnetic simulations are used to investigate the effects of different absorbing boundary layers (ABLs) on spin waves (SWs) reflected from the edges of a magnetic nano-structure. We define the conditions that a suitable ABL must fulfill and compare the performance of abrupt, linear, polynomial and tan hyperbolic dam** profiles in the ABL. We first consider normal incidence in a permalloy stripe and propose a transmission line model to quantify reflections and calculate the loss introduced into the stripe due to the ABL. We find that a parabolic dam** profile absorbs the SW energy efficiently and has a low reflection coefficient, thus performing much better than the commonly used abrupt dam** profile. We then investigated SWs that are obliquely incident at 26.6, 45 and 63.4 degrees on the edge of a yttrium-iron-garnet film. The parabolic dam** profile again performs efficiently by showing a high SW energy transfer to the ABL and a low reflected SW amplitude.
△ Less
Submitted 11 June, 2017;
originally announced June 2017.
-
Coherent microwave generation by spintronic feedback oscillator
Authors:
Dinesh Kumar,
K. Konishi,
Nikhil Kumar,
S. Miwa,
A. Fukushima,
K. Yakushiji,
S. Yuasa,
H. Kubota,
C. V. Tomy,
A. Prabhakar,
Y. Suzuki,
A. Tulapurkar
Abstract:
The transfer of spin angular momentum to a nanomagnet from a spin polarized current provides an efficient means of controlling the magnetization direction in nanomagnets. A unique consequence of this spin torque is that the spontaneous oscillations of the magnetization can be induced by applying a combination of a dc bias current and a magnetic field. Here we experimentally demonstrate a different…
▽ More
The transfer of spin angular momentum to a nanomagnet from a spin polarized current provides an efficient means of controlling the magnetization direction in nanomagnets. A unique consequence of this spin torque is that the spontaneous oscillations of the magnetization can be induced by applying a combination of a dc bias current and a magnetic field. Here we experimentally demonstrate a different effect, which can drive a nanomagnet into spontaneous oscillations without the need of external spin torque injection. For the demonstration of this effect, we use a nano-pillar of magnetic tunnel junction (MTJ) powered by a dc current and connected to a coplanar waveguide (CPW) lying above the free layer of the MTJ. Any fluctuation of the free layer magnetization is converted into oscillating voltage via the tunneling magneto-resistance effect and is fed back into the MTJ by the CPW through inductive coupling. As a result of this feedback, the magnetization of the free layer can be driven into a continual precession. The combination of MTJ and CPW behaves similar to a laser system and outputs a stable rf power with quality factor exceeding 10,000.
△ Less
Submitted 12 August, 2016;
originally announced August 2016.
-
Teaching Python programming with automatic assessment and feedback provision
Authors:
Hans Fangohr,
Neil O'Brien,
Anil Prabhakar,
Arti Kashyap
Abstract:
We describe a method of automatic feedback provision for students learning programming and computational methods in Python. We have implemented, used and refined this system since 2009 for growing student numbers, and summarise the design and experience of using it. The core idea is to use a unit testing framework: the teacher creates a set of unit tests, and the student code is tested by running…
▽ More
We describe a method of automatic feedback provision for students learning programming and computational methods in Python. We have implemented, used and refined this system since 2009 for growing student numbers, and summarise the design and experience of using it. The core idea is to use a unit testing framework: the teacher creates a set of unit tests, and the student code is tested by running these tests. With our implementation, students typically submit work for assessment, and receive feedback by email within a few minutes after submission. The choice of tests and the reporting back to the student is chosen to optimise the educational value for the students. The system very significantly reduces the staff time required to establish whether a student's solution is correct, and shifts the emphasis of computing laboratory student contact time from assessing correctness to providing guidance. The self-paced nature of the automatic feedback provision supports a student-centred learning approach. Students can re-submit their work repeatedly and iteratively improve their solution, and enjoy using the system. We include an evaluation of the system and data from using it in a class of 425 students.
△ Less
Submitted 11 September, 2015;
originally announced September 2015.
-
Mesh Size and Damped Edge Effects in Micromagnetic Spin Wave Simulation
Authors:
G. Venkat,
M. Franchin,
H. Fangohr,
A. Prabhakar
Abstract:
We have studied the dependence of spin wave dispersion on the characteristics of the mesh used in a finite element micromagnetic simulation. It is shown that the dispersion curve has a cut off at a frequency which is analytically predictable. The frequency depends on the average mesh length used for the simulation. Based on this, a recipe to effectively obtain the dispersion relation has been sugg…
▽ More
We have studied the dependence of spin wave dispersion on the characteristics of the mesh used in a finite element micromagnetic simulation. It is shown that the dispersion curve has a cut off at a frequency which is analytically predictable. The frequency depends on the average mesh length used for the simulation. Based on this, a recipe to effectively obtain the dispersion relation has been suggested. In a separate study, spin wave reflections are absorbed by introducing highly damped edges in the device. However, an abrupt change in the dam** parameter causes reflections. We compare dam** profiles and identify an exponential dam** profile as causing significantly less reflections.
△ Less
Submitted 25 May, 2014; v1 submitted 19 May, 2014;
originally announced May 2014.
-
High frequency permeability of magnonic metamaterials with magnetic inclusions of complex shape
Authors:
O. Dmytriiev,
M. Dvornik,
R. V. Mikhaylovskiy,
M. Franchin,
H. Fangohr,
L. Giovannini,
F. Montoncello,
D. V. Berkov,
E. K. Semenova,
N. L. Gorn,
A. Prabhakar,
V. V. Kruglyak
Abstract:
We present a method of calculation of the effective magnetic permeability of magnonic metamaterials containing periodically arranged magnetic inclusions of arbitrary shapes. The spectrum of spin wave modes confined in the inclusions is fully taken into account. Within the scope of the proposed method, we compare two approaches. The first approach is based on a simple semi-analytical theory that us…
▽ More
We present a method of calculation of the effective magnetic permeability of magnonic metamaterials containing periodically arranged magnetic inclusions of arbitrary shapes. The spectrum of spin wave modes confined in the inclusions is fully taken into account. Within the scope of the proposed method, we compare two approaches. The first approach is based on a simple semi-analytical theory that uses the numerically calculated susceptibility tensor of an isolated inclusion as input data. Within the second approach, micromagnetic packages with periodic boundary conditions (PBC) are used to calculate the susceptibility of a single 2D periodic array of such inclusions, with the whole 3D metamaterial consisting of a stack of such arrays. To calculate the susceptibility tensor of an isolated inclusion, we have implemented and compared two different methods: (a) a micromagnetic method, in which we have employed three different micromagnetic packages: the finite element package NMAG and the two finite differences packages OOMMF and MicroMagus; and (b) the modified dynamical matrix method. To illustrate the methodology, we have calculated the effective permeability of a metamaterial consisting of a stack of hexagonal arrays of magnetic nanodisks in a non-magnetic matrix. The range of geometrical parameters for which such a metamaterial is characterized by the negative permeability has been identified. The critical comparison of the different micromagnetic packages and the dynamical matrix method (based on the calculation of the susceptibility tensor of an isolated inclusion) has demonstrated that their results agree to within 3 %.
△ Less
Submitted 27 March, 2012;
originally announced March 2012.
-
Enhanced spin transfer torque effect for transverse domain walls in cylindrical nanowires
Authors:
Matteo Franchin,
Andreas Knittel,
Maximilian Albert,
Dmitri Chernyshenko,
Thomas Fischbacher,
Anil Prabhakar,
Hans Fangohr
Abstract:
Recent studies have predicted extraordinary properties for transverse domain walls in cylindrical nanowires: zero depinning current, the absence of the Walker breakdown, and applications as domain wall oscillators. In order to reliably control the domain wall motion, it is important to understand how they interact with energy barriers.
In this paper, we study the motion and depinning of transver…
▽ More
Recent studies have predicted extraordinary properties for transverse domain walls in cylindrical nanowires: zero depinning current, the absence of the Walker breakdown, and applications as domain wall oscillators. In order to reliably control the domain wall motion, it is important to understand how they interact with energy barriers.
In this paper, we study the motion and depinning of transverse domain walls through potential barriers in ferromagnetic cylindrical nanowires. We use magnetic fields and spin-polarized currents to drive the domain walls along the wire. Using magnetic fields, we find that the minimum and the maximum fields required to push the domain wall through the barrier differ by 30 %. On the contrary, using spin-polarized currents, we find variations of a factor 130 between the minimum value of the depinning current density and the maximum value.
We study the depinning current density as a function of the height of the energy barrier using numerical and analytical methods. We find that, for a barrier of 40 k_B T, a depinning current density of about 5 uA is sufficient to depin the domain wall.
We reveal and explain the mechanism that leads to these unusually low depinning currents. One requirement for this new depinning mechanism is for the domain wall to be able to rotate around its own axis. With the right barrier design, the spin torque transfer term is acting exactly against the dam** in the micromagnetic system, and thus the low current density is sufficient to accumulate enough energy quickly. These key insights may be crucial in furthering the development of novel memory technologies, such as the racetrack memory, that can be controlled through low current densities.
△ Less
Submitted 15 April, 2011;
originally announced April 2011.