-
Ultrafast Carrier Relaxation and Second Harmonic Generation in a Higher-Fold Weyl Fermionic System PtAl
Authors:
Vikas Saini,
A**kya Punjal,
Utkarsh Kumar Pandey,
Ruturaj Vikrant Puranik,
Vikash Sharma,
Vivek Dwij,
Kritika Vijay,
Ruta Kulkarni,
Soma Banik,
Aditya Dharmadhikari,
Bahadur Singh,
Shriganesh Prabhu,
A. Thamizhavel
Abstract:
In topological materials, shielding of bulk and surface states by crystalline symmetries has provided hitherto unknown access to electronic states in condensed matter physics. Interestingly, photo-excited carriers relax on an ultrafast timescale, demonstrating large transient mobility that could be harnessed for the development of ultrafast optoelectronic devices. In addition, these devices are mu…
▽ More
In topological materials, shielding of bulk and surface states by crystalline symmetries has provided hitherto unknown access to electronic states in condensed matter physics. Interestingly, photo-excited carriers relax on an ultrafast timescale, demonstrating large transient mobility that could be harnessed for the development of ultrafast optoelectronic devices. In addition, these devices are much more effective than topologically trivial systems because topological states are resilient to the corresponding symmetry-invariant perturbations. By using optical pump probe measurements, we systematically describe the relaxation dynamics of a topologically nontrivial chiral single crystal, PtAl. Based on the experimental data on transient reflectivity and electronic structures, it has been found that the carrier relaxation process involves both acoustic and optical phonons with oscillation frequencies of 0.06 and 2.94 THz, respectively, in picosecond time scale. PtAl with a space group of $P$$2_{1}$3 allows only one non-zero susceptibility element i.e. $d_{14}$, in second harmonic generation (SHG) with a large value of 468(1) pm/V, which is significantly higher than that observed in standard GaAs(111) and ZnTe(110) crystals. The intensity dependence of the SHG signal in PtAl reveals a non-perturbative origin. The present study on PtAl provides deeper insight into topological states which will be useful for ultrafast optoelectronic devices.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
Applied Deep Learning to Identify and Localize Polyps from Endoscopic Images
Authors:
Chandana Raju,
Sumedh Vilas Datar,
Kushala Hari,
Kavin Vijay,
Suma Ningappa
Abstract:
Deep learning based neural networks have gained popularity for a variety of biomedical imaging applications. In the last few years several works have shown the use of these methods for colon cancer detection and the early results have been promising. These methods can potentially be utilized to assist doctor's and may help in identifying the number of lesions or abnormalities in a diagnosis sessio…
▽ More
Deep learning based neural networks have gained popularity for a variety of biomedical imaging applications. In the last few years several works have shown the use of these methods for colon cancer detection and the early results have been promising. These methods can potentially be utilized to assist doctor's and may help in identifying the number of lesions or abnormalities in a diagnosis session. From our literature survey we found out that there is a lack of publicly available labeled data. Thus, as part of this work, we have aimed at open sourcing a dataset which contains annotations of polyps and ulcers. This is the first dataset that's coming from India containing polyp and ulcer images. The dataset can be used for detection and classification tasks. We also evaluated our dataset with several popular deep learning object detection models that's trained on large publicly available datasets and found out empirically that the model trained on one dataset works well on our dataset that has data being captured in a different acquisition device.
△ Less
Submitted 22 January, 2023;
originally announced January 2023.
-
Effect on the Electronic and Magnetic Properties of Antiferromagnetic Topological Insulator MnBi$_2$Te$_4$ with Sn Do**
Authors:
Susmita Changdar,
Susanta Ghosh,
Kritika Vijay,
Indrani Kar,
Sayan Routh,
P. K. Maheswari,
Soumya Ghorai,
Soma Banik,
S. Thirupathaiah
Abstract:
We thoroughly investigate the effect of nonmagnetic Sn do** on the electronic and magnetic properties of antiferromagnetic topological insulator MnBi$_2$Te$_4$. We observe that Sn do** reduces the out-of-plane antiferromagnetic (AFM) interactions in MnBi$_2$Te$_4$ up to 68\% of Sn concentration and above the system is found to be paramagnetic. In this way, the anomalous Hall effect observed at…
▽ More
We thoroughly investigate the effect of nonmagnetic Sn do** on the electronic and magnetic properties of antiferromagnetic topological insulator MnBi$_2$Te$_4$. We observe that Sn do** reduces the out-of-plane antiferromagnetic (AFM) interactions in MnBi$_2$Te$_4$ up to 68\% of Sn concentration and above the system is found to be paramagnetic. In this way, the anomalous Hall effect observed at a very high field of 7.8 T in MnBi$_2$Te$_4$ is reduced to 2 T with 68\% of Sn do**. Electrical transport measurements suggest that all compositions are metallic in nature, while the low-temperature resistivity is sensitive to the AFM ordering and to the do**-induced disorder. Hall effect study demonstrates that Sn actually dopes electrons into the system, thus, enhancing the electron carrier density almost by two orders at 68\% of Sn. In contrast, SnBi$_2$Te$_4$ is found to be a p-type system. Angle-resolved photoemission spectroscopy (ARPES) studies show that the topological properties are intact at least up to 55\% of Sn as the Dirac surface states are present in the valance band, but in SnBi$_2$Te$_4$ we are unable to detect the topological states due to heavy hole do**. Overall, Sn do** significantly affects the electronic and magnetic properties of MnBi$_2$Te$_4$.
△ Less
Submitted 27 July, 2022;
originally announced July 2022.
-
Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning
Authors:
Varun Kumar Vijay,
Hassam Sheikh,
Somdeb Majumdar,
Mariano Phielipp
Abstract:
Inter-agent communication can significantly increase performance in multi-agent tasks that require co-ordination to achieve a shared goal. Prior work has shown that it is possible to learn inter-agent communication protocols using multi-agent reinforcement learning and message-passing network architectures. However, these models use an unconstrained broadcast communication model, in which an agent…
▽ More
Inter-agent communication can significantly increase performance in multi-agent tasks that require co-ordination to achieve a shared goal. Prior work has shown that it is possible to learn inter-agent communication protocols using multi-agent reinforcement learning and message-passing network architectures. However, these models use an unconstrained broadcast communication model, in which an agent communicates with all other agents at every step, even when the task does not require it. In real-world applications, where communication may be limited by system constraints like bandwidth, power and network capacity, one might need to reduce the number of messages that are sent. In this work, we explore a simple method of minimizing communication while maximizing performance in multi-task learning: simultaneously optimizing a task-specific objective and a communication penalty. We show that the objectives can be optimized using Reinforce and the Gumbel-Softmax reparameterization. We introduce two techniques to stabilize training: 50% training and message forwarding. Training with the communication penalty on only 50% of the episodes prevents our models from turning off their outgoing messages. Second, repeating messages received previously helps models retain information, and further improves performance. With these techniques, we show that we can reduce communication by 75% with no loss of performance.
△ Less
Submitted 8 December, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Generalization to Novel Objects using Prior Relational Knowledge
Authors:
Varun Kumar Vijay,
Abhinav Ganesh,
Hanlin Tang,
Arjun Bansal
Abstract:
To solve tasks in new environments involving objects unseen during training, agents must reason over prior information about those objects and their relations. We introduce the Prior Knowledge Graph network, an architecture for combining prior information, structured as a knowledge graph, with a symbolic parsing of the visual scene, and demonstrate that this approach is able to apply learned relat…
▽ More
To solve tasks in new environments involving objects unseen during training, agents must reason over prior information about those objects and their relations. We introduce the Prior Knowledge Graph network, an architecture for combining prior information, structured as a knowledge graph, with a symbolic parsing of the visual scene, and demonstrate that this approach is able to apply learned relations to novel objects whereas the baseline algorithms fail. Ablation experiments show that the agents ground the knowledge graph relations to semantically-relevant behaviors. In both a Sokoban game and the more complex Pacman environment, our network is also more sample efficient than the baselines, reaching the same performance in 5-10x fewer episodes. Once the agents are trained with our approach, we can manipulate agent behavior by modifying the knowledge graph in semantically meaningful ways. These results suggest that our network provides a framework for agents to reason over structured knowledge graphs while still leveraging gradient based learning approaches.
△ Less
Submitted 20 September, 2019; v1 submitted 26 June, 2019;
originally announced June 2019.
-
Design of a Multi-Modal End-Effector and Gras** System: How Integrated Design helped win the Amazon Robotics Challenge
Authors:
S. Wade-McCue,
N. Kelly-Boxall,
M. McTaggart,
D. Morrison,
A. W. Tow,
J. Erskine,
R. Grinover,
A. Gurman,
T. Hunn,
D. Lee,
A. Milan,
T. Pham,
G. Rallos,
A. Razjigaev,
T. Rowntree,
R. Smith,
K. Vijay,
Z. Zhuang,
C. Lehnert,
I. Reid,
P. Corke,
J. Leitner
Abstract:
We present the gras** system and design approach behind Cartman, the winning entrant in the 2017 Amazon Robotics Challenge. We investigate the design processes leading up to the final iteration of the system and describe the emergent solution by comparing it with key robotics design aspects. Following our experience, we propose a new design aspect, precision vs. redundancy, that should be consid…
▽ More
We present the gras** system and design approach behind Cartman, the winning entrant in the 2017 Amazon Robotics Challenge. We investigate the design processes leading up to the final iteration of the system and describe the emergent solution by comparing it with key robotics design aspects. Following our experience, we propose a new design aspect, precision vs. redundancy, that should be considered alongside the previously proposed design aspects of modularity vs. integration, generality vs. assumptions, computation vs. embodiment and planning vs. feedback. We present the gras** system behind Cartman, the winning robot in the 2017 Amazon Robotics Challenge. The system makes strong use of redundancy in design by implementing complimentary tools, a suction gripper and a parallel gripper. This multi-modal end-effector is combined with three grasp synthesis algorithms to accommodate the range of objects provided by Amazon during the challenge. We provide a detailed system description and an evaluation of its performance before discussing the broader nature of the system with respect to the key aspects of robotic design as initially proposed by the winners of the first Amazon Picking Challenge. To address the principal nature of our gras** system and the reason for its success, we propose an additional robotic design aspect `precision vs. redundancy'. The full design of our robotic system, including the end-effector, is open sourced and available at http://juxi.net/projects/AmazonRoboticsChallenge/
△ Less
Submitted 19 June, 2018; v1 submitted 3 October, 2017;
originally announced October 2017.
-
Semantic Segmentation from Limited Training Data
Authors:
A. Milan,
T. Pham,
K. Vijay,
D. Morrison,
A. W. Tow,
L. Liu,
J. Erskine,
R. Grinover,
A. Gurman,
T. Hunn,
N. Kelly-Boxall,
D. Lee,
M. McTaggart,
G. Rallos,
A. Razjigaev,
T. Rowntree,
T. Shen,
R. Smith,
S. Wade-McCue,
Z. Zhuang,
C. Lehnert,
G. Lin,
I. Reid,
P. Corke,
J. Leitner
Abstract:
We present our approach for robotic perception in cluttered scenes that led to winning the recent Amazon Robotics Challenge (ARC) 2017. Next to small objects with shiny and transparent surfaces, the biggest challenge of the 2017 competition was the introduction of unseen categories. In contrast to traditional approaches which require large collections of annotated data and many hours of training,…
▽ More
We present our approach for robotic perception in cluttered scenes that led to winning the recent Amazon Robotics Challenge (ARC) 2017. Next to small objects with shiny and transparent surfaces, the biggest challenge of the 2017 competition was the introduction of unseen categories. In contrast to traditional approaches which require large collections of annotated data and many hours of training, the task here was to obtain a robust perception pipeline with only few minutes of data acquisition and training time. To that end, we present two strategies that we explored. One is a deep metric learning approach that works in three separate steps: semantic-agnostic boundary detection, patch classification and pixel-wise voting. The other is a fully-supervised semantic segmentation approach with efficient dataset collection. We conduct an extensive analysis of the two methods on our ARC 2017 dataset. Interestingly, only few examples of each class are sufficient to fine-tune even very deep convolutional neural networks for this specific task.
△ Less
Submitted 22 September, 2017;
originally announced September 2017.
-
Cartman: The low-cost Cartesian Manipulator that won the Amazon Robotics Challenge
Authors:
D. Morrison,
A. W. Tow,
M. McTaggart,
R. Smith,
N. Kelly-Boxall,
S. Wade-McCue,
J. Erskine,
R. Grinover,
A. Gurman,
T. Hunn,
D. Lee,
A. Milan,
T. Pham,
G. Rallos,
A. Razjigaev,
T. Rowntree,
K. Vijay,
Z. Zhuang,
C. Lehnert,
I. Reid,
P. Corke,
J. Leitner
Abstract:
The Amazon Robotics Challenge enlisted sixteen teams to each design a pick-and-place robot for autonomous warehousing, addressing development in robotic vision and manipulation. This paper presents the design of our custom-built, cost-effective, Cartesian robot system Cartman, which won first place in the competition finals by stowing 14 (out of 16) and picking all 9 items in 27 minutes, scoring a…
▽ More
The Amazon Robotics Challenge enlisted sixteen teams to each design a pick-and-place robot for autonomous warehousing, addressing development in robotic vision and manipulation. This paper presents the design of our custom-built, cost-effective, Cartesian robot system Cartman, which won first place in the competition finals by stowing 14 (out of 16) and picking all 9 items in 27 minutes, scoring a total of 272 points. We highlight our experience-centred design methodology and key aspects of our system that contributed to our competitiveness. We believe these aspects are crucial to building robust and effective robotic systems.
△ Less
Submitted 25 February, 2018; v1 submitted 19 September, 2017;
originally announced September 2017.
-
Adaptive dictionary based approach for background noise and speaker classification and subsequent source separation
Authors:
K V Vijay Girish,
A G Ramakrishnan,
T V Ananthapadmanabha
Abstract:
A judicious combination of dictionary learning methods, block sparsity and source recovery algorithm are used in a hierarchical manner to identify the noises and the speakers from a noisy conversation between two people. Conversations are simulated using speech from two speakers, each with a different background noise, with varied SNR values, down to -10 dB. Ten each of randomly chosen male and fe…
▽ More
A judicious combination of dictionary learning methods, block sparsity and source recovery algorithm are used in a hierarchical manner to identify the noises and the speakers from a noisy conversation between two people. Conversations are simulated using speech from two speakers, each with a different background noise, with varied SNR values, down to -10 dB. Ten each of randomly chosen male and female speakers from the TIMIT database and all the noise sources from the NOISEX database are used for the simulations. For speaker identification, the relative value of weights recovered is used to select an appropriately small subset of the test data, assumed to contain speech. This novel choice of using varied amounts of test data results in an improvement in the speaker recognition rate of around 15% at SNR of 0 dB. Speech and noise are separated using dictionaries of the estimated speaker and noise, and an improvement of signal to distortion ratios of up to 10% is achieved at SNR of 0 dB. K-medoid and cosine similarity based dictionary learning methods lead to better recognition of the background noise and the speaker. Experiments are also conducted on cases, where either the background noise or the speaker is outside the set of trained dictionaries. In such cases, adaptive dictionary learning leads to performance comparable to the other case of complete dictionaries.
△ Less
Submitted 28 October, 2016; v1 submitted 30 September, 2016;
originally announced September 2016.
-
A dictionary learning and source recovery based approach to classify diverse audio sources
Authors:
K V Vijay Girish,
T V Ananthapadmanabha,
A G Ramakrishnan
Abstract:
A dictionary learning based audio source classification algorithm is proposed to classify a sample audio signal as one amongst a finite set of different audio sources. Cosine similarity measure is used to select the atoms during dictionary learning. Based on three objective measures proposed, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights, a frame-w…
▽ More
A dictionary learning based audio source classification algorithm is proposed to classify a sample audio signal as one amongst a finite set of different audio sources. Cosine similarity measure is used to select the atoms during dictionary learning. Based on three objective measures proposed, namely, signal to distortion ratio (SDR), the number of non-zero weights and the sum of weights, a frame-wise source classification accuracy of 98.2% is obtained for twelve different sources. Cent percent accuracy has been obtained using moving SDR accumulated over six successive frames for ten of the audio sources tested, while the two other sources require accumulation of 10 and 14 frames.
△ Less
Submitted 27 October, 2015;
originally announced October 2015.
-
Detection of transitions between broad phonetic classes in a speech signal
Authors:
T V Ananthapadmanabha,
K V Vijay Girish,
A G Ramakrishnan
Abstract:
Detection of transitions between broad phonetic classes in a speech signal is an important problem which has applications such as landmark detection and segmentation. The proposed hierarchical method detects silence to non-silence transitions, high amplitude (mostly sonorants) to low ampli- tude (mostly fricatives/affricates/stop bursts) transitions and vice-versa. A subset of the extremum (minimu…
▽ More
Detection of transitions between broad phonetic classes in a speech signal is an important problem which has applications such as landmark detection and segmentation. The proposed hierarchical method detects silence to non-silence transitions, high amplitude (mostly sonorants) to low ampli- tude (mostly fricatives/affricates/stop bursts) transitions and vice-versa. A subset of the extremum (minimum or maximum) samples between every pair of successive zero-crossings is selected above a second pass threshold, from each bandpass filtered speech signal frame. Relative to the mid-point (reference) of a frame, locations of the first and the last extrema lie on either side, if the speech signal belongs to a homogeneous segment; else, both these locations lie on the left or the right side of the reference, indicating a transition frame. When tested on the entire TIMIT database, of the transitions detected, 93.6% are within a tolerance of 20 ms from the hand labeled boundaries. Sonorant, unvoiced non-sonorant and silence classes and their respective onsets are detected with an accuracy of about 83.5% for the same tolerance. The results are as good as, and in some respects better than the state-of-the-art methods for similar tasks.
△ Less
Submitted 3 November, 2014;
originally announced November 2014.