-
Derandomized Non-Abelian Homomorphism Testing in Low Soundness Regime
Authors:
Tushant Mittal,
Sourya Roy
Abstract:
We give a randomness-efficient homomorphism test in the low soundness regime for functions, $f: G\to \mathbb{U}_t$, from an arbitrary finite group $G$ to $t\times t$ unitary matrices. We show that if such a function passes a derandomized Blum--Luby--Rubinfeld (BLR) test (using small-bias sets), then (i) it correlates with a function arising from a genuine homomorphism, and (ii) it has a non-trivia…
▽ More
We give a randomness-efficient homomorphism test in the low soundness regime for functions, $f: G\to \mathbb{U}_t$, from an arbitrary finite group $G$ to $t\times t$ unitary matrices. We show that if such a function passes a derandomized Blum--Luby--Rubinfeld (BLR) test (using small-bias sets), then (i) it correlates with a function arising from a genuine homomorphism, and (ii) it has a non-trivial Fourier mass on a low-dimensional irreducible representation.
In the full randomness regime, such a test for matrix-valued functions on finite groups implicitly appears in the works of Gowers and Hatami [Sbornik: Mathematics '17], and Moore and Russell [SIAM Journal on Discrete Mathematics '15]. Thus, our work can be seen as a near-optimal derandomization of their results. Our key technical contribution is a "degree-2 expander mixing lemma'' that shows that Gowers' $\mathrm{U}^2$ norm can be efficiently estimated by restricting it to a small-bias subset. Another corollary is a "derandomized'' version of a useful lemma due to Babai, Nikolov, and Pyber [SODA'08].
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Quantum-centric Supercomputing for Materials Science: A Perspective on Challenges and Future Directions
Authors:
Yuri Alexeev,
Maximilian Amsler,
Paul Baity,
Marco Antonio Barroca,
Sanzio Bassini,
Torey Battelle,
Daan Camps,
David Casanova,
Young jai Choi,
Frederic T. Chong,
Charles Chung,
Chris Codella,
Antonio D. Corcoles,
James Cruise,
Alberto Di Meglio,
Jonathan Dubois,
Ivan Duran,
Thomas Eckl,
Sophia Economou,
Stephan Eidenbenz,
Bruce Elmegreen,
Clyde Fare,
Ismael Faro,
Cristina Sanz Fernández,
Rodrigo Neumann Barros Ferreira
, et al. (102 additional authors not shown)
Abstract:
Computational models are an essential tool for the design, characterization, and discovery of novel materials. Hard computational tasks in materials science stretch the limits of existing high-performance supercomputing centers, consuming much of their simulation, analysis, and data resources. Quantum computing, on the other hand, is an emerging technology with the potential to accelerate many of…
▽ More
Computational models are an essential tool for the design, characterization, and discovery of novel materials. Hard computational tasks in materials science stretch the limits of existing high-performance supercomputing centers, consuming much of their simulation, analysis, and data resources. Quantum computing, on the other hand, is an emerging technology with the potential to accelerate many of the computational tasks needed for materials science. In order to do that, the quantum technology must interact with conventional high-performance computing in several ways: approximate results validation, identification of hard problems, and synergies in quantum-centric supercomputing. In this paper, we provide a perspective on how quantum-centric supercomputing can help address critical computational problems in materials science, the challenges to face in order to solve representative use cases, and new suggested directions.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
NFResNet: Multi-scale and U-shaped Networks for Deblurring
Authors:
Tanish Mittal,
Preyansh Agrawal,
Esha Pahwa,
Aarya Makwana
Abstract:
Multi-Scale and U-shaped Networks are widely used in various image restoration problems, including deblurring. Kee** in mind the wide range of applications, we present a comparison of these architectures and their effects on image deblurring. We also introduce a new block called as NFResblock. It consists of a Fast Fourier Transformation layer and a series of modified Non-Linear Activation Free…
▽ More
Multi-Scale and U-shaped Networks are widely used in various image restoration problems, including deblurring. Kee** in mind the wide range of applications, we present a comparison of these architectures and their effects on image deblurring. We also introduce a new block called as NFResblock. It consists of a Fast Fourier Transformation layer and a series of modified Non-Linear Activation Free Blocks. Based on these architectures and additions, we introduce NFResnet and NFResnet+, which are modified multi-scale and U-Net architectures, respectively. We also use three different loss functions to train these architectures: Charbonnier Loss, Edge Loss, and Frequency Reconstruction Loss. Extensive experiments on the Deep Video Deblurring dataset, along with ablation studies for each component, have been presented in this paper. The proposed architectures achieve a considerable increase in Peak Signal to Noise (PSNR) ratio and Structural Similarity Index (SSIM) value.
△ Less
Submitted 12 December, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Naturalistic Head Motion Generation from Speech
Authors:
Trisha Mittal,
Zakaria Aldeneh,
Masha Fedzechkina,
Anurag Ranjan,
Barry-John Theobald
Abstract:
Synthesizing natural head motion to accompany speech for an embodied conversational agent is necessary for providing a rich interactive experience. Most prior works assess the quality of generated head motion by comparing them against a single ground-truth using an objective metric. Yet there are many plausible head motion sequences to accompany a speech utterance. In this work, we study the varia…
▽ More
Synthesizing natural head motion to accompany speech for an embodied conversational agent is necessary for providing a rich interactive experience. Most prior works assess the quality of generated head motion by comparing them against a single ground-truth using an objective metric. Yet there are many plausible head motion sequences to accompany a speech utterance. In this work, we study the variation in the perceptual quality of head motions sampled from a generative model. We show that, despite providing more diverse head motions, the generative model produces motions with varying degrees of perceptual quality. We finally show that objective metrics commonly used in previous research do not accurately reflect the perceptual quality of generated head motions. These results open an interesting avenue for future work to investigate better objective metrics that correlate with human perception of quality.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
Almost Ramanujan Expanders from Arbitrary Expanders via Operator Amplification
Authors:
Fernando Granha Jeronimo,
Tushant Mittal,
Sourya Roy,
Avi Wigderson
Abstract:
We give an efficient algorithm that transforms any bounded degree expander graph into another that achieves almost optimal (namely, near-quadratic, $d \leq 1/λ^{2+o(1)}$) trade-off between (any desired) spectral expansion $λ$ and degree $d$. Furthermore, the algorithm is local: every vertex can compute its new neighbors as a subset of its original neighborhood of radius $O(\log(1/λ))$. The optimal…
▽ More
We give an efficient algorithm that transforms any bounded degree expander graph into another that achieves almost optimal (namely, near-quadratic, $d \leq 1/λ^{2+o(1)}$) trade-off between (any desired) spectral expansion $λ$ and degree $d$. Furthermore, the algorithm is local: every vertex can compute its new neighbors as a subset of its original neighborhood of radius $O(\log(1/λ))$. The optimal quadratic trade-off is known as the Ramanujan bound, so our construction gives almost Ramanujan expanders from arbitrary expanders.
The locality of the transformation preserves structural properties of the original graph, and thus has many consequences. Applied to Cayley graphs, our transformation shows that any expanding finite group has almost Ramanujan expanding generators. Similarly, one can obtain almost optimal explicit constructions of quantum expanders, dimension expanders, monotone expanders, etc., from existing (suboptimal) constructions of such objects. Another consequence is a "derandomized" random walk on the original (suboptimal) expander with almost optimal convergence rate. Our transformation also applies when the degree is not bounded or the expansion is not constant.
We obtain our results by a generalization of Ta-Shma's technique in his breakthrough paper [STOC 2017], used to obtain explicit almost optimal binary codes. Specifically, our spectral amplification extends Ta-Shma's analysis of bias amplification from scalars to matrices of arbitrary dimension in a very natural way. Curiously, while Ta-Shma's explicit bias amplification derandomizes a well-known probabilistic argument (underlying the Gilbert--Varshamov bound), there seems to be no known probabilistic (or other existential) way of achieving our explicit ("high-dimensional") spectral amplification.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
Video Manipulations Beyond Faces: A Dataset with Human-Machine Analysis
Authors:
Trisha Mittal,
Ritwik Sinha,
Viswanathan Swaminathan,
John Collomosse,
Dinesh Manocha
Abstract:
As tools for content editing mature, and artificial intelligence (AI) based algorithms for synthesizing media grow, the presence of manipulated content across online media is increasing. This phenomenon causes the spread of misinformation, creating a greater need to distinguish between ``real'' and ``manipulated'' content. To this end, we present VideoSham, a dataset consisting of 826 videos (413…
▽ More
As tools for content editing mature, and artificial intelligence (AI) based algorithms for synthesizing media grow, the presence of manipulated content across online media is increasing. This phenomenon causes the spread of misinformation, creating a greater need to distinguish between ``real'' and ``manipulated'' content. To this end, we present VideoSham, a dataset consisting of 826 videos (413 real and 413 manipulated). Many of the existing deepfake datasets focus exclusively on two types of facial manipulations -- swap** with a different subject's face or altering the existing face. VideoSham, on the other hand, contains more diverse, context-rich, and human-centric, high-resolution videos manipulated using a combination of 6 different spatial and temporal attacks. Our analysis shows that state-of-the-art manipulation detection algorithms only work for a few specific attacks and do not scale well on VideoSham. We performed a user study on Amazon Mechanical Turk with 1200 participants to understand if they can differentiate between the real and manipulated videos in VideoSham. Finally, we dig deeper into the strengths and weaknesses of performances by humans and SOTA-algorithms to identify gaps that need to be filled with better AI algorithms. We present the dataset at https://github.com/adobe-research/VideoSham-dataset.
△ Less
Submitted 7 December, 2022; v1 submitted 26 July, 2022;
originally announced July 2022.
-
Estimating Emotion Contagion on Social Media via Localized Diffusion in Dynamic Graphs
Authors:
Trisha Mittal,
Puneet Mathur,
Rohan Chandra,
Apurva Bhatt,
Vikram Gupta,
Debdoot Mukherjee,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a computational approach for estimating emotion contagion on social media networks. Built on a foundation of psychology literature, our approach estimates the degree to which the perceivers' emotional states (positive or negative) start to match those of the expressors, based on the latter's content. We use a combination of deep learning and social network analysis to model emotion cont…
▽ More
We present a computational approach for estimating emotion contagion on social media networks. Built on a foundation of psychology literature, our approach estimates the degree to which the perceivers' emotional states (positive or negative) start to match those of the expressors, based on the latter's content. We use a combination of deep learning and social network analysis to model emotion contagion as a diffusion process in dynamic social network graphs, taking into consideration key aspects like causality, homophily, and interference. We evaluate our approach on user behavior data obtained from a popular social media platform for sharing short videos. We analyze the behavior of 48 users over a span of 8 weeks (over 200k audio-visual short posts analyzed) and estimate how contagious the users with whom they engage with are on social media. As per the theory of diffusion, we account for the videos a user watches during this time (inflow) and the daily engagements; liking, sharing, downloading or creating new videos (outflow) to estimate contagion. To validate our approach and analysis, we obtain human feedback on these 48 social media platform users with an online study by collecting responses of about 150 participants. We report users who interact with more number of creators on the platform are 12% less prone to contagion, and those who consume more content of `negative' sentiment are 23% more prone to contagion. We will publicly release our code upon acceptance.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Dynamics or Geysers and tracer transport over the south pole of Enceladus
Authors:
Wanying Kang,
John Marshall,
Tushar Mittal,
Suyash Bire
Abstract:
Over the south pole of Enceladus, an icy moon of Saturn, geysers eject water into space in a striped pattern, making Enceladus one of the most attractive destinations in the search for extraterrestrial life. We explore the ocean dynamics and tracer/heat transport associated with geysers as a function of the assumed salinity of the ocean and various core-shell heat partitions and bottom heating pat…
▽ More
Over the south pole of Enceladus, an icy moon of Saturn, geysers eject water into space in a striped pattern, making Enceladus one of the most attractive destinations in the search for extraterrestrial life. We explore the ocean dynamics and tracer/heat transport associated with geysers as a function of the assumed salinity of the ocean and various core-shell heat partitions and bottom heating patterns. We find that, even if heating is concentrated into a narrow band on the seafloor directly beneath the south pole, the warm fluid becomes quickly mixed with its surroundings due to baroclinic instability. The warming signal beneath the ice is diffuse and insufficient to prevent the geyser from freezing over. Instead, if heating is assumed to be local to the geyser, emanating from tidal dissipation in the ice itself, the geyser can be sustained. In this case, the upper ocean beneath the ice becomes stably stratified and thus a barrier to vertical communication, leading to transit timescales from the core to the ice shell of hundreds of years.
△ Less
Submitted 31 May, 2022;
originally announced May 2022.
-
Trends in Silicates in the $β$ Pictoris Disk
Authors:
Cicero X. Lu,
Christine H. Chen,
B. A. Sargent,
Dan M. Watson,
Carey M. Lisse,
Joel D. Green,
Michael L. Sitko,
Tushar Mittal,
V. Lebouteiller,
G. C. Sloan,
Isabel Rebollido,
Dean C. Hines,
Julien H. Girard,
Michael W. Werner,
Karl R. Stapelfeldt,
Winston Wu,
Kadin Worthen
Abstract:
While beta Pic is known to host silicates in ring-like structures, whether the properties of these silicate dust vary with stellocentric distance remains an open question. We re-analyze the beta Pictoris debris disk spectrum from the Spitzer Infrared Spectrograph (IRS) and a new IRTF/SpeX spectrum to investigate trends in Fe/Mg ratio, shape, and crystallinity in grains as a function of wavelength,…
▽ More
While beta Pic is known to host silicates in ring-like structures, whether the properties of these silicate dust vary with stellocentric distance remains an open question. We re-analyze the beta Pictoris debris disk spectrum from the Spitzer Infrared Spectrograph (IRS) and a new IRTF/SpeX spectrum to investigate trends in Fe/Mg ratio, shape, and crystallinity in grains as a function of wavelength, a proxy for stellocentric distance. By analyzing a re-calibrated and re-extracted spectrum, we identify a new 18 micron forsterite emission feature and recover a 23 micron forsterite emission feature with a substantially larger line-to-continuum ratio than previously reported. We find that these prominent spectral features are primarily produced by small submicron-sized grains, which are continuously generated and replenished from planetesimal collisions in the disk and can elucidate their parent bodies' composition. We discover three trends about these small grains: as stellocentric distance increases, (1) small silicate grains become more crystalline (less amorphous), (2) they become more irregular in shape, and (3) for crystalline silicate grains, the Fe/Mg ratio decreases. Applying these trends to beta Pic's planetary architecture, we find that the dust population exterior to the orbits of beta Pic b and c differs substantially in crystallinity and shape. We also find a tentative 3-5 micron dust excess due to spatially unresolved hot dust emission close to the star. From our findings, we infer that the surfaces of large planetesimals are more Fe-rich and collisionally-processed closer to the star but more Fe-poor and primordial farther from the star.
△ Less
Submitted 18 May, 2022;
originally announced May 2022.
-
3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos
Authors:
Vikram Gupta,
Trisha Mittal,
Puneet Mathur,
Vaibhav Mishra,
Mayank Maheshwari,
Aniket Bera,
Debdoot Mukherjee,
Dinesh Manocha
Abstract:
We present 3MASSIV, a multilingual, multimodal and multi-aspect, expertly-annotated dataset of diverse short videos extracted from short-video social media platform - Moj. 3MASSIV comprises of 50k short videos (20 seconds average duration) and 100K unlabeled videos in 11 different languages and captures popular short video trends like pranks, fails, romance, comedy expressed via unique audio-visua…
▽ More
We present 3MASSIV, a multilingual, multimodal and multi-aspect, expertly-annotated dataset of diverse short videos extracted from short-video social media platform - Moj. 3MASSIV comprises of 50k short videos (20 seconds average duration) and 100K unlabeled videos in 11 different languages and captures popular short video trends like pranks, fails, romance, comedy expressed via unique audio-visual formats like self-shot videos, reaction videos, lip-synching, self-sung songs, etc. 3MASSIV presents an opportunity for multimodal and multilingual semantic understanding on these unique videos by annotating them for concepts, affective states, media types, and audio language. We present a thorough analysis of 3MASSIV and highlight the variety and unique aspects of our dataset compared to other contemporary popular datasets with strong baselines. We also show how the social media content in 3MASSIV is dynamic and temporal in nature, which can be used for semantic understanding tasks and cross-lingual analysis.
△ Less
Submitted 27 March, 2022;
originally announced March 2022.
-
Explicit Abelian Lifts and Quantum LDPC Codes
Authors:
Fernando Granha Jeronimo,
Tushant Mittal,
Ryan O'Donnell,
Pedro Paredes,
Madhur Tulsiani
Abstract:
For an abelian group $H$ acting on the set $[\ell]$, an $(H,\ell)$-lift of a graph $G_0$ is a graph obtained by replacing each vertex by $\ell$ copies, and each edge by a matching corresponding to the action of an element of $H$.
In this work, we show the following explicit constructions of expanders obtained via abelian lifts. For every (transitive) abelian group $H \leqslant \text{Sym}(\ell)$,…
▽ More
For an abelian group $H$ acting on the set $[\ell]$, an $(H,\ell)$-lift of a graph $G_0$ is a graph obtained by replacing each vertex by $\ell$ copies, and each edge by a matching corresponding to the action of an element of $H$.
In this work, we show the following explicit constructions of expanders obtained via abelian lifts. For every (transitive) abelian group $H \leqslant \text{Sym}(\ell)$, constant degree $d \ge 3$ and $ε> 0$, we construct explicit $d$-regular expander graphs $G$ obtained from an $(H,\ell)$-lift of a (suitable) base $n$-vertex expander $G_0$ with the following parameters:
(i) $λ(G) \le 2\sqrt{d-1} + ε$, for any lift size $\ell \le 2^{n^δ}$ where $δ=δ(d,ε)$,
(ii) $λ(G) \le ε\cdot d$, for any lift size $\ell \le 2^{n^{δ_0}}$ for a fixed $δ_0 > 0$, when $d \ge d_0(ε)$, or
(iii) $λ(G) \le \widetilde{O}(\sqrt{d})$, for lift size ``exactly'' $\ell = 2^{Θ(n)}$.
As corollaries, we obtain explicit quantum lifted product codes of Panteleev and Kalachev of almost linear distance (and also in a wide range of parameters) and explicit classical quasi-cyclic LDPC codes with wide range of circulant sizes.
Items $(i)$ and $(ii)$ above are obtained by extending the techniques of Mohanty, O'Donnell and Paredes [STOC 2020] for $2$-lifts to much larger abelian lift sizes (as a byproduct simplifying their construction). This is done by providing a new encoding of special walks arising in the trace power method, carefully "compressing'" depth-first search traversals. Result $(iii)$ is via a simpler proof of Agarwal et al. [SIAM J. Discrete Math 2019] at the expense of polylog factors in the expansion.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
Eformer: Edge Enhancement based Transformer for Medical Image Denoising
Authors:
Achleshwar Luthra,
Harsh Sulakhe,
Tanish Mittal,
Abhishek Iyer,
Santosh Yadav
Abstract:
In this work, we present Eformer - Edge enhancement based transformer, a novel architecture that builds an encoder-decoder network using transformer blocks for medical image denoising. Non-overlap** window-based self-attention is used in the transformer block that reduces computational requirements. This work further incorporates learnable Sobel-Feldman operators to enhance edges in the image an…
▽ More
In this work, we present Eformer - Edge enhancement based transformer, a novel architecture that builds an encoder-decoder network using transformer blocks for medical image denoising. Non-overlap** window-based self-attention is used in the transformer block that reduces computational requirements. This work further incorporates learnable Sobel-Feldman operators to enhance edges in the image and propose an effective way to concatenate them in the intermediate layers of our architecture. The experimental analysis is conducted by comparing deterministic learning and residual learning for the task of medical image denoising. To defend the effectiveness of our approach, our model is evaluated on the AAPM-Mayo Clinic Low-Dose CT Grand Challenge Dataset and achieves state-of-the-art performance, $i.e.$, 43.487 PSNR, 0.0067 RMSE, and 0.9861 SSIM. We believe that our work will encourage more research in transformer-based architectures for medical image denoising using residual learning.
△ Less
Submitted 9 November, 2021; v1 submitted 16 September, 2021;
originally announced September 2021.
-
Symbolic determinant identity testing and non-commutative ranks of matrix Lie algebras
Authors:
Gábor Ivanyos,
Tushant Mittal,
Youming Qiao
Abstract:
One approach to make progress on the symbolic determinant identity testing (SDIT) problem is to study the structure of singular matrix spaces. After settling the non-commutative rank problem (Garg-Gurvits-Oliveira-Wigderson, Found. Comput. Math. 2020; Ivanyos-Qiao-Subrahmanyam, Comput. Complex. 2018), a natural next step is to understand singular matrix spaces whose non-commutative rank is full. A…
▽ More
One approach to make progress on the symbolic determinant identity testing (SDIT) problem is to study the structure of singular matrix spaces. After settling the non-commutative rank problem (Garg-Gurvits-Oliveira-Wigderson, Found. Comput. Math. 2020; Ivanyos-Qiao-Subrahmanyam, Comput. Complex. 2018), a natural next step is to understand singular matrix spaces whose non-commutative rank is full. At present, examples of such matrix spaces are mostly sporadic, so it is desirable to discover them in a more systematic way.
In this paper, we make a step towards this direction, by studying the family of matrix spaces that are closed under the commutator operation, that is matrix Lie algebras. On the one hand, we demonstrate that matrix Lie algebras over the complex number field give rise to singular matrix spaces with full non-commutative ranks. On the other hand, we show that SDIT of such spaces can be decided in deterministic polynomial time. Moreover, we give a characterization for the matrix Lie algebras to yield a matrix space possessing singularity certificates as studied by Lov'asz (B. Braz. Math. Soc., 1989) and Raz and Wigderson (Building Bridges II, 2019).
△ Less
Submitted 4 December, 2021; v1 submitted 13 September, 2021;
originally announced September 2021.
-
How does salinity shape ocean circulation and ice geometry on Enceladus and other icy satellites?
Authors:
Wanying Kang,
Tushar Mittal,
Suyash Bire,
Jean-Michel Campin,
John Marshall
Abstract:
Of profound astrobiological interest, Enceladus appears to have a global subsurface ocean that is salty, indicating water-rock reaction at present or in the past, important for its habitability. Here, we investigate how salinity and the partition of heat production between the silicate core and the ice shell affect ocean dynamics and the associated heat transport -- a key factor that determines th…
▽ More
Of profound astrobiological interest, Enceladus appears to have a global subsurface ocean that is salty, indicating water-rock reaction at present or in the past, important for its habitability. Here, we investigate how salinity and the partition of heat production between the silicate core and the ice shell affect ocean dynamics and the associated heat transport -- a key factor that determines the equilibrium ice shell geometry. Assuming steady state conditions, we show that the meridional overturning circulation of the ocean, driven by heat and salt exchange with the ice, has opposing signs at very low and very high salinities. Regardless of these differing circulations, heat and freshwater converge towards the equator, where the ice is thick, acting to homogenize thickness variations. In order to maintain the observed ice thickness variation, the polar-amplified ice dissipation needs to be strong enough and ocean heat convergence cannot overwhelm well-constrained heat loss rates through the thick equatorial ice sheet. This requirement is found violated if the main heat source is in the core rather than the ice shell, or if the ocean is very fresh or very salty. Instead, with a salinity of intermediate range, the temperature- and salinity-induced density gradient largely cancel one another, leading to much reduced overturning and equatorial heat convergence rates and consistent budgets in appearance of a significant ice dissipation.
△ Less
Submitted 12 May, 2022; v1 submitted 14 April, 2021;
originally announced April 2021.
-
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality
Authors:
Trisha Mittal,
Puneet Mathur,
Aniket Bera,
Dinesh Manocha
Abstract:
We present Affect2MM, a learning method for time-series emotion prediction for multimedia content. Our goal is to automatically capture the varying emotions depicted by characters in real-life human-centric situations and behaviors. We use the ideas from emotion causation theories to computationally model and determine the emotional state evoked in clips of movies. Affect2MM explicitly models the…
▽ More
We present Affect2MM, a learning method for time-series emotion prediction for multimedia content. Our goal is to automatically capture the varying emotions depicted by characters in real-life human-centric situations and behaviors. We use the ideas from emotion causation theories to computationally model and determine the emotional state evoked in clips of movies. Affect2MM explicitly models the temporal causality using attention-based methods and Granger causality. We use a variety of components like facial features of actors involved, scene understanding, visual aesthetics, action/situation description, and movie script to obtain an affective-rich representation to understand and perceive the scene. We use an LSTM-based learning model for emotion perception. To evaluate our method, we analyze and compare our performance on three datasets, SENDv1, MovieGraphs, and the LIRIS-ACCEDE dataset, and observe an average of 10-15% increase in the performance over SOTA methods for all three datasets.
△ Less
Submitted 11 March, 2021;
originally announced March 2021.
-
Dynamic Graph Modeling of Simultaneous EEG and Eye-tracking Data for Reading Task Identification
Authors:
Puneet Mathur,
Trisha Mittal,
Dinesh Manocha
Abstract:
We present a new approach, that we call AdaGTCN, for identifying human reader intent from Electroencephalogram~(EEG) and Eye movement~(EM) data in order to help differentiate between normal reading and task-oriented reading. Understanding the physiological aspects of the reading process~(the cognitive load and the reading intent) can help improve the quality of crowd-sourced annotated data. Our me…
▽ More
We present a new approach, that we call AdaGTCN, for identifying human reader intent from Electroencephalogram~(EEG) and Eye movement~(EM) data in order to help differentiate between normal reading and task-oriented reading. Understanding the physiological aspects of the reading process~(the cognitive load and the reading intent) can help improve the quality of crowd-sourced annotated data. Our method, Adaptive Graph Temporal Convolution Network (AdaGTCN), uses an Adaptive Graph Learning Layer and Deep Neighborhood Graph Convolution Layer for identifying the reading activities using time-locked EEG sequences recorded during word-level eye-movement fixations. Adaptive Graph Learning Layer dynamically learns the spatial correlations between the EEG electrode signals while the Deep Neighborhood Graph Convolution Layer exploits temporal features from a dense graph neighborhood to establish the state of the art in reading task identification over other contemporary approaches. We compare our approach with several baselines to report an improvement of 6.29% on the ZuCo 2.0 dataset, along with extensive ablation experiments
△ Less
Submitted 21 February, 2021;
originally announced February 2021.
-
Generating Emotive Gaits for Virtual Agents Using Affect-Based Autoregression
Authors:
Uttaran Bhattacharya,
Nicholas Rewkowski,
Pooja Guhan,
Niall L. Williams,
Trisha Mittal,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a novel autoregression network to generate virtual agents that convey various emotions through their walking styles or gaits. Given the 3D pose sequences of a gait, our network extracts pertinent movement features and affective features from the gait. We use these features to synthesize subsequent gaits such that the virtual agents can express and transition between emotions represented…
▽ More
We present a novel autoregression network to generate virtual agents that convey various emotions through their walking styles or gaits. Given the 3D pose sequences of a gait, our network extracts pertinent movement features and affective features from the gait. We use these features to synthesize subsequent gaits such that the virtual agents can express and transition between emotions represented as combinations of happy, sad, angry, and neutral. We incorporate multiple regularizations in the training of our network to simultaneously enforce plausible movements and noticeable emotions on the virtual agents. We also integrate our approach with an AR environment using a Microsoft HoloLens and can generate emotive gaits at interactive rates to increase the social presence. We evaluate how human observers perceive both the naturalness and the emotions from the generated gaits of the virtual agents in a web-based study. Our results indicate around 89% of the users found the naturalness of the gaits satisfactory on a five-point Likert scale, and the emotions they perceived from the virtual agents are statistically similar to the intended emotions of the virtual agents. We also use our network to augment existing gait datasets with emotive gaits and will release this augmented dataset for future research in emotion prediction and emotive gait synthesis. Our project website is available at https://gamma.umd.edu/gen_emotive_gaits/.
△ Less
Submitted 31 July, 2021; v1 submitted 4 October, 2020;
originally announced October 2020.
-
MCQA: Multimodal Co-attention Based Network for Question Answering
Authors:
Abhishek Kumar,
Trisha Mittal,
Dinesh Manocha
Abstract:
We present MCQA, a learning-based algorithm for multimodal question answering. MCQA explicitly fuses and aligns the multimodal input (i.e. text, audio, and video), which forms the context for the query (question and answer). Our approach fuses and aligns the question and the answer within this context. Moreover, we use the notion of co-attention to perform cross-modal alignment and multimodal cont…
▽ More
We present MCQA, a learning-based algorithm for multimodal question answering. MCQA explicitly fuses and aligns the multimodal input (i.e. text, audio, and video), which forms the context for the query (question and answer). Our approach fuses and aligns the question and the answer within this context. Moreover, we use the notion of co-attention to perform cross-modal alignment and multimodal context-query alignment. Our context-query alignment module matches the relevant parts of the multimodal context and the query with each other and aligns them to improve the overall performance. We evaluate the performance of MCQA on Social-IQ, a benchmark dataset for multimodal question answering. We compare the performance of our algorithm with prior methods and observe an accuracy improvement of 4-7%.
△ Less
Submitted 25 April, 2020;
originally announced April 2020.
-
Emotions Don't Lie: An Audio-Visual Deepfake Detection Method Using Affective Cues
Authors:
Trisha Mittal,
Uttaran Bhattacharya,
Rohan Chandra,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a learning-based method for detecting real and fake deepfake multimedia content. To maximize information for learning, we extract and analyze the similarity between the two audio and visual modalities from within the same video. Additionally, we extract and compare affective cues corresponding to perceived emotion from the two modalities within a video to infer whether the input video i…
▽ More
We present a learning-based method for detecting real and fake deepfake multimedia content. To maximize information for learning, we extract and analyze the similarity between the two audio and visual modalities from within the same video. Additionally, we extract and compare affective cues corresponding to perceived emotion from the two modalities within a video to infer whether the input video is "real" or "fake". We propose a deep learning network, inspired by the Siamese network architecture and the triplet loss. To validate our model, we report the AUC metric on two large-scale deepfake detection datasets, DeepFake-TIMIT Dataset and DFDC. We compare our approach with several SOTA deepfake detection methods and report per-video AUC of 84.4% on the DFDC and 96.6% on the DF-TIMIT datasets, respectively. To the best of our knowledge, ours is the first approach that simultaneously exploits audio and video modalities and also perceived emotions from the two modalities for deepfake detection.
△ Less
Submitted 1 August, 2020; v1 submitted 14 March, 2020;
originally announced March 2020.
-
EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege's Principle
Authors:
Trisha Mittal,
Pooja Guhan,
Uttaran Bhattacharya,
Rohan Chandra,
Aniket Bera,
Dinesh Manocha
Abstract:
We present EmotiCon, a learning-based algorithm for context-aware perceived human emotion recognition from videos and images. Motivated by Frege's Context Principle from psychology, our approach combines three interpretations of context for emotion recognition. Our first interpretation is based on using multiple modalities(e.g. faces and gaits) for emotion recognition. For the second interpretatio…
▽ More
We present EmotiCon, a learning-based algorithm for context-aware perceived human emotion recognition from videos and images. Motivated by Frege's Context Principle from psychology, our approach combines three interpretations of context for emotion recognition. Our first interpretation is based on using multiple modalities(e.g. faces and gaits) for emotion recognition. For the second interpretation, we gather semantic context from the input image and use a self-attention-based CNN to encode this information. Finally, we use depth maps to model the third interpretation related to socio-dynamic interactions and proximity among agents. We demonstrate the efficiency of our network through experiments on EMOTIC, a benchmark dataset. We report an Average Precision (AP) score of 35.48 across 26 classes, which is an improvement of 7-8 over prior methods. We also introduce a new dataset, GroupWalk, which is a collection of videos captured in multiple real-world settings of people walking. We report an AP of 65.83 across 4 categories on GroupWalk, which is also an improvement over prior methods.
△ Less
Submitted 14 March, 2020;
originally announced March 2020.
-
CMetric: A Driving Behavior Measure Using Centrality Functions
Authors:
Rohan Chandra,
Uttaran Bhattacharya,
Trisha Mittal,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a new measure, CMetric, to classify driver behaviors using centrality functions. Our formulation combines concepts from computational graph theory and social traffic psychology to quantify and classify the behavior of human drivers. CMetric is used to compute the probability of a vehicle executing a driving style, as well as the intensity used to execute the style. Our approach is desig…
▽ More
We present a new measure, CMetric, to classify driver behaviors using centrality functions. Our formulation combines concepts from computational graph theory and social traffic psychology to quantify and classify the behavior of human drivers. CMetric is used to compute the probability of a vehicle executing a driving style, as well as the intensity used to execute the style. Our approach is designed for realtime autonomous driving applications, where the trajectory of each vehicle or road-agent is extracted from a video. We compute a dynamic geometric graph (DGG) based on the positions and proximity of the road-agents and centrality functions corresponding to closeness and degree. These functions are used to compute the CMetric based on style likelihood and style intensity estimates. Our approach is general and makes no assumption about traffic density, heterogeneity, or how driving behaviors change over time. We present an algorithm to compute CMetric and demonstrate its performance on real-world traffic datasets. To test the accuracy of CMetric, we introduce a new evaluation protocol (called "Time Deviation Error") that measures the difference between human prediction and the prediction made by CMetric.
△ Less
Submitted 5 August, 2020; v1 submitted 9 March, 2020;
originally announced March 2020.
-
Forecasting Trajectory and Behavior of Road-Agents Using Spectral Clustering in Graph-LSTMs
Authors:
Rohan Chandra,
Tianrui Guan,
Srujan Panuganti,
Trisha Mittal,
Uttaran Bhattacharya,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a novel approach for traffic forecasting in urban traffic scenarios using a combination of spectral graph analysis and deep learning. We predict both the low-level information (future trajectories) as well as the high-level information (road-agent behavior) from the extracted trajectory of each road-agent. Our formulation represents the proximity between the road agents using a weighted…
▽ More
We present a novel approach for traffic forecasting in urban traffic scenarios using a combination of spectral graph analysis and deep learning. We predict both the low-level information (future trajectories) as well as the high-level information (road-agent behavior) from the extracted trajectory of each road-agent. Our formulation represents the proximity between the road agents using a weighted dynamic geometric graph (DGG). We use a two-stream graph-LSTM network to perform traffic forecasting using these weighted DGGs. The first stream predicts the spatial coordinates of road-agents, while the second stream predicts whether a road-agent is going to exhibit overspeeding, underspeeding, or neutral behavior by modeling spatial interactions between road-agents. Additionally, we propose a new regularization algorithm based on spectral clustering to reduce the error margin in long-term prediction (3-5 seconds) and improve the accuracy of the predicted trajectories. Moreover, we prove a theoretical upper bound on the regularized prediction error. We evaluate our approach on the Argoverse, Lyft, Apolloscape, and NGSIM datasets and highlight the benefits over prior trajectory prediction methods. In practice, our approach reduces the average prediction error by approximately 75% over prior algorithms and achieves a weighted average accuracy of 91.2% for behavior prediction. Additionally, our spectral regularization improves long-term prediction by up to 70%.
△ Less
Submitted 5 August, 2020; v1 submitted 2 December, 2019;
originally announced December 2019.
-
Take an Emotion Walk: Perceiving Emotions from Gaits Using Hierarchical Attention Pooling and Affective Map**
Authors:
Uttaran Bhattacharya,
Christian Roncal,
Trisha Mittal,
Rohan Chandra,
Kyra Kapsaskis,
Kurt Gray,
Aniket Bera,
Dinesh Manocha
Abstract:
We present an autoencoder-based semi-supervised approach to classify perceived human emotions from walking styles obtained from videos or motion-captured data and represented as sequences of 3D poses. Given the motion on each joint in the pose at each time step extracted from 3D pose sequences, we hierarchically pool these joint motions in a bottom-up manner in the encoder, following the kinematic…
▽ More
We present an autoencoder-based semi-supervised approach to classify perceived human emotions from walking styles obtained from videos or motion-captured data and represented as sequences of 3D poses. Given the motion on each joint in the pose at each time step extracted from 3D pose sequences, we hierarchically pool these joint motions in a bottom-up manner in the encoder, following the kinematic chains in the human body. We also constrain the latent embeddings of the encoder to contain the space of psychologically-motivated affective features underlying the gaits. We train the decoder to reconstruct the motions per joint per time step in a top-down manner from the latent embeddings. For the annotated data, we also train a classifier to map the latent embeddings to emotion labels. Our semi-supervised approach achieves a mean average precision of 0.84 on the Emotion-Gait benchmark dataset, which contains both labeled and unlabeled gaits collected from multiple sources. We outperform current state-of-art algorithms for both emotion recognition and action recognition from 3D gaits by 7%--23% on the absolute. More importantly, we improve the average precision by 10%--50% on the absolute on classes that each makes up less than 25% of the labeled part of the Emotion-Gait benchmark dataset.
△ Less
Submitted 31 July, 2021; v1 submitted 20 November, 2019;
originally announced November 2019.
-
M3ER: Multiplicative Multimodal Emotion Recognition Using Facial, Textual, and Speech Cues
Authors:
Trisha Mittal,
Uttaran Bhattacharya,
Rohan Chandra,
Aniket Bera,
Dinesh Manocha
Abstract:
We present M3ER, a learning-based method for emotion recognition from multiple input modalities. Our approach combines cues from multiple co-occurring modalities (such as face, text, and speech) and also is more robust than other methods to sensor noise in any of the individual modalities. M3ER models a novel, data-driven multiplicative fusion method to combine the modalities, which learn to empha…
▽ More
We present M3ER, a learning-based method for emotion recognition from multiple input modalities. Our approach combines cues from multiple co-occurring modalities (such as face, text, and speech) and also is more robust than other methods to sensor noise in any of the individual modalities. M3ER models a novel, data-driven multiplicative fusion method to combine the modalities, which learn to emphasize the more reliable cues and suppress others on a per-sample basis. By introducing a check step which uses Canonical Correlational Analysis to differentiate between ineffective and effective modalities, M3ER is robust to sensor noise. M3ER also generates proxy features in place of the ineffectual modalities. We demonstrate the efficiency of our network through experimentation on two benchmark datasets, IEMOCAP and CMU-MOSEI. We report a mean accuracy of 82.7% on IEMOCAP and 89.0% on CMU-MOSEI, which, collectively, is an improvement of about 5% over prior work.
△ Less
Submitted 22 November, 2019; v1 submitted 8 November, 2019;
originally announced November 2019.
-
STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits
Authors:
Uttaran Bhattacharya,
Trisha Mittal,
Rohan Chandra,
Tanmay Randhavane,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a novel classifier network called STEP, to classify perceived human emotion from gaits, based on a Spatial Temporal Graph Convolutional Network (ST-GCN) architecture. Given an RGB video of an individual walking, our formulation implicitly exploits the gait features to classify the emotional state of the human into one of four emotions: happy, sad, angry, or neutral. We use hundreds of a…
▽ More
We present a novel classifier network called STEP, to classify perceived human emotion from gaits, based on a Spatial Temporal Graph Convolutional Network (ST-GCN) architecture. Given an RGB video of an individual walking, our formulation implicitly exploits the gait features to classify the emotional state of the human into one of four emotions: happy, sad, angry, or neutral. We use hundreds of annotated real-world gait videos and augment them with thousands of annotated synthetic gaits generated using a novel generative network called STEP-Gen, built on an ST-GCN based Conditional Variational Autoencoder (CVAE). We incorporate a novel push-pull regularization loss in the CVAE formulation of STEP-Gen to generate realistic gaits and improve the classification accuracy of STEP. We also release a novel dataset (E-Gait), which consists of $2,177$ human gaits annotated with perceived emotions along with thousands of synthetic gaits. In practice, STEP can learn the affective features and exhibits classification accuracy of 89% on E-Gait, which is 14 - 30% more accurate over prior methods.
△ Less
Submitted 31 July, 2021; v1 submitted 28 October, 2019;
originally announced October 2019.
-
GraphRQI: Classifying Driver Behaviors Using Graph Spectrums
Authors:
Rohan Chandra,
Uttaran Bhattacharya,
Trisha Mittal,
Xiaoyu Li,
Aniket Bera,
Dinesh Manocha
Abstract:
We present a novel algorithm (GraphRQI) to identify driver behaviors from road-agent trajectories. Our approach assumes that the road-agents exhibit a range of driving traits, such as aggressive or conservative driving. Moreover, these traits affect the trajectories of nearby road-agents as well as the interactions between road-agents. We represent these inter-agent interactions using unweighted a…
▽ More
We present a novel algorithm (GraphRQI) to identify driver behaviors from road-agent trajectories. Our approach assumes that the road-agents exhibit a range of driving traits, such as aggressive or conservative driving. Moreover, these traits affect the trajectories of nearby road-agents as well as the interactions between road-agents. We represent these inter-agent interactions using unweighted and undirected traffic graphs. Our algorithm classifies the driver behavior using a supervised learning algorithm by reducing the computation to the spectral analysis of the traffic graph. Moreover, we present a novel eigenvalue algorithm to compute the spectrum efficiently. We provide theoretical guarantees for the running time complexity of our eigenvalue algorithm and show that it is faster than previous methods by 2 times. We evaluate the classification accuracy of our approach on traffic videos and autonomous driving datasets corresponding to urban traffic. In practice, GraphRQI achieves an accuracy improvement of up to 25% over prior driver behavior classification algorithms. We also use our classification algorithm to predict the future trajectories of road-agents.
△ Less
Submitted 15 February, 2020; v1 submitted 30 September, 2019;
originally announced October 2019.
-
An Exo-Kuiper Belt and An Extended Halo around HD 191089 in Scattered Light
Authors:
Bin Ren,
Élodie Choquet,
Marshall D. Perrin,
Gaspard Duchêne,
John H. Debes,
Laurent Pueyo,
Malena Rice,
Christine Chen,
Glenn Schneider,
Thomas M. Esposito,
Charles A. Poteet,
Jason J. Wang,
S. Mark Ammons,
Megan Ansdell,
Pauline Arriaga,
Vanessa P. Bailey,
Travis Barman,
Juan Sebastián Bruzzone,
Joanna Bulger,
Jeffrey Chilcote,
Tara Cotten,
Robert J. De Rosa,
Rene Doyon,
Michael P. Fitzgerald,
Katherine B. Follette
, et al. (48 additional authors not shown)
Abstract:
We have obtained Hubble Space Telescope STIS and NICMOS, and Gemini/GPI scattered light images of the HD 191089 debris disk. We identify two spatial components: a ring resembling Kuiper Belt in radial extent (FWHM: ${\sim}$25 au, centered at ${\sim}$46 au), and a halo extending to ${\sim}$640 au. We find that the halo is significantly bluer than the ring, consistent with the scenario that the ring…
▽ More
We have obtained Hubble Space Telescope STIS and NICMOS, and Gemini/GPI scattered light images of the HD 191089 debris disk. We identify two spatial components: a ring resembling Kuiper Belt in radial extent (FWHM: ${\sim}$25 au, centered at ${\sim}$46 au), and a halo extending to ${\sim}$640 au. We find that the halo is significantly bluer than the ring, consistent with the scenario that the ring serves as the "birth ring" for the smaller dust in the halo. We measure the scattering phase functions in the 30°-150° scattering angle range and find the halo dust is both more forward- and backward-scattering than the ring dust. We measure a surface density power law index of -0.68${\pm}$0.04 for the halo, which indicates the slow-down of the radial outward motion of the dust. Using radiative transfer modeling, we attempt to simultaneously reproduce the (visible) total and (near-infrared) polarized intensity images of the birth ring. Our modeling leads to mutually inconsistent results, indicating that more complex models, such as the inclusion of more realistic aggregate particles, are needed.
△ Less
Submitted 31 July, 2019;
originally announced August 2019.
-
Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing
Authors:
Ravi Kiran Sarvadevabhatla,
Shiv Surya,
Trisha Mittal,
Venkatesh Babu Radhakrishnan
Abstract:
The ability of intelligent agents to play games in human-like fashion is popularly considered a benchmark of progress in Artificial Intelligence. Similarly, performance on multi-disciplinary tasks such as Visual Question Answering (VQA) is considered a marker for gauging progress in Computer Vision. In our work, we bring games and VQA together. Specifically, we introduce the first computational mo…
▽ More
The ability of intelligent agents to play games in human-like fashion is popularly considered a benchmark of progress in Artificial Intelligence. Similarly, performance on multi-disciplinary tasks such as Visual Question Answering (VQA) is considered a marker for gauging progress in Computer Vision. In our work, we bring games and VQA together. Specifically, we introduce the first computational model aimed at Pictionary, the popular word-guessing social game. We first introduce Sketch-QA, an elementary version of Visual Question Answering task. Styled after Pictionary, Sketch-QA uses incrementally accumulated sketch stroke sequences as visual data. Notably, Sketch-QA involves asking a fixed question ("What object is being drawn?") and gathering open-ended guess-words from human guessers. We analyze the resulting dataset and present many interesting findings therein. To mimic Pictionary-style guessing, we subsequently propose a deep neural model which generates guess-words in response to temporally evolving human-drawn sketches. Our model even makes human-like mistakes while guessing, thus amplifying the human mimicry factor. We evaluate our model on the large-scale guess-word dataset generated via Sketch-QA task and compare with various baselines. We also conduct a Visual Turing Test to obtain human impressions of the guess-words generated by humans and our model. Experimental results demonstrate the promise of our approach for Pictionary and similarly themed games.
△ Less
Submitted 28 January, 2018;
originally announced January 2018.
-
Infrared Spectroscopy of HR 4796A's Bright Outer Cometary Ring + Tenuous Inner Hot Dust Cloud
Authors:
Carey M. Lisse,
Mike L. Sitko,
Massimo Marengo,
Ron J. Vervack,
Yanga R. Fernandez,
Tushar Mittal,
Christine H. Chen
Abstract:
We have obtained new NASA IRTF SpeX spectra of the HR 4796A debris ring system. We find a unique red excess flux that extends out to ~9 um in Spitzer IRS spectra, where thermal emission from cold, ~100K dust from the system's ring at ~75 AU takes over. Matching imaging ring photometry, we find the excess consists of NIR reflectance from the ring which is as red as that of old, processed comet nucl…
▽ More
We have obtained new NASA IRTF SpeX spectra of the HR 4796A debris ring system. We find a unique red excess flux that extends out to ~9 um in Spitzer IRS spectra, where thermal emission from cold, ~100K dust from the system's ring at ~75 AU takes over. Matching imaging ring photometry, we find the excess consists of NIR reflectance from the ring which is as red as that of old, processed comet nuclei, plus a tenuous thermal emission component from close-in, T ~ 850 K circumstellar material evincing an organic plus silicate emission feature complex at 7 - 13 um. Unusual, emission-like features due to atomic Si, S, Ca, and Sr were found at 0.96 - 1.07 um, likely sourced by rocky dust evaporating in the 850 K component. An empirical cometary dust phase function can reproduce the scattered light excess and 1:5 balance of scattered vs. thermal energy for the ring with optical depth Tau > 0.10 in an 8 AU wide belt of 4 AU vertical height and Mdust > 0.1-0.7 M_Mars. Our results are consistent with HR 4796A consisting of a narrow sheparded ring of devolatilized cometary material associated with multiple rocky planetesimal subcores, and a small steady stream of dust inflowing from this belt to a rock sublimation zone at approximately 1 AU from the primary. These subcores were built from comets that have been actively emitting large, reddish dust for > 0.4 Myr at 100K, the temperature at which cometary activity onset is seen in our Solar System.
△ Less
Submitted 9 August, 2017;
originally announced August 2017.
-
The Mahler measure for arbitrary tori
Authors:
Matilde Lalin,
Tushant Mittal
Abstract:
We consider a variation of the Mahler measure where the defining integral is performed over a more general torus. We focus our investigation on two particular polynomials related to certain elliptic curve $E$ and we establish new formulas for this variation of the Mahler measure in terms of $L'(E,0)$.
We consider a variation of the Mahler measure where the defining integral is performed over a more general torus. We focus our investigation on two particular polynomials related to certain elliptic curve $E$ and we establish new formulas for this variation of the Mahler measure in terms of $L'(E,0)$.
△ Less
Submitted 8 August, 2017;
originally announced August 2017.
-
Spectral Evidence for an Inner Carbon-Rich Circumstellar Dust Belt in the Young HD36546 A-Star System
Authors:
Carey M. Lisse,
Mike L. Sitko,
Ray W. Russell,
Massimo Marengo,
Thayne Currie,
Carl Melis,
Tushar Mittal,
Inseok Song
Abstract:
Using the NASA/IRTF SpeX & BASS spectrometers we have obtained novel 0.7 - 13 um observations of the newly imaged HD36546 debris disk system. The SpeX spectrum is most consistent with the photospheric emission expected from an Lstar ~ 20 Lsun, solar abundance A1.5V star with little/no extinction and excess emission from circumstellar dust detectable beyond 4.5 um. Non-detections of CO emission lin…
▽ More
Using the NASA/IRTF SpeX & BASS spectrometers we have obtained novel 0.7 - 13 um observations of the newly imaged HD36546 debris disk system. The SpeX spectrum is most consistent with the photospheric emission expected from an Lstar ~ 20 Lsun, solar abundance A1.5V star with little/no extinction and excess emission from circumstellar dust detectable beyond 4.5 um. Non-detections of CO emission lines and accretion signatures point to the gas poor circumstellar environment of a very old transition disk. Combining the SpeX and BASS spectra with archival WISE/AKARI/IRAS/Herschel photometery, we find an outer cold dust belt at ~135K and 20 - 40 AU from the primary, likely coincident with the disk imaged by Subaru (Currie et al. 2017), and a new second inner belt with temperature ~570K and an unusual, broad SED maximum in the 6 - 9 um region, tracing dust at 1.1 - 2.2 AU. An SED maximum at 6 - 9 um has been reported in just two other A-star systems, HD131488 and HD121191, both of ~10 Myr age (Melis et al. 2013). From Spitzer, we have also identified the ~12 Myr old A7V HD148567 system as having similar 5 - 35 um excess spectral features (Mittal et al. 2015). The Spitzer data allows us to rule out water emission and rule in carbonaceous materials - organics, carbonates, SiC - as the source of the 6 - 9 um excess. Assuming a common origin for the 4 young Astar systems' disks, we suggest they are experiencing an early era of carbon-rich planetesimal processing.
△ Less
Submitted 20 April, 2017;
originally announced April 2017.
-
Discovery and spectroscopy of the young Jovian planet 51 Eri b with the Gemini Planet Imager
Authors:
B. Macintosh,
J. R. Graham,
T. Barman,
R. J. De Rosa,
Q. Konopacky,
M. S. Marley,
C. Marois,
E. L. Nielsen,
L. Pueyo,
A. Rajan,
J. Rameau,
D. Saumon,
J. J. Wang,
J. Patience,
M. Ammons,
P. Arriaga,
E. Artigau,
S. Beckwith,
J. Brewster,
S. Bruzzone,
J. Bulger,
B. Burningham,
A. S. Burrows,
C. Chen,
E. Chiang
, et al. (63 additional authors not shown)
Abstract:
Directly detecting thermal emission from young extrasolar planets allows measurement of their atmospheric composition and luminosity, which is influenced by their formation mechanism. Using the Gemini Planet Imager, we discovered a planet orbiting the \$sim$20 Myr-old star 51 Eridani at a projected separation of 13 astronomical units. Near-infrared observations show a spectrum with strong methane…
▽ More
Directly detecting thermal emission from young extrasolar planets allows measurement of their atmospheric composition and luminosity, which is influenced by their formation mechanism. Using the Gemini Planet Imager, we discovered a planet orbiting the \$sim$20 Myr-old star 51 Eridani at a projected separation of 13 astronomical units. Near-infrared observations show a spectrum with strong methane and water vapor absorption. Modeling of the spectra and photometry yields a luminosity of L/LS=1.6-4.0 x 10-6 and an effective temperature of 600-750 K. For this age and luminosity, "hot-start" formation models indicate a mass twice that of Jupiter. This planet also has a sufficiently low luminosity to be consistent with the "cold- start" core accretion process that may have formed Jupiter.
△ Less
Submitted 9 October, 2015; v1 submitted 12 August, 2015;
originally announced August 2015.
-
IRS Spectra of Debris Disks in the Scorpius-Centaurus OB Association
Authors:
Hannah Jang-Condell,
Christine H. Chen,
Tushar Mittal,
P. Manoj,
Dan Watson,
Carey M. Lisse,
Erika Nesvold,
Marc Kuchner
Abstract:
We analyze Spitzer IRS spectra of 110 B-, A-, F-, and G-type stars with optically thin infrared excess in the Scorpius-Centaurus (ScoCen) OB association. The age of these stars ranges from 11-17 Myr. We fit the infrared excesses observed in these sources by Spitzer IRS and Spitzer MIPS to simple dust models according to Mie theory. We find that nearly all the objects in our study can be fit by one…
▽ More
We analyze Spitzer IRS spectra of 110 B-, A-, F-, and G-type stars with optically thin infrared excess in the Scorpius-Centaurus (ScoCen) OB association. The age of these stars ranges from 11-17 Myr. We fit the infrared excesses observed in these sources by Spitzer IRS and Spitzer MIPS to simple dust models according to Mie theory. We find that nearly all the objects in our study can be fit by one or two belts of dust. Dust around lower mass stars appears to be closer in than around higher mass stars, particularly for the warm dust component in the two-belt systems, suggesting mass-dependent evolution of debris disks around young stars. For those objects with stellar companions, all dust distances are consistent with trunction of the debris disk by the binary companion. The gaps between several of the two-belt systems can place limits on the planets that might lie between the belts, potentially constraining the mass and locations of planets that may be forming around these stars.
△ Less
Submitted 22 July, 2015; v1 submitted 17 June, 2015;
originally announced June 2015.
-
Discovery of Resolved Debris Disk Around HD 131835
Authors:
Li-Wei Hung,
Michael P. Fitzgerald,
Christine H. Chen,
Tushar Mittal,
Paul G. Kalas,
James R. Graham
Abstract:
We report the discovery of the resolved disk around HD 131835 and present the analysis and modeling of its thermal emission. HD 131835 is a ~15 Myr A2 star in the Scorpius-Centaurus OB association at a distance of 122.7 +16.2 -12.8 parsec. The extended disk has been detected to ~1.5" (200 AU) at 11.7 μm and 18.3 μm with T-ReCS on Gemini South. The disk is inclined at an angle of ~75° with the posi…
▽ More
We report the discovery of the resolved disk around HD 131835 and present the analysis and modeling of its thermal emission. HD 131835 is a ~15 Myr A2 star in the Scorpius-Centaurus OB association at a distance of 122.7 +16.2 -12.8 parsec. The extended disk has been detected to ~1.5" (200 AU) at 11.7 μm and 18.3 μm with T-ReCS on Gemini South. The disk is inclined at an angle of ~75° with the position angle of ~61°. The flux of HD 131835 system is 49.3+-7.6 mJy and 84+-45 mJy at 11.7 μm and 18.3 μm respectively. A model with three grain populations gives a satisfactory fit to both the spectral energy distribution and the images simultaneously. This best-fit model is composed of a hot continuous power-law disk and two rings. We characterized the grain temperature profile and found that the grains in all three populations are emitting at temperatures higher than blackbodies. In particular, the grains in the continuous disk are unusually warm; even when considering small graphite particles as the composition.
△ Less
Submitted 6 February, 2015;
originally announced February 2015.
-
Fast Modes and Dusty Horseshoes in Transitional Disks
Authors:
Tushar Mittal,
Eugene Chiang
Abstract:
The brightest transitional protoplanetary disks are often azimuthally asymmetric: their mm-wave thermal emission peaks strongly on one side. Dust overdensities can exceed $\sim$100:1, while gas densities vary by factors less than a few. We propose that these remarkable ALMA observations---which may bear on how planetesimals form---reflect a gravitational global mode in the gas disk. The mode is (1…
▽ More
The brightest transitional protoplanetary disks are often azimuthally asymmetric: their mm-wave thermal emission peaks strongly on one side. Dust overdensities can exceed $\sim$100:1, while gas densities vary by factors less than a few. We propose that these remarkable ALMA observations---which may bear on how planetesimals form---reflect a gravitational global mode in the gas disk. The mode is (1) fast---its pattern speed equals the disk's mean Keplerian frequency; (2) of azimuthal wavenumber $m=1$, displacing the host star from the barycenter; and (3) Toomre-stable. We solve for gas streamlines including the indirect stellar potential in the frame rotating with the pattern speed, under the drastic simplification that gas does not feel its own gravity. Near co-rotation, the gas disk takes the form of a horseshoe-shaped annulus. Dust particles with aerodynamic stop** times much shorter or much longer than the orbital period are dragged by gas toward the horseshoe center. For intermediate stop** times, dust converges toward a $\sim$45$^\circ$-wide arc on the co-rotation circle. Particles that do not reach their final accumulation points within disk lifetimes, either because of gas turbulence or long particle drift times, conform to horseshoe-shaped gas streamlines. Our mode is not self-consistent because we neglect gas self-gravity; still, we expect that trends between accumulation location and particle size, similar to those we have found, are generically predicted by fast modes and are potentially observable. Unlike vortices, global modes are not restricted in radial width to the pressure scale height; their large radial and azimuthal extents may better match observations.
△ Less
Submitted 5 December, 2014;
originally announced December 2014.
-
Polarimetry with the Gemini Planet Imager: Methods, Performance at First Light, and the Circumstellar Ring around HR 4796A
Authors:
Marshall D. Perrin,
Gaspard Duchene,
Max Millar-Blanchaer,
Michael P. Fitzgerald,
James R. Graham,
Sloane J. Wiktorowicz,
Paul G. Kalas,
Bruce Macintosh,
Brian Bauman,
Andrew Cardwell,
Jeffrey Chilcote,
Robert J. De Rosa,
Daren Dillon,
René Doyon,
Jennifer Dunn,
Donald Gavel,
Stephen Goodsell,
Markus Hartung,
Pascale Hibon,
Patrick Ingraham,
Daniel Kerley,
Quinn Konapacky,
James E. Larkin,
Jérôme Maire,
Franck Marchis
, et al. (19 additional authors not shown)
Abstract:
We present the first results from the polarimetry mode of the Gemini Planet Imager (GPI), which uses a new integral field polarimetry architecture to provide high contrast linear polarimetry with minimal systematic biases between the orthogonal polarizations. We describe the design, data reduction methods, and performance of polarimetry with GPI. Point spread function subtraction via differential…
▽ More
We present the first results from the polarimetry mode of the Gemini Planet Imager (GPI), which uses a new integral field polarimetry architecture to provide high contrast linear polarimetry with minimal systematic biases between the orthogonal polarizations. We describe the design, data reduction methods, and performance of polarimetry with GPI. Point spread function subtraction via differential polarimetry suppresses unpolarized starlight by a factor of over 100, and provides sensitivity to circumstellar dust reaching the photon noise limit for these observations. In the case of the circumstellar disk around HR 4796A, GPI's advanced adaptive optics system reveals the disk clearly even prior to PSF subtraction. In polarized light, the disk is seen all the way in to its semi-minor axis for the first time. The disk exhibits surprisingly strong asymmetry in polarized intensity, with the west side >9 times brighter than the east side despite the fact that the east side is slightly brighter in total intensity. Based on a synthesis of the total and polarized intensities, we now believe that the west side is closer to us, contrary to most prior interpretations. Forward scattering by relatively large silicate dust particles leads to the strong polarized intensity on the west side, and the ring must be slightly optically thick in order to explain the lower brightness in total intensity there. These findings suggest that the ring is geometrically narrow and dynamically cold, perhaps shepherded by larger bodies in the same manner as Saturn's F ring.
△ Less
Submitted 9 July, 2014;
originally announced July 2014.
-
Five Debris Disks Newly Revealed in Scattered Light from the HST NICMOS Archive
Authors:
Rémi Soummer,
Marshall D. Perrin,
Laurent Pueyo,
Élodie Choquet,
Christine Chen,
David A. Golimowski,
J. Brendan Hagan,
Tushar Mittal,
Margaret Moerchen,
Mamadou N'Diaye,
Abhijith Rajan,
Schuyler Wolff,
John Debes,
Dean C. Hines,
Glenn Schneider
Abstract:
We have spatially resolved five debris disks (HD 30447, HD 35841, HD 141943, HD 191089, and HD 202917) for the first time in near-infrared scattered light by reanalyzing archival Hubble Space Telescope (HST)/NICMOS coronagraphic images obtained between 1999 and 2006. One of these disks (HD 202917) was previously resolved at visible wavelengths using HST/Advanced Camera for Surveys. To obtain these…
▽ More
We have spatially resolved five debris disks (HD 30447, HD 35841, HD 141943, HD 191089, and HD 202917) for the first time in near-infrared scattered light by reanalyzing archival Hubble Space Telescope (HST)/NICMOS coronagraphic images obtained between 1999 and 2006. One of these disks (HD 202917) was previously resolved at visible wavelengths using HST/Advanced Camera for Surveys. To obtain these new disk images, we performed advanced point-spread function subtraction based on the Karhunen-Loeve Image Projection (KLIP) algorithm on recently reprocessed NICMOS data with improved detector artifact removal (Legacy Archive PSF Library And Circumstellar Environments Legacy program). Three of the disks (HD 30447, HD 35841, and HD 141943) appear edge-on, while the other two (HD 191089 and HD 202917) appear inclined. The inclined disks have been sculpted into rings; in particular, the disk around HD 202917 exhibits strong asymmetries. All five host stars are young (8-40 Myr), nearby (40-100 pc) F and G stars, and one (HD 141943) is a close analog to the young sun during the epoch of terrestrial planet formation. Our discoveries increase the number of debris disks resolved in scattered light from 19 to 23 (a 21% increase). Given their youth, proximity, and brightness (V = 7.2 to 8.5), these targets are excellent candidates for follow-up investigations of planet formation at visible wavelengths using the HST/STIS coronagraph, at near-infrared wavelengths with the Gemini Planet Imager (GPI) and Very Large Telescope (VLT)/SPHERE, and at thermal infrared wavelengths with the James Webb Space Telescope NIRCam and MIRI coronagraphs.
△ Less
Submitted 22 April, 2014;
originally announced April 2014.