-
TEGRA -- Scaling Up Terascale Graph Processing with Disaggregated Computing
Authors:
William Shaddix,
Mahyar Samani,
Marjan Fariborz,
S. J. Ben Yoo,
Jason Lowe-Power,
Venkatesh Akella
Abstract:
Graphs are essential for representing relationships in various domains, driving modern AI applications such as graph analytics and neural networks across science, engineering, cybersecurity, transportation, and economics. However, the size of modern graphs are rapidly expanding, posing challenges for traditional CPUs and GPUs in meeting real-time processing demands. As a result, hardware accelerat…
▽ More
Graphs are essential for representing relationships in various domains, driving modern AI applications such as graph analytics and neural networks across science, engineering, cybersecurity, transportation, and economics. However, the size of modern graphs are rapidly expanding, posing challenges for traditional CPUs and GPUs in meeting real-time processing demands. As a result, hardware accelerators for graph processing have been proposed. However, the largest graphs that can be handled by these systems is still modest often targeting Twitter graph(1.4B edges approximately). This paper aims to address this limitation by develo** a graph accelerator capable of terascale graph processing. Scale out architectures, architectures where nodes are replicated to expand to larger datasets, are natural for handling larger graphs. We argue that this approach is not appropriate for very large-scale graphs because it leads to under utilization of both memory resources and compute resources. Additionally, vertex and edge processing have different access patterns. Communication overheads also pose further challenges in designing scalable architectures. To overcome these issues, this paper proposes TEGRA, a scale-up architecture for terascale graph processing. TEGRA leverages a composable computing system with disaggregated resources and a communication architecture inspired by Active Messages. By employing direct communication between cores and optimizing memory interconnect utilization, TEGRA effectively reduces communication overhead and improves resource utilization, therefore enabling efficient processing of terascale graphs.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Towards Reverse-Engineering the Brain: Brain-Derived Neuromorphic Computing Approach with Photonic, Electronic, and Ionic Dynamicity in 3D integrated circuits
Authors:
S. J. Ben Yoo,
Luis El-Srouji,
Suman Datta,
Shimeng Yu,
Jean Anne Incorvia,
Alberto Salleo,
Volker Sorger,
Juejun Hu,
Lionel C Kimerling,
Kristofer Bouchard,
Joy Geng,
Rishidev Chaudhuri,
Charan Ranganath,
Randall O'Reilly
Abstract:
The human brain has immense learning capabilities at extreme energy efficiencies and scale that no artificial system has been able to match. For decades, reverse engineering the brain has been one of the top priorities of science and technology research. Despite numerous efforts, conventional electronics-based methods have failed to match the scalability, energy efficiency, and self-supervised lea…
▽ More
The human brain has immense learning capabilities at extreme energy efficiencies and scale that no artificial system has been able to match. For decades, reverse engineering the brain has been one of the top priorities of science and technology research. Despite numerous efforts, conventional electronics-based methods have failed to match the scalability, energy efficiency, and self-supervised learning capabilities of the human brain. On the other hand, very recent progress in the development of new generations of photonic and electronic memristive materials, device technologies, and 3D electronic-photonic integrated circuits (3D EPIC ) promise to realize new brain-derived neuromorphic systems with comparable connectivity, density, energy-efficiency, and scalability. When combined with bio-realistic learning algorithms and architectures, it may be possible to realize an 'artificial brain' prototype with general self-learning capabilities. This paper argues the possibility of reverse-engineering the brain through architecting a prototype of a brain-derived neuromorphic computing system consisting of artificial electronic, ionic, photonic materials, devices, and circuits with dynamicity resembling the bio-plausible molecular, neuro/synaptic, neuro-circuit, and multi-structural hierarchical macro-circuits of the brain based on well-tested computational models. We further argue the importance of bio-plausible local learning algorithms applicable to the neuromorphic computing system that capture the flexible and adaptive unsupervised and self-supervised learning mechanisms central to human intelligence. Most importantly, we emphasize that the unique capabilities in brain-derived neuromorphic computing prototype systems will enable us to understand links between specific neuronal and network-level properties with system-level functioning and behavior.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction
Authors:
Yi Zhou,
Hui Zhang,
Jiaqian Yu,
Yifan Yang,
Sangil Jung,
Seung-In Park,
ByungIn Yoo
Abstract:
Vectorized High-Definition (HD) map construction requires predictions of the category and point coordinates of map elements (e.g. road boundary, lane divider, pedestrian crossing, etc.). State-of-the-art methods are mainly based on point-level representation learning for regressing accurate point coordinates. However, this pipeline has limitations in obtaining element-level information and handlin…
▽ More
Vectorized High-Definition (HD) map construction requires predictions of the category and point coordinates of map elements (e.g. road boundary, lane divider, pedestrian crossing, etc.). State-of-the-art methods are mainly based on point-level representation learning for regressing accurate point coordinates. However, this pipeline has limitations in obtaining element-level information and handling element-level failures, e.g. erroneous element shape or entanglement between elements. To tackle the above issues, we propose a simple yet effective HybrId framework named HIMap to sufficiently learn and interact both point-level and element-level information. Concretely, we introduce a hybrid representation called HIQuery to represent all map elements, and propose a point-element interactor to interactively extract and encode the hybrid information of elements, e.g. point position and element shape, into the HIQuery. Additionally, we present a point-element consistency constraint to enhance the consistency between the point-level and element-level information. Finally, the output point-element integrated HIQuery can be directly converted into map elements' class, point coordinates, and mask. We conduct extensive experiments and consistently outperform previous methods on both nuScenes and Argoverse2 datasets. Notably, our method achieves $77.8$ mAP on the nuScenes dataset, remarkably superior to previous SOTAs by $8.3$ mAP at least.
△ Less
Submitted 26 March, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Authors:
Junseok Park,
Yoonsung Kim,
Hee Bin Yoo,
Min Whoo Lee,
Kibeom Kim,
Won-Seok Choi,
Minsu Lee,
Byoung-Tak Zhang
Abstract:
Toddlers evolve from free exploration with sparse feedback to exploiting prior experiences for goal-directed learning with denser rewards. Drawing inspiration from this Toddler-Inspired Reward Transition, we set out to explore the implications of varying reward transitions when incorporated into Reinforcement Learning (RL) tasks. Central to our inquiry is the transition from sparse to potential-ba…
▽ More
Toddlers evolve from free exploration with sparse feedback to exploiting prior experiences for goal-directed learning with denser rewards. Drawing inspiration from this Toddler-Inspired Reward Transition, we set out to explore the implications of varying reward transitions when incorporated into Reinforcement Learning (RL) tasks. Central to our inquiry is the transition from sparse to potential-based dense rewards, which share optimal strategies regardless of reward changes. Through various experiments, including those in egocentric navigation and robotic arm manipulation tasks, we found that proper reward transitions significantly influence sample efficiency and success rates. Of particular note is the efficacy of the toddler-inspired Sparse-to-Dense (S2D) transition. Beyond these performance metrics, using Cross-Density Visualizer technique, we observed that transitions, especially the S2D, smooth the policy loss landscape, promoting wide minima that enhance generalization in RL models.
△ Less
Submitted 18 March, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Experimental Demonstration of Imperfection-Agnostic Local Learning Rules on Photonic Neural Networks with Mach-Zehnder Interferometric Meshes
Authors:
Luis El Srouji,
Mehmet Berkay On,
Yun-Jhu Lee,
Mahmoud Abdelghany,
S. J. Ben Yoo
Abstract:
Mach-Zehnder Interferometric meshes are attractive for low-loss photonic matrix multiplication but are challenging to program. Using least-squares optimization of directional derivatives, we experimentally demonstrate that desired matrix updates can be implemented agnostic to hardware imperfections. \c{opyright} 2024 The Author(s)
Mach-Zehnder Interferometric meshes are attractive for low-loss photonic matrix multiplication but are challenging to program. Using least-squares optimization of directional derivatives, we experimentally demonstrate that desired matrix updates can be implemented agnostic to hardware imperfections. \c{opyright} 2024 The Author(s)
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
0.08 fF, 0.72 nA dark current, 91% Quantum Efficiency, 38 Gb/s Nano-photodetector on a 45 nm CMOS Silicon-Photonic Platform
Authors:
Mingye Fu,
S. J. Ben Yoo
Abstract:
We demonstrated a Germanium-on-Silicon photodetector utilizing an asymmetric-Fabry-Perot resonator with 0.08 fF capacitance. The measurements at 1315.5 nm show 0.72 nA (3.40 nA) dark current, 0.93 A/W (0.96 A/W) responsivity, 36 Gb/s (38 Gb/s) operation at -1V (-2V) bias.
We demonstrated a Germanium-on-Silicon photodetector utilizing an asymmetric-Fabry-Perot resonator with 0.08 fF capacitance. The measurements at 1315.5 nm show 0.72 nA (3.40 nA) dark current, 0.93 A/W (0.96 A/W) responsivity, 36 Gb/s (38 Gb/s) operation at -1V (-2V) bias.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Gappy AE: A Nonlinear Approach for Gappy Data Reconstruction using Auto-Encoder
Authors:
Youngkyu Kim,
Youngsoo Choi,
Byounghyun Yoo
Abstract:
We introduce a novel data reconstruction algorithm known as Gappy auto-encoder (Gappy AE) to address the limitations associated with Gappy proper orthogonal decomposition (Gappy POD), a widely used method for data reconstruction when dealing with sparse measurements or missing data. Gappy POD has inherent constraints in accurately representing solutions characterized by slowly decaying Kolmogorov…
▽ More
We introduce a novel data reconstruction algorithm known as Gappy auto-encoder (Gappy AE) to address the limitations associated with Gappy proper orthogonal decomposition (Gappy POD), a widely used method for data reconstruction when dealing with sparse measurements or missing data. Gappy POD has inherent constraints in accurately representing solutions characterized by slowly decaying Kolmogorov N-widths, primarily due to its reliance on linear subspaces for data prediction. In contrast, Gappy AE leverages the power of nonlinear manifold representations to address data reconstruction challenges of conventional Gappy POD. It excels at real-time state prediction in scenarios where only sparsely measured data is available, filling in the gaps effectively. This capability makes Gappy AE particularly valuable, such as for digital twin and image correction applications. To demonstrate the superior data reconstruction performance of Gappy AE with sparse measurements, we provide several numerical examples, including scenarios like 2D diffusion, 2D radial advection, and 2D wave equation problems. Additionally, we assess the impact of four distinct sampling algorithms - discrete empirical interpolation method, the S-OPT algorithm, Latin hypercube sampling, and uniformly distributed sampling - on data reconstruction accuracy. Our findings conclusively show that Gappy AE outperforms Gappy POD in data reconstruction when sparse measurements are given.
△ Less
Submitted 31 March, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Demonstration of Programmable Brain-Inspired Optoelectronic Neuron in Photonic Spiking Neural Network with Neural Heterogeneity
Authors:
Yun-Jhu Lee,
Mehmet Berkay On,
Luis El Srouji,
Li Zhang,
Mahmoud Abdelghany,
S. J. Ben Yoo
Abstract:
Photonic Spiking Neural Networks (PSNN) composed of the co-integrated CMOS and photonic elements can offer low loss, low power, highly-parallel, and high-throughput computing for brain-inspired neuromorphic systems. In addition, heterogeneity of neuron dynamics can also bring greater diversity and expressivity to brain-inspired networks, potentially allowing for the implementation of complex funct…
▽ More
Photonic Spiking Neural Networks (PSNN) composed of the co-integrated CMOS and photonic elements can offer low loss, low power, highly-parallel, and high-throughput computing for brain-inspired neuromorphic systems. In addition, heterogeneity of neuron dynamics can also bring greater diversity and expressivity to brain-inspired networks, potentially allowing for the implementation of complex functions with fewer neurons. In this paper, we design, fabricate, and experimentally demonstrate an optoelectronic spiking neuron that can simultaneously achieve high programmability for heterogeneous biological neural networks and maintain high-speed computing. We demonstrate that our neuron can be programmed to tune four essential parameters of neuron dynamics under 1GSpike/s input spiking pattern signals. A single neuron circuit can be tuned to output three spiking patterns, including chattering behaviors. The PSNN consisting of the optoelectronic spiking neuron and a Mach-Zehnder interferometer (MZI) mesh synaptic network achieves 89.3% accuracy on the Iris dataset. Our neuron power consumption is 1.18 pJ/spike output, mainly limited by the power efficiency of the vertical-cavity-lasers, optical coupling efficiency, and the 45 nm CMOS platform used in this experiment, and is predicted to achieve 36.84 fJ/spike output with a 7 nm CMOS platform (e.g. ASAP7) integrated with silicon photonics containing on-chip micron-scale lasers.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation
Authors:
Hye Bin Yoo,
Hyun Min Han,
Sung Soo Hwang,
Il Yong Chun
Abstract:
Neural radiance field (NeRF) is an emerging view synthesis method that samples points in a three-dimensional (3D) space and estimates their existence and color probabilities. The disadvantage of NeRF is that it requires a long training time since it samples many 3D points. In addition, if one samples points from occluded regions or in the space where an object is unlikely to exist, the rendering q…
▽ More
Neural radiance field (NeRF) is an emerging view synthesis method that samples points in a three-dimensional (3D) space and estimates their existence and color probabilities. The disadvantage of NeRF is that it requires a long training time since it samples many 3D points. In addition, if one samples points from occluded regions or in the space where an object is unlikely to exist, the rendering quality of NeRF can be degraded. These issues can be solved by estimating the geometry of 3D scene. This paper proposes a near-surface sampling framework to improve the rendering quality of NeRF. To this end, the proposed method estimates the surface of a 3D object using depth images of the training set and sampling is performed around there only. To obtain depth information on a novel view, the paper proposes a 3D point cloud generation method and a simple refining method for projected depth from a point cloud. Experimental results show that the proposed near-surface sampling NeRF framework can significantly improve the rendering quality, compared to the original NeRF and three different state-of-the-art NeRF. In addition, one can significantly accelerate the training time of a NeRF model with the proposed near-surface sampling framework.
△ Less
Submitted 17 March, 2024; v1 submitted 6 October, 2023;
originally announced October 2023.
-
Semi-orthogonal subspaces for value mediate a tradeoff between binding and generalization
Authors:
W. Jeffrey Johnston,
Justin M. Fine,
Seng Bum Michael Yoo,
R. Becket Ebitz,
Benjamin Y. Hayden
Abstract:
When choosing between options, we must associate their values with the action needed to select them. We hypothesize that the brain solves this binding problem through neural population subspaces. To test this hypothesis, we examined neuronal responses in five reward-sensitive regions in macaques performing a risky choice task with sequential offers. Surprisingly, in all areas, the neural populatio…
▽ More
When choosing between options, we must associate their values with the action needed to select them. We hypothesize that the brain solves this binding problem through neural population subspaces. To test this hypothesis, we examined neuronal responses in five reward-sensitive regions in macaques performing a risky choice task with sequential offers. Surprisingly, in all areas, the neural population encoded the values of offers presented on the left and right in distinct subspaces. We show that the encoding we observe is sufficient to bind the values of the offers to their respective positions in space while preserving abstract value information, which may be important for rapid learning and generalization to novel contexts. Moreover, after both offers have been presented, all areas encode the value of the first and second offers in orthogonal subspaces. In this case as well, the orthogonalization provides binding. Our binding-by-subspace hypothesis makes two novel predictions borne out by the data. First, behavioral errors should correlate with putative spatial (but not temporal) misbinding in the neural representation. Second, the specific representational geometry that we observe across animals also indicates that behavioral errors should increase when offers have low or high values, compared to when they have medium values, even when controlling for value difference. Together, these results support the idea that the brain makes use of semi-orthogonal subspaces to bind features together.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
BackTrack: Robust template update via Backward Tracking of candidate template
Authors:
Dongwook Lee,
Wonjun Choi,
Seohyung Lee,
ByungIn Yoo,
Eunho Yang,
Seongju Hwang
Abstract:
Variations of target appearance such as deformations, illumination variance, occlusion, etc., are the major challenges of visual object tracking that negatively impact the performance of a tracker. An effective method to tackle these challenges is template update, which updates the template to reflect the change of appearance in the target object during tracking. However, with template updates, in…
▽ More
Variations of target appearance such as deformations, illumination variance, occlusion, etc., are the major challenges of visual object tracking that negatively impact the performance of a tracker. An effective method to tackle these challenges is template update, which updates the template to reflect the change of appearance in the target object during tracking. However, with template updates, inadequate quality of new templates or inappropriate timing of updates may induce a model drift problem, which severely degrades the tracking performance. Here, we propose BackTrack, a robust and reliable method to quantify the confidence of the candidate template by backward tracking it on the past frames. Based on the confidence score of candidates from BackTrack, we can update the template with a reliable candidate at the right time while rejecting unreliable candidates. BackTrack is a generic template update scheme and is applicable to any template-based trackers. Extensive experiments on various tracking benchmarks verify the effectiveness of BackTrack over existing template update algorithms, as it achieves SOTA performance on various tracking benchmarks.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for Occluded Facial Expression Recognition
Authors:
Isack Lee,
Eungi Lee,
Seok Bong Yoo
Abstract:
Most research on facial expression recognition (FER) is conducted in highly controlled environments, but its performance is often unacceptable when applied to real-world situations. This is because when unexpected objects occlude the face, the FER network faces difficulties extracting facial features and accurately predicting facial expressions. Therefore, occluded FER (OFER) is a challenging prob…
▽ More
Most research on facial expression recognition (FER) is conducted in highly controlled environments, but its performance is often unacceptable when applied to real-world situations. This is because when unexpected objects occlude the face, the FER network faces difficulties extracting facial features and accurately predicting facial expressions. Therefore, occluded FER (OFER) is a challenging problem. Previous studies on occlusion-aware FER have typically required fully annotated facial images for training. However, collecting facial images with various occlusions and expression annotations is time-consuming and expensive. Latent-OFER, the proposed method, can detect occlusions, restore occluded parts of the face as if they were unoccluded, and recognize them, improving FER accuracy. This approach involves three steps: First, the vision transformer (ViT)-based occlusion patch detector masks the occluded position by training only latent vectors from the unoccluded patches using the support vector data description algorithm. Second, the hybrid reconstruction network generates the masking position as a complete image using the ViT and convolutional neural network (CNN). Last, the expression-relevant latent vector extractor retrieves and uses expression-related information from all latent vectors by applying a CNN-based class activation map. This mechanism has a significant advantage in preventing performance degradation from occlusion by unseen objects. The experimental results on several databases demonstrate the superiority of the proposed method over state-of-the-art methods.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Programmable Integrated Photonics for Topological Hamiltonians
Authors:
Mehmet Berkay On,
Farshid Ashtiani,
David Sanchez-Jacome,
Daniel Perez-Lopez,
S. J. Ben Yoo,
Andrea Blanco-Redondo
Abstract:
A variety of topological Hamiltonians have been demonstrated in photonic platforms, leading to fundamental discoveries and enhanced robustness in applications such as lasing, sensing, and quantum technologies. To date, each topological photonic platform implements a specific type of Hamiltonian with inexistent or limited reconfigurability. Here, we propose and demonstrate different topological mod…
▽ More
A variety of topological Hamiltonians have been demonstrated in photonic platforms, leading to fundamental discoveries and enhanced robustness in applications such as lasing, sensing, and quantum technologies. To date, each topological photonic platform implements a specific type of Hamiltonian with inexistent or limited reconfigurability. Here, we propose and demonstrate different topological models by using the same reprogrammable integrated photonics platform, consisting of a hexagonal mesh of silicon Mach-Zehnder interferometers with phase-shifters. We specifically demonstrate a one-dimensional Su-Schrieffer-Heeger Hamiltonian supporting a localized topological edge mode and a higher-order topological insulator based on a two-dimensional breathing Kagome Hamiltonian with three corner states. These results highlight a nearly universal platform for topological models that may fast-track research progress toward applications of topological photonics and other coupled systems.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Impact of Large Language Models on Generating Software Specifications
Authors:
Danning Xie,
Byungwoo Yoo,
Nan Jiang,
Mijung Kim,
Lin Tan,
Xiangyu Zhang,
Judy S. Lee
Abstract:
Software specifications are essential for ensuring the reliability of software systems. Existing specification extraction approaches, however, suffer from limited generalizability and require manual efforts. The recent emergence of Large Language Models (LLMs), which have been successfully applied to numerous software engineering tasks, offers a promising avenue for automating this process. In thi…
▽ More
Software specifications are essential for ensuring the reliability of software systems. Existing specification extraction approaches, however, suffer from limited generalizability and require manual efforts. The recent emergence of Large Language Models (LLMs), which have been successfully applied to numerous software engineering tasks, offers a promising avenue for automating this process. In this paper, we conduct the first empirical study to evaluate the capabilities of LLMs for generating software specifications from software comments or documentation. We evaluate LLMs' performance with Few Shot Learning (FSL), enabling LLMs to generalize from a small number of examples, as well as different prompt construction strategies, and compare the performance of LLMs with traditional approaches. Additionally, we conduct a comparative diagnosis of the failure cases from both LLMs and traditional methods, identifying their unique strengths and weaknesses. Lastly, we conduct extensive experiments on 15 state of the art LLMs, evaluating their performance and cost effectiveness for generating software specifications.
Our results show that with FSL, LLMs outperform traditional methods (by 5.6%), and more sophisticated prompt construction strategies can further enlarge this performance gap (up to 5.1 to 10.0%). Yet, LLMs suffer from their unique challenges, such as ineffective prompts and the lack of domain knowledge, which together account for 53 to 60% of LLM unique failures. The strong performance of open source models (e.g., StarCoder) makes closed source models (e.g., GPT 3 Davinci) less desirable due to size and cost. Our study offers valuable insights for future research to improve specification generation.
△ Less
Submitted 2 October, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Quantum Wrapper Networking
Authors:
S. J. Ben Yoo,
Sandeep Kumar Singh,
Mehmet Berkay On,
Gamze Gul,
Gregory S. Kanter,
Roberto Proietti,
Prem Kumar
Abstract:
We introduce a new concept of Quantum Wrapper Networking, which enables control, management, and operation of quantum networks that can co-exist with classical networks while kee** the requirements for quantum networks intact. The quantum wrapper networks (QWNs) enable the transparent and interoperable transportation of quantum wrapper datagrams consisting of quantum payloads and, notably, class…
▽ More
We introduce a new concept of Quantum Wrapper Networking, which enables control, management, and operation of quantum networks that can co-exist with classical networks while kee** the requirements for quantum networks intact. The quantum wrapper networks (QWNs) enable the transparent and interoperable transportation of quantum wrapper datagrams consisting of quantum payloads and, notably, classical headers to facilitate the datagram switching without measuring or disturbing the qubits of the quantum payload. Furthermore, QWNs can utilize the common network control and management for performance monitoring on the classical header and infer the quantum channel quality.
△ Less
Submitted 30 April, 2023;
originally announced May 2023.
-
Object-Centric Multi-Task Learning for Human Instances
Authors:
Hyeongseok Son,
Sangil Jung,
Solae Lee,
Seongeun Kim,
Seung-In Park,
ByungIn Yoo
Abstract:
Human is one of the most essential classes in visual recognition tasks such as detection, segmentation, and pose estimation. Although much effort has been put into individual tasks, multi-task learning for these three tasks has been rarely studied. In this paper, we explore a compact multi-task network architecture that maximally shares the parameters of the multiple tasks via object-centric learn…
▽ More
Human is one of the most essential classes in visual recognition tasks such as detection, segmentation, and pose estimation. Although much effort has been put into individual tasks, multi-task learning for these three tasks has been rarely studied. In this paper, we explore a compact multi-task network architecture that maximally shares the parameters of the multiple tasks via object-centric learning. To this end, we propose a novel query design to encode the human instance information effectively, called human-centric query (HCQ). HCQ enables for the query to learn explicit and structural information of human as well such as keypoints. Besides, we utilize HCQ in prediction heads of the target tasks directly and also interweave HCQ with the deformable attention in Transformer decoders to exploit a well-learned object-centric representation. Experimental results show that the proposed multi-task network achieves comparable accuracy to state-of-the-art task-specific models in human detection, segmentation, and pose estimation task, while it consumes less computational costs.
△ Less
Submitted 12 March, 2023;
originally announced March 2023.
-
LatentGaze: Cross-Domain Gaze Estimation through Gaze-Aware Analytic Latent Code Manipulation
Authors:
Isack Lee,
Jun-Seok Yun,
Hee Hyeon Kim,
Youngju Na,
Seok Bong Yoo
Abstract:
Although recent gaze estimation methods lay great emphasis on attentively extracting gaze-relevant features from facial or eye images, how to define features that include gaze-relevant components has been ambiguous. This obscurity makes the model learn not only gaze-relevant features but also irrelevant ones. In particular, it is fatal for the cross-dataset performance. To overcome this challengin…
▽ More
Although recent gaze estimation methods lay great emphasis on attentively extracting gaze-relevant features from facial or eye images, how to define features that include gaze-relevant components has been ambiguous. This obscurity makes the model learn not only gaze-relevant features but also irrelevant ones. In particular, it is fatal for the cross-dataset performance. To overcome this challenging issue, we propose a gaze-aware analytic manipulation method, based on a data-driven approach with generative adversarial network inversion's disentanglement characteristics, to selectively utilize gaze-relevant features in a latent code. Furthermore, by utilizing GAN-based encoder-generator process, we shift the input image from the target domain to the source domain image, which a gaze estimator is sufficiently aware. In addition, we propose gaze distortion loss in the encoder that prevents the distortion of gaze information. The experimental results demonstrate that our method achieves state-of-the-art gaze estimation accuracy in a cross-domain gaze estimation tasks. This code is available at https://github.com/leeisack/LatentGaze/.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
HAZE-Net: High-Frequency Attentive Super-Resolved Gaze Estimation in Low-Resolution Face Images
Authors:
Jun-Seok Yun,
Youngju Na,
Hee Hyeon Kim,
Hyung-Il Kim,
Seok Bong Yoo
Abstract:
Although gaze estimation methods have been developed with deep learning techniques, there has been no such approach as aim to attain accurate performance in low-resolution face images with a pixel width of 50 pixels or less. To solve a limitation under the challenging low-resolution conditions, we propose a high-frequency attentive super-resolved gaze estimation network, i.e., HAZE-Net. Our networ…
▽ More
Although gaze estimation methods have been developed with deep learning techniques, there has been no such approach as aim to attain accurate performance in low-resolution face images with a pixel width of 50 pixels or less. To solve a limitation under the challenging low-resolution conditions, we propose a high-frequency attentive super-resolved gaze estimation network, i.e., HAZE-Net. Our network improves the resolution of the input image and enhances the eye features and those boundaries via a proposed super-resolution module based on a high-frequency attention block. In addition, our gaze estimation module utilizes high-frequency components of the eye as well as the global appearance map. We also utilize the structural location information of faces to approximate head pose. The experimental results indicate that the proposed method exhibits robust gaze estimation performance even in low-resolution face images with 28x28 pixels. The source code of this work is available at https://github.com/dbseorms16/HAZE_Net/.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
New Trends in Photonic Switching and Optical Network Architecture for Data Centre and Computing Systems
Authors:
S. J. Ben Yoo
Abstract:
AI/ML for data centres and data centres for AI/ML are defining new trends in cloud computing. Disaggregated heterogeneous reconfigurable computing systems realized by photonic interconnects and photonic switching expect greatly enhanced throughput and energy-efficiency for AI/ML workloads, especially when aided by an AI/ML control plane.
AI/ML for data centres and data centres for AI/ML are defining new trends in cloud computing. Disaggregated heterogeneous reconfigurable computing systems realized by photonic interconnects and photonic switching expect greatly enhanced throughput and energy-efficiency for AI/ML workloads, especially when aided by an AI/ML control plane.
△ Less
Submitted 19 September, 2022;
originally announced September 2022.
-
Scalable Nanophotonic-Electronic Spiking Neural Networks
Authors:
Luis El Srouji,
Yun-Jhu Lee,
Mehmet Berkay On,
Li Zhang,
S. J. Ben Yoo
Abstract:
Spiking neural networks (SNN) provide a new computational paradigm capable of highly parallelized, real-time processing. Photonic devices are ideal for the design of high-bandwidth, parallel architectures matching the SNN computational paradigm. Co-integration of CMOS and photonic elements allow low-loss photonic devices to be combined with analog electronics for greater flexibility of nonlinear c…
▽ More
Spiking neural networks (SNN) provide a new computational paradigm capable of highly parallelized, real-time processing. Photonic devices are ideal for the design of high-bandwidth, parallel architectures matching the SNN computational paradigm. Co-integration of CMOS and photonic elements allow low-loss photonic devices to be combined with analog electronics for greater flexibility of nonlinear computational elements. As such, we designed and simulated an optoelectronic spiking neuron circuit on a monolithic silicon photonics (SiPh) process that replicates useful spiking behaviors beyond the leaky integrate-and-fire (LIF). Additionally, we explored two learning algorithms with the potential for on-chip learning using Mach-Zehnder Interferometric (MZI) meshes as synaptic interconnects. A variation of Random Backpropagation (RPB) was experimentally demonstrated on-chip and matched the performance of a standard linear regression on a simple classification task. Meanwhile, the Contrastive Hebbian Learning (CHL) rule was applied to a simulated neural network composed of MZI meshes for a random input-output map** task. The CHL-trained MZI network performed better than random guessing but does not match the performance of the ideal neural network (without the constraints imposed by the MZI meshes). Through these efforts, we demonstrate that co-integrated CMOS and SiPh technologies are well-suited to the design of scalable SNN computing architectures.
△ Less
Submitted 28 August, 2022;
originally announced August 2022.
-
Subspace orthogonalization as a mechanism for binding values to space
Authors:
W. Jeffrey Johnston,
Justin M. Fine,
Seng Bum Michael Yoo,
R. Becket Ebitz,
Benjamin Y. Hayden
Abstract:
When choosing between options, we must solve an important binding problem. The values of the options must be associated with information about the action needed to select them. We hypothesize that the brain solves this binding problem through use of distinct population subspaces. To test this hypothesis, we examined the responses of single neurons in five reward-sensitive regions in rhesus macaque…
▽ More
When choosing between options, we must solve an important binding problem. The values of the options must be associated with information about the action needed to select them. We hypothesize that the brain solves this binding problem through use of distinct population subspaces. To test this hypothesis, we examined the responses of single neurons in five reward-sensitive regions in rhesus macaques performing a risky choice task. In all areas, neurons encoded the value of the offers presented on both the left and the right side of the display in semi-orthogonal subspaces, which served to bind the values of the two offers to their positions in space. Supporting the idea that this orthogonalization is functionally meaningful, we observed a session-to-session covariation between choice behavior and the orthogonalization of the two value subspaces: trials with less orthogonalized subspaces were associated with greater likelihood of choosing the less valued option. Further inspection revealed that these semi-orthogonal subspaces arose from a combination of linear and nonlinear mixed selectivity in the neural population. We show this combination of selectivity balances reliable binding with an ability to generalize value across different spatial locations. These results support the hypothesis that semi-orthogonal subspaces support reliable binding, which is essential to flexible behavior in the face of multiple options.
△ Less
Submitted 2 February, 2023; v1 submitted 13 May, 2022;
originally announced May 2022.
-
Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
Authors:
Yi Zhou,
Hui Zhang,
Hana Lee,
Shuyang Sun,
**jun Li,
Yangguang Zhu,
ByungIn Yoo,
Xiaojuan Qi,
Jae-Joon Han
Abstract:
Video Panoptic Segmentation (VPS) aims at assigning a class label to each pixel, uniquely segmenting and identifying all object instances consistently across all frames. Classic solutions usually decompose the VPS task into several sub-tasks and utilize multiple surrogates (e.g. boxes and masks, centres and offsets) to represent objects. However, this divide-and-conquer strategy requires complex p…
▽ More
Video Panoptic Segmentation (VPS) aims at assigning a class label to each pixel, uniquely segmenting and identifying all object instances consistently across all frames. Classic solutions usually decompose the VPS task into several sub-tasks and utilize multiple surrogates (e.g. boxes and masks, centres and offsets) to represent objects. However, this divide-and-conquer strategy requires complex post-processing in both spatial and temporal domains and is vulnerable to failures from surrogate tasks. In this paper, inspired by object-centric learning which learns compact and robust object representations, we present Slot-VPS, the first end-to-end framework for this task. We encode all panoptic entities in a video, including both foreground instances and background semantics, with a unified representation called panoptic slots. The coherent spatio-temporal object's information is retrieved and encoded into the panoptic slots by the proposed Video Panoptic Retriever, enabling it to localize, segment, differentiate, and associate objects in a unified manner. Finally, the output panoptic slots can be directly converted into the class, mask, and object ID of panoptic objects in the video. We conduct extensive ablation studies and demonstrate the effectiveness of our approach on two benchmark datasets, Cityscapes-VPS (\textit{val} and test sets) and VIPER (\textit{val} set), achieving new state-of-the-art performance of 63.7, 63.3 and 56.2 VPQ, respectively.
△ Less
Submitted 16 December, 2021;
originally announced December 2021.
-
Review Of Integrated Photonic Elastic WDM Switches For Data Centers
Authors:
Akhilesh S P Khope,
Anirban Samanta,
Xian Xiao,
Ben Yoo,
John E Bowers
Abstract:
In this review paper, we present an elaborate discussion on wavelength selective switches and their demonstrations. We also review packaging and electronic photonic integration of switches; a topic neglected in other review papers. We also cover wavelength locking which is paramount in switching networks with many tunable filters.
In this review paper, we present an elaborate discussion on wavelength selective switches and their demonstrations. We also review packaging and electronic photonic integration of switches; a topic neglected in other review papers. We also cover wavelength locking which is paramount in switching networks with many tunable filters.
△ Less
Submitted 23 May, 2021;
originally announced May 2021.
-
Izhikevich-Inspired Optoelectronic Neurons with Excitatory and Inhibitory Inputs for Energy-Efficient Photonic Spiking Neural Networks
Authors:
Yun-jhu Lee,
Mehmet Berkay On,
Xian Xiao,
Roberto Proietti,
S. J. Ben Yoo
Abstract:
We designed, prototyped, and experimentally demonstrated, for the first time to our knowledge, an optoelectronic spiking neuron inspired by the Izhikevich model incorporating both excitatory and inhibitory optical spiking inputs and producing optical spiking outputs accordingly. The optoelectronic neurons consist of three transistors acting as electrical spiking circuits, a vertical-cavity surface…
▽ More
We designed, prototyped, and experimentally demonstrated, for the first time to our knowledge, an optoelectronic spiking neuron inspired by the Izhikevich model incorporating both excitatory and inhibitory optical spiking inputs and producing optical spiking outputs accordingly. The optoelectronic neurons consist of three transistors acting as electrical spiking circuits, a vertical-cavity surface-emitting laser (VCSEL) for optical spiking outputs, and two photodetectors for excitatory and inhibitory optical spiking inputs. Additional inclusion of capacitors and resistors complete the Izhikevich-inspired optoelectronic neurons, which receive excitatory and inhibitory optical spikes as inputs from other optoelectronic neurons. We developed a detailed optoelectronic neuron model in Verilog-A and simulated the circuit-level operation of various cases with excitatory input and inhibitory input signals. The experimental results closely resemble the simulated results and demonstrate how the excitatory inputs trigger the optical spiking outputs while the inhibitory inputs suppress the outputs. Utilizing the simulated neuron model, we conducted simulations using fully connected (FC) and convolutional neural networks (CNN). The simulation results using MNIST handwritten digits recognition show 90% accuracy on unsupervised learning and 97% accuracy on a supervised modified FC neural network. We further designed a nanoscale optoelectronic neuron utilizing quantum impedance conversion where a 200 aJ/spike input can trigger the output from on-chip nanolasers with 10 fJ/spike. The nanoscale neuron can support a fanout of ~80 or overcome 19 dB excess optical loss while running at 10 GSpikes/second in the neural network, which corresponds to 100x throughput and 1000x energy-efficiency improvement compared to state-of-art electrical neuromorphic hardware such as Loihi and NeuroGrid.
△ Less
Submitted 2 May, 2021;
originally announced May 2021.
-
A reproducibility study of "Augmenting Genetic Algorithms with Deep Neural Networks for Exploring the Chemical Space"
Authors:
Kevin Maik Jablonka,
Fergus Mcilwaine,
Susana Garcia,
Berend Smit,
Brian Yoo
Abstract:
Nigam et al. reported a genetic algorithm (GA) utilizing the SELFIES representation and also propose an adaptive, neural network-based penalty that is supposed to improve the diversity of the generated molecules. The main claims of the paper are that this GA outperforms other generative techniques (as measured by the penalized logP) and that a neural network-based adaptive penalty increases the di…
▽ More
Nigam et al. reported a genetic algorithm (GA) utilizing the SELFIES representation and also propose an adaptive, neural network-based penalty that is supposed to improve the diversity of the generated molecules. The main claims of the paper are that this GA outperforms other generative techniques (as measured by the penalized logP) and that a neural network-based adaptive penalty increases the diversity of the generated molecules. In this work, we investigated the reproducibility of their claims. Overall, we were able to reproduce comparable results using the SELFIES-based GA, but mostly by exploiting deficiencies of the (easily optimizable) fitness function (i.e., generating long, sulfur containing chains). In addition, we reproduce results showing that the discriminator can be used to bias the generation of molecules to ones that are similar to the reference set. Lastly, we attempted to quantify the evolution of the diversity, understand the influence of some hyperparameters, and propose improvements to the adaptive penalty.
△ Less
Submitted 10 February, 2021; v1 submitted 1 February, 2021;
originally announced February 2021.
-
Intra-Domain Task-Adaptive Transfer Learning to Determine Acute Ischemic Stroke Onset Time
Authors:
Haoyue Zhang,
Jennifer S Polson,
Kambiz Nael,
Noriko Salamon,
Bryan Yoo,
Suzie El-Saden,
Fabien Scalzo,
William Speier,
Corey W Arnold
Abstract:
Treatment of acute ischemic strokes (AIS) is largely contingent upon the time since stroke onset (TSS). However, TSS may not be readily available in up to 25% of patients with unwitnessed AIS. Current clinical guidelines for patients with unknown TSS recommend the use of MRI to determine eligibility for thrombolysis, but radiology assessments have high inter-reader variability. In this work, we pr…
▽ More
Treatment of acute ischemic strokes (AIS) is largely contingent upon the time since stroke onset (TSS). However, TSS may not be readily available in up to 25% of patients with unwitnessed AIS. Current clinical guidelines for patients with unknown TSS recommend the use of MRI to determine eligibility for thrombolysis, but radiology assessments have high inter-reader variability. In this work, we present deep learning models that leverage MRI diffusion series to classify TSS based on clinically validated thresholds. We propose an intra-domain task-adaptive transfer learning method, which involves training a model on an easier clinical task (stroke detection) and then refining the model with different binary thresholds of TSS. We apply this approach to both 2D and 3D CNN architectures with our top model achieving an ROC-AUC value of 0.74, with a sensitivity of 0.70 and a specificity of 0.81 for classifying TSS < 4.5 hours. Our pretrained models achieve better classification metrics than the models trained from scratch, and these metrics exceed those of previously published models applied to our dataset. Furthermore, our pipeline accommodates a more inclusive patient cohort than previous work, as we did not exclude imaging studies based on clinical, demographic, or image processing criteria. When applied to this broad spectrum of patients, our deep learning model achieves an overall accuracy of 75.78% when classifying TSS < 4.5 hours, carrying potential therapeutic implications for patients with unknown TSS.
△ Less
Submitted 5 November, 2020;
originally announced November 2020.
-
Optical Sha** of Plasma Cavity for Controlled Laser Wakefield Acceleration
Authors:
Bobbili Sanyasi Rao,
Myung Hoon Cho,
Hyung Taek Kim,
Jung Hun Shin,
Kyung Hwan Oh,
Jong Ho Jeon,
Byung Ju Yoo,
Seong Ha Cho,
Seong Ku Lee,
Chang Hee Nam
Abstract:
Laser wakefield accelerators rely on relativistically moving micron-sized plasma cavities that provide extremely high electric field >100GV/m. Here, we demonstrate transverse sha** of the plasma cavity to produce controlled sub-GeV electron beams, adopting laser pulses with an axially rotatable ellipse-shaped focal spot. We showed the control capability on electron self-injection, charge, and tr…
▽ More
Laser wakefield accelerators rely on relativistically moving micron-sized plasma cavities that provide extremely high electric field >100GV/m. Here, we demonstrate transverse sha** of the plasma cavity to produce controlled sub-GeV electron beams, adopting laser pulses with an axially rotatable ellipse-shaped focal spot. We showed the control capability on electron self-injection, charge, and transverse profile of the electron beam by rotating the focal spot. We observed that the effect of the elliptical focal spot was imprinted in the profiles of the electron beams and the electron energy increased, as compared to the case of a circular focal spot. We performed 3D particle-in-cell (PIC) simulations which reproduced the experimental results and revealed dynamics of a new asymmetric self-injection process. This simple scheme offers a novel control method on laser wakefield acceleration to produce tailored electron beams and x-rays for various applications.
△ Less
Submitted 21 September, 2020; v1 submitted 2 July, 2020;
originally announced July 2020.
-
Joint Learning of Generative Translator and Classifier for Visually Similar Classes
Authors:
ByungIn Yoo,
Tristan Sylvain,
Yoshua Bengio,
Junmo Kim
Abstract:
In this paper, we propose a Generative Translation Classification Network (GTCN) for improving visual classification accuracy in settings where classes are visually similar and data is scarce. For this purpose, we propose joint learning from a scratch to train a classifier and a generative stochastic translation network end-to-end. The translation network is used to perform on-line data augmentati…
▽ More
In this paper, we propose a Generative Translation Classification Network (GTCN) for improving visual classification accuracy in settings where classes are visually similar and data is scarce. For this purpose, we propose joint learning from a scratch to train a classifier and a generative stochastic translation network end-to-end. The translation network is used to perform on-line data augmentation across classes, whereas previous works have mostly involved domain adaptation. To help the model further benefit from this data-augmentation, we introduce an adaptive fade-in loss and a quadruplet loss. We perform experiments on multiple datasets to demonstrate the proposed method's performance in varied settings. Of particular interest, training on 40% of the dataset is enough for our model to surpass the performance of baselines trained on the full dataset. When our architecture is trained on the full dataset, we achieve comparable performance with state-of-the-art methods despite using a light-weight architecture.
△ Less
Submitted 2 December, 2020; v1 submitted 15 December, 2019;
originally announced December 2019.
-
Quantum Networks For Open Science
Authors:
Thomas Ndousse-Fetter,
Nicholas Peters,
Warren Grice,
Prem Kumar,
Tom Chapuran,
Saikat Guha,
Scott Hamilton,
Inder Monga,
Ray Newell,
Andrei Nomerotski,
Don Towsley,
Ben Yoo
Abstract:
The United States Department of Energy convened the Quantum Networks for Open Science (QNOS) Workshop in September 2018. The workshop was primarily focused on quantum networks optimized for scientific applications with the expectation that the resulting quantum networks could be extended to lay the groundwork for a generalized network that will evolve into a quantum internet.
The United States Department of Energy convened the Quantum Networks for Open Science (QNOS) Workshop in September 2018. The workshop was primarily focused on quantum networks optimized for scientific applications with the expectation that the resulting quantum networks could be extended to lay the groundwork for a generalized network that will evolve into a quantum internet.
△ Less
Submitted 27 March, 2019;
originally announced October 2019.
-
A Generalized and Robust Method Towards Practical Gaze Estimation on Smart Phone
Authors:
Tianchu Guo,
Yongchao Liu,
Hui Zhang,
Xiabing Liu,
Youngjun Kwak,
Byung In Yoo,
Jae-Joon Han,
Changkyu Choi
Abstract:
Gaze estimation for ordinary smart phone, e.g. estimating where the user is looking at on the phone screen, can be applied in various applications. However, the widely used appearance-based CNN methods still have two issues for practical adoption. First, due to the limited dataset, gaze estimation is very likely to suffer from over-fitting, leading to poor accuracy at run time. Second, the current…
▽ More
Gaze estimation for ordinary smart phone, e.g. estimating where the user is looking at on the phone screen, can be applied in various applications. However, the widely used appearance-based CNN methods still have two issues for practical adoption. First, due to the limited dataset, gaze estimation is very likely to suffer from over-fitting, leading to poor accuracy at run time. Second, the current methods are usually not robust, i.e. their prediction results having notable jitters even when the user is performing gaze fixation, which degrades user experience greatly. For the first issue, we propose a new tolerant and talented (TAT) training scheme, which is an iterative random knowledge distillation framework enhanced with cosine similarity pruning and aligned orthogonal initialization. The knowledge distillation is a tolerant teaching process providing diverse and informative supervision. The enhanced pruning and initialization is a talented learning process prompting the network to escape from the local minima and re-born from a better start. For the second issue, we define a new metric to measure the robustness of gaze estimator, and propose an adversarial training based Disturbance with Ordinal loss (DwO) method to improve it. The experimental results show that our TAT method achieves state-of-the-art performance on GazeCapture dataset, and that our DwO method improves the robustness while kee** comparable accuracy.
△ Less
Submitted 16 October, 2019;
originally announced October 2019.
-
EigenRank by Committee: A Data Subset Selection and Failure Prediction paradigm for Robust Deep Learning based Medical Image Segmentation
Authors:
Bilwaj Gaonkar,
Joel Beckett,
Mark Attiah,
Christine Ahn,
Matthew Edwards,
Bayard Wilson,
Azim Laiwalla,
Banafsheh Salehi,
Bryan Yoo,
Alex Bui,
Luke Macyszyn
Abstract:
Translation of fully automated deep learning based medical image segmentation technologies to clinical workflows face two main algorithmic challenges. The first, is the collection and archival of large quantities of manually annotated ground truth data for both training and validation. The second is the relative inability of the majority of deep learning based segmentation techniques to alert phys…
▽ More
Translation of fully automated deep learning based medical image segmentation technologies to clinical workflows face two main algorithmic challenges. The first, is the collection and archival of large quantities of manually annotated ground truth data for both training and validation. The second is the relative inability of the majority of deep learning based segmentation techniques to alert physicians to a likely segmentation failure. Here we propose a novel algorithm, named `Eigenrank' which addresses both of these challenges. Eigenrank can select for manual labeling, a subset of medical images from a large database, such that a U-Net trained on this subset is superior to one trained on a randomly selected subset of the same size. Eigenrank can also be used to pick out, cases in a large database, where deep learning segmentation will fail. We present our algorithm, followed by results and a discussion of how Eigenrank exploits the Von Neumann information to perform both data subset selection and failure prediction for medical image segmentation using deep learning.
△ Less
Submitted 18 January, 2021; v1 submitted 17 August, 2019;
originally announced August 2019.
-
DeepRMSA: A Deep Reinforcement Learning Framework for Routing, Modulation and Spectrum Assignment in Elastic Optical Networks
Authors:
Xiaoliang Chen,
Baojia Li,
Roberto Proietti,
Hongbo Lu,
Zuqing Zhu,
S. J. Ben Yoo
Abstract:
This paper proposes DeepRMSA, a deep reinforcement learning framework for routing, modulation and spectrum assignment (RMSA) in elastic optical networks (EONs). DeepRMSA learns the correct online RMSA policies by parameterizing the policies with deep neural networks (DNNs) that can sense complex EON states. The DNNs are trained with experiences of dynamic lightpath provisioning. We first modify th…
▽ More
This paper proposes DeepRMSA, a deep reinforcement learning framework for routing, modulation and spectrum assignment (RMSA) in elastic optical networks (EONs). DeepRMSA learns the correct online RMSA policies by parameterizing the policies with deep neural networks (DNNs) that can sense complex EON states. The DNNs are trained with experiences of dynamic lightpath provisioning. We first modify the asynchronous advantage actor-critic algorithm and present an episode-based training mechanism for DeepRMSA, namely, DeepRMSA-EP. DeepRMSA-EP divides the dynamic provisioning process into multiple episodes (each containing the servicing of a fixed number of lightpath requests) and performs training by the end of each episode. The optimization target of DeepRMSA-EP at each step of servicing a request is to maximize the cumulative reward within the rest of the episode. Thus, we obviate the need for estimating the rewards related to unknown future states. To overcome the instability issue in the training of DeepRMSA-EP due to the oscillations of cumulative rewards, we further propose a window-based flexible training mechanism, i.e., DeepRMSA-FLX. DeepRMSA-FLX attempts to smooth out the oscillations by defining the optimization scope at each step as a sliding window, and ensuring that the cumulative rewards always include rewards from a fixed number of requests. Evaluations with the two sample topologies show that DeepRMSA-FLX can effectively stabilize the training while achieving blocking probability reductions of more than 20.3% and 14.3%, when compared with the baselines.
△ Less
Submitted 15 May, 2019; v1 submitted 6 May, 2019;
originally announced May 2019.
-
Proof-of-principle experiment for nanoparticle-assisted laser wakefield acceleration
Authors:
Constantin Aniculaesei,
Vishwa Bandhu Pathak,
Kyung Hwan Oh,
Calin Ioan Hojbota,
Prashant Kumar Singh,
Bo Ram Lee,
Hyung Taek Kim,
Enrico Brunetti,
Byung Ju Yoo,
Jae Hee Sung,
Seong Ku Lee,
Chang Hee Nam
Abstract:
In the present work, we demonstrate for the first time a proof-of-principle experiment for nanoparticle-assisted laser wakefield acceleration. The nanoparticles, generated through laser ablation of aluminium, were introduced into the plasma and used to trigger the injection of electrons into the nonlinear plasma wake excited by a high power femtosecond laser. In this experiment, a significant enha…
▽ More
In the present work, we demonstrate for the first time a proof-of-principle experiment for nanoparticle-assisted laser wakefield acceleration. The nanoparticles, generated through laser ablation of aluminium, were introduced into the plasma and used to trigger the injection of electrons into the nonlinear plasma wake excited by a high power femtosecond laser. In this experiment, a significant enhancement of the electron beam energy, energy spread and divergence is obtained compared with the case when electrons are self-injected. The best quality electron bunches presented peak energy up to 338 MeV with a relative energy spread of 4.7% and vertical divergence of 5.9 mrad. This method can be further improved by adding an aerodynamic lens system, for instance, which would control the nanoparticle size, density, material and injection position thus allowing accurate control of the laser wakefield accelerator.
△ Less
Submitted 9 February, 2019; v1 submitted 3 February, 2019;
originally announced February 2019.
-
Electron energy increase in a laser wakefield accelerator using longitudinally shaped plasma density profiles
Authors:
Constantin Aniculaesei,
Vishwa Bandhu Pathak,
Hyung Taek Kim,
Kyung Hwan Oh,
Byung Ju Yoo,
Enrico Brunetti,
Yong Ha Jang,
Calin Ioan Hojbota,
Junghun Shin,
Jeong Ho Jeon,
Seongha Cho,
Myung Hoon Cho,
Jae Hee Sung,
Seong Ku Lee,
Björn Manuel Hegelich,
Chang Hee Nam
Abstract:
The phase velocity of the wakefield of a laser wakefield accelerator can, theoretically, be manipulated by sha** the longitudinal plasma density profile, thus controlling the parameters of the generated electron beam. We present an experimental method where using a series of shaped longitudinal plasma density profiles we increased the mean electron peak energy by more than 50%, from 174.8 +/- 1.…
▽ More
The phase velocity of the wakefield of a laser wakefield accelerator can, theoretically, be manipulated by sha** the longitudinal plasma density profile, thus controlling the parameters of the generated electron beam. We present an experimental method where using a series of shaped longitudinal plasma density profiles we increased the mean electron peak energy by more than 50%, from 174.8 +/- 1.3 MeV to 262 +/- 9.7 MeV and the maximum peak energy from 182.1 MeV to 363.1 MeV. The divergence follows closely the change of mean energy and decreases from 58.95 +/- 0.45 mrad to 12.63 +/- 1.17 mrad along the horizontal axis and from 35.23 +/- 0.27 mrad to 8.26 +/- 0.69 mrad along the vertical axis. Particle-in-cell simulations show that a ramp in a plasma density profile can affect the evolution of the wakefield, thus qualitatively confirming the experimental results. The presented method can increase the electron energy for a fixed laser power and at the same time offer an energy tunable source of electrons.
△ Less
Submitted 8 September, 2018;
originally announced September 2018.
-
Optical Manipulation of Magnetic Vortex Visualized in situ by 4D Electron Microscopy
Authors:
Xuewen Fu,
Shawn D. Pollard,
Bin Chen,
Byung-Kuk Yoo,
Hyunsoo Yang,
Yimei Zhu
Abstract:
Understanding the fundamental dynamics of topological vortex and antivortex naturally formed in micro/nanoscale ferromagnetic building blocks under external perturbations is crucial to magnetic vortex based information processing and spintronic devices. All previous studies have focused on magnetic vortex-core switching via external magnetic fields, spin-polarized currents, or spin waves, which ha…
▽ More
Understanding the fundamental dynamics of topological vortex and antivortex naturally formed in micro/nanoscale ferromagnetic building blocks under external perturbations is crucial to magnetic vortex based information processing and spintronic devices. All previous studies have focused on magnetic vortex-core switching via external magnetic fields, spin-polarized currents, or spin waves, which have largely prohibited the investigation of novel spin configurations that could emerge from the ground states in ferromagnetic disks and their underlying dynamics. Here, we report in situ visualization of femtosecond laser quenching induced magnetic vortex change in various symmetric ferromagnetic Permalloy disks by Lorentz phase imaging using 4D electron microscopy. Besides the switching of magnetic vortex chirality and polarity, we observed with distinct occurrence frequencies a plenitude of complex magnetic structures that have never been observed by magnetic field or current assisted switching. These complex magnetic structures consist of a number of newly created topological magnetic defects (vortex and antivortex) strictly conserving the topological winding number, demonstrating the direct impact of topological invariant on the magnetization dynamics in ferromagnetic disks. Their spin configurations show mirror or rotation symmetry due to the geometrical confinement of the disks. Combined micromagnetic simulations with the experimental observations reveal the underlying magnetization dynamics and formation mechanism of the optical quenching induced complex magnetic structures. Their distinct occurrence rates are pertinent to their formation-growth energetics and pinning effects at the disk edge. Based on these findings, we propose a paradigm of optical-quenching-assisted fast switching of vortex cores for the control of magnetic vortex based information recording and spintronic devices.
△ Less
Submitted 19 December, 2017;
originally announced December 2017.
-
Novel gas target for Laser Wakefield Accelerators
Authors:
Constantin Aniculaesei,
Hyung Taek Kim,
Byeong Ju Yoo,
Kyung Hwan Oh,
Chang Hee Nam
Abstract:
A novel gas target for interactions between high power lasers and gaseous medium, especially for laser wakefield accelerators, has been designed, manufactured and characterized. The gas target has been designed to provide a uniform density profile along the central gas cell axis by combining a gas cell and slit nozzle. The gas density can be tuned from 10^16 atoms/cm^3 to 10^19 atoms/cm^3 and the…
▽ More
A novel gas target for interactions between high power lasers and gaseous medium, especially for laser wakefield accelerators, has been designed, manufactured and characterized. The gas target has been designed to provide a uniform density profile along the central gas cell axis by combining a gas cell and slit nozzle. The gas density can be tuned from 10^16 atoms/cm^3 to 10^19 atoms/cm^3 and the gas target length can be varied from 0 to 10 cm, both changes can be made simultaneously while kee** the flat-top gas profile. The gas distributions inside the gas cell have been measured using interferometry and validated using computational fluid dynamics.
△ Less
Submitted 3 September, 2017;
originally announced September 2017.
-
Deep generative-contrastive networks for facial expression recognition
Authors:
Youngsung Kim,
ByungIn Yoo,
Youngjun Kwak,
Changkyu Choi,
Junmo Kim
Abstract:
As the expressive depth of an emotional face differs with individuals or expressions, recognizing an expression using a single facial image at a moment is difficult. A relative expression of a query face compared to a reference face might alleviate this difficulty. In this paper, we propose to utilize contrastive representation that embeds a distinctive expressive factor for a discriminative purpo…
▽ More
As the expressive depth of an emotional face differs with individuals or expressions, recognizing an expression using a single facial image at a moment is difficult. A relative expression of a query face compared to a reference face might alleviate this difficulty. In this paper, we propose to utilize contrastive representation that embeds a distinctive expressive factor for a discriminative purpose. The contrastive representation is calculated at the embedding layer of deep networks by comparing a given (query) image with the reference image. We attempt to utilize a generative reference image that is estimated based on the given image. Consequently, we deploy deep neural networks that embed a combination of a generative model, a contrastive model, and a discriminative model with an end-to-end training manner. In our proposed networks, we attempt to disentangle a facial expressive factor in two steps including learning of a generator network and a contrastive encoder network. We conducted extensive experiments on publicly available face expression databases (CK+, MMI, Oulu-CASIA, and in-the-wild databases) that have been widely adopted in the recent literatures. The proposed method outperforms the known state-of-the art methods in terms of the recognition accuracy.
△ Less
Submitted 8 May, 2019; v1 submitted 21 March, 2017;
originally announced March 2017.
-
A New Benchmark For Evaluation Of Graph-Theoretic Algorithms
Authors:
Andy B. Yoo,
Yang Liu,
Sheila Vaidya,
Stephen Poole
Abstract:
We propose a new graph-theoretic benchmark in this paper. The benchmark is developed to address shortcomings of an existing widely-used graph benchmark. We thoroughly studied a large number of traditional and contemporary graph algorithms reported in the literature to have clear understanding of their algorithmic and run-time characteristics. Based on this study, we designed a suite of kernels, e…
▽ More
We propose a new graph-theoretic benchmark in this paper. The benchmark is developed to address shortcomings of an existing widely-used graph benchmark. We thoroughly studied a large number of traditional and contemporary graph algorithms reported in the literature to have clear understanding of their algorithmic and run-time characteristics. Based on this study, we designed a suite of kernels, each of which represents a specific class of graph algorithms. The kernels are designed to capture the typical run-time behavior of target algorithms accurately, while limiting computational and spatial overhead to ensure its computation finishes in reasonable time. We expect that the developed benchmark will serve as a much needed tool for evaluating different architectures and programming models to run graph algorithms.
△ Less
Submitted 5 May, 2010;
originally announced May 2010.