-
Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities
Authors:
Delfina Sol Martinez Pandiani,
Erik Tjong Kim Sang,
Davide Ceolin
Abstract:
Internet memes, channels for humor, social commentary, and cultural expression, are increasingly used to spread toxic messages. Studies on the computational analyses of toxic memes have significantly grown over the past five years, and the only three surveys on computational toxic meme analysis cover only work published until 2022, leading to inconsistent terminology and unexplored trends. Our wor…
▽ More
Internet memes, channels for humor, social commentary, and cultural expression, are increasingly used to spread toxic messages. Studies on the computational analyses of toxic memes have significantly grown over the past five years, and the only three surveys on computational toxic meme analysis cover only work published until 2022, leading to inconsistent terminology and unexplored trends. Our work fills this gap by surveying content-based computational perspectives on toxic memes, and reviewing key developments until early 2024. Employing the PRISMA methodology, we systematically extend the previously considered papers, achieving a threefold result. First, we survey 119 new papers, analyzing 158 computational works focused on content-based toxic meme analysis. We identify over 30 datasets used in toxic meme analysis and examine their labeling systems. Second, after observing the existence of unclear definitions of meme toxicity in computational works, we introduce a new taxonomy for categorizing meme toxicity types. We also note an expansion in computational tasks beyond the simple binary classification of memes as toxic or non-toxic, indicating a shift towards achieving a nuanced comprehension of toxicity. Third, we identify three content-based dimensions of meme toxicity under automatic study: target, intent, and conveyance tactics. We develop a framework illustrating the relationships between these dimensions and meme toxicities. The survey analyzes key challenges and recent trends, such as enhanced cross-modal reasoning, integrating expert and cultural knowledge, the demand for automatic toxicity explanations, and handling meme toxicity in low-resource languages. Also, it notes the rising use of Large Language Models (LLMs) and generative AI for detecting and generating toxic memes. Finally, it proposes pathways for advancing toxic meme detection and interpretation.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
REE-HDSC: Recognizing Extracted Entities for the Historical Database Suriname Curacao
Authors:
Erik Tjong Kim Sang
Abstract:
We describe the project REE-HDSC and outline our efforts to improve the quality of named entities extracted automatically from texts generated by hand-written text recognition (HTR) software. We describe a six-step processing pipeline and test it by processing 19th and 20th century death certificates from the civil registry of Curacao. We find that the pipeline extracts dates with high precision b…
▽ More
We describe the project REE-HDSC and outline our efforts to improve the quality of named entities extracted automatically from texts generated by hand-written text recognition (HTR) software. We describe a six-step processing pipeline and test it by processing 19th and 20th century death certificates from the civil registry of Curacao. We find that the pipeline extracts dates with high precision but that the precision of person name extraction is low. Next we show how name precision extraction can be improved by retraining HTR models with names, post-processing and by identifying and removing incorrect names.
△ Less
Submitted 5 April, 2024; v1 submitted 19 December, 2023;
originally announced January 2024.
-
Molecular Autonomous Pathfinder using Deep Reinforcement Learning
Authors:
Ken-ichi Nomura,
Ankit Mishra,
Tian Sang,
Rajiv K. Kalia,
Aiichiro Nakano,
Priya Vashishta
Abstract:
Diffusion in solids is a slow process that dictates rate-limiting processes in key chemical reactions. Unlike crystalline solids that offer well-defined diffusion pathways, the lack of similar structural motifs in amorphous or glassy materials poses a great scientific challenge in estimating slow diffusion time. To tackle this problem, we have developed an AI-guided long-time atomistic simulation…
▽ More
Diffusion in solids is a slow process that dictates rate-limiting processes in key chemical reactions. Unlike crystalline solids that offer well-defined diffusion pathways, the lack of similar structural motifs in amorphous or glassy materials poses a great scientific challenge in estimating slow diffusion time. To tackle this problem, we have developed an AI-guided long-time atomistic simulation approach: Molecular Autonomous Pathfinder (MAP) framework based on Deep Reinforcement Learning (RL), where RL agent is trained to uncover energy efficient diffusion pathways. We employ Deep Q-Network architecture with distributed prioritized replay buffer enabling fully online agent training with accelerated experience sampling by an ensemble of asynchronous agents. After training, the agents provide atomistic configurations of diffusion pathways with their energy profile. We use a piecewise Nudged Elastic Band to refine the energy profile of the obtained pathway and corresponding diffusion time on the basis of transition state theory. With MAP, we have successfully identified atomistic mechanisms along molecular diffusion pathways in amorphous silica, with time scales comparable to experiments.
△ Less
Submitted 8 December, 2023;
originally announced December 2023.
-
An MRL-Based Design Solution for RIS-Assisted MU-MIMO Wireless System under Time-Varying Channels
Authors:
Meng-Qian Alexander Wu,
Tzu-Hsien Sang,
Luisa Schuhmacher,
Ming-Jie Guo,
Khodr Hammoud,
Sofie Pollin
Abstract:
Utilizing Deep Reinforcement Learning (DRL) for Reconfigurable Intelligent Surface (RIS) assisted wireless communication has been extensively researched. However, existing DRL methods either act as a simple optimizer or only solve problems with concurrent Channel State Information (CSI) represented in the training data set. Consequently, solutions for RIS-assisted wireless communication systems un…
▽ More
Utilizing Deep Reinforcement Learning (DRL) for Reconfigurable Intelligent Surface (RIS) assisted wireless communication has been extensively researched. However, existing DRL methods either act as a simple optimizer or only solve problems with concurrent Channel State Information (CSI) represented in the training data set. Consequently, solutions for RIS-assisted wireless communication systems under time-varying environments are relatively unexplored. However, communication problems should be considered with realistic assumptions; for instance, in scenarios where the channel is time-varying, the policy obtained by reinforcement learning should be applicable for situations where CSI is not well represented in the training data set. In this paper, we apply Meta-Reinforcement Learning (MRL) to the joint optimization problem of active beamforming at the Base Station (BS) and phase shift at the RIS, motivated by MRL's ability to extend the DRL concept of solving one Markov Decision Problem (MDP) to multiple MDPs. We provide simulation results to compare the average sum rate of the proposed approach with those of selected forerunners in the literature. Our approach improves the sum rate by more than 60% under time-varying CSI assumption while maintaining the advantages of typical DRL-based solutions. Our study's results emphasize the possibility of utilizing MRL-based designs in RIS-assisted wireless communication systems while considering realistic environment assumptions.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Component-Wise Natural Gradient Descent -- An Efficient Neural Network Optimization
Authors:
Tran Van Sang,
Mhd Irvan,
Rie Shigetomi Yamaguchi,
Toshiyuki Nakata
Abstract:
Natural Gradient Descent (NGD) is a second-order neural network training that preconditions the gradient descent with the inverse of the Fisher Information Matrix (FIM). Although NGD provides an efficient preconditioner, it is not practicable due to the expensive computation required when inverting the FIM. This paper proposes a new NGD variant algorithm named Component-Wise Natural Gradient Desce…
▽ More
Natural Gradient Descent (NGD) is a second-order neural network training that preconditions the gradient descent with the inverse of the Fisher Information Matrix (FIM). Although NGD provides an efficient preconditioner, it is not practicable due to the expensive computation required when inverting the FIM. This paper proposes a new NGD variant algorithm named Component-Wise Natural Gradient Descent (CW-NGD). CW-NGD is composed of 2 steps. Similar to several existing works, the first step is to consider the FIM matrix as a block-diagonal matrix whose diagonal blocks correspond to the FIM of each layer's weights. In the second step, unique to CW-NGD, we analyze the layer's structure and further decompose the layer's FIM into smaller segments whose derivatives are approximately independent. As a result, individual layers' FIMs are approximated in a block-diagonal form that trivially supports the inversion. The segment decomposition strategy is varied by layer structure. Specifically, we analyze the dense and convolutional layers and design their decomposition strategies appropriately. In an experiment of training a network containing these 2 types of layers, we empirically prove that CW-NGD requires fewer iterations to converge compared to the state-of-the-art first-order and second-order methods.
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
Authors:
Tong Sang,
Hongyao Tang,
Yi Ma,
Jianye Hao,
Yan Zheng,
Zhaopeng Meng,
Boyan Li,
Zhen Wang
Abstract:
Deep Reinforcement Learning (DRL) has been a promising solution to many complex decision-making problems. Nevertheless, the notorious weakness in generalization among environments prevent widespread application of DRL agents in real-world scenarios. Although advances have been made recently, most prior works assume sufficient online interaction on training environments, which can be costly in prac…
▽ More
Deep Reinforcement Learning (DRL) has been a promising solution to many complex decision-making problems. Nevertheless, the notorious weakness in generalization among environments prevent widespread application of DRL agents in real-world scenarios. Although advances have been made recently, most prior works assume sufficient online interaction on training environments, which can be costly in practical cases. To this end, we focus on an offline-training-online-adaptation setting, in which the agent first learns from offline experiences collected in environments with different dynamics and then performs online policy adaptation in environments with new dynamics. In this paper, we propose Policy Adaptation with Decoupled Representations (PAnDR) for fast policy adaptation. In offline training phase, the environment representation and policy representation are learned through contrastive learning and policy recovery, respectively. The representations are further refined by mutual information optimization to make them more decoupled and complete. With learned representations, a Policy-Dynamics Value Function (PDVF) [Raileanu et al., 2020] network is trained to approximate the values for different combinations of policies and environments from offline experiences. In online adaptation phase, with the environment context inferred from few experiences collected in new environments, the policy is optimized by gradient ascent with respect to the PDVF. Our experiments show that PAnDR outperforms existing algorithms in several representative policy adaptation problems.
△ Less
Submitted 30 May, 2022; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Laser-Induced Graphitisation of Diamond Under 30 fs Laser Pulse Irradiation
Authors:
Bakhtiar Ali,
Han Xu,
Dashavir Chetty,
Robert T. Sang,
Igor V. Litvinyuk,
Maksym Rybachuk
Abstract:
The degree of laser-induced graphitisation from a sp3-bonded to a sp2-bonded carbon fraction in a single crystal chemical vapour deposited (CVD) diamond under a varying fluence of an ultrashort pulsed laser (30 fs, 800 nm, 1 kHz) irradiation has been studied. The tetrahedral CVD sp3-phase was found to transition to primarily an sp2-aromatic crystalline graphitic fraction below the critical fluence…
▽ More
The degree of laser-induced graphitisation from a sp3-bonded to a sp2-bonded carbon fraction in a single crystal chemical vapour deposited (CVD) diamond under a varying fluence of an ultrashort pulsed laser (30 fs, 800 nm, 1 kHz) irradiation has been studied. The tetrahedral CVD sp3-phase was found to transition to primarily an sp2-aromatic crystalline graphitic fraction below the critical fluence of 3.9 J/cm2, above which predominantly an amorphous carbon was formed. A fractional increase of fluence from 3.3 J/cm2 to 3.9 J/cm2 (~ 20 %) resulted in a substantial (~ three-fold) increased depth of the sp2-graphitised areas owing to the non-linear interactions associated with an fs-laser irradiation. Additionally, formation of C=O carbonyl group was observed below the critical threshold fluence; the C=O cleavage occurred gradually with the increase of irradiation fluence of 30 fs laser light. The implications for these findings on enhancement of fs-driven processing of diamond are discussed.
△ Less
Submitted 25 March, 2022;
originally announced March 2022.
-
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
Authors:
Pengyi Li,
Hongyao Tang,
Tianpei Yang,
Xiaotian Hao,
Tong Sang,
Yan Zheng,
Jianye Hao,
Matthew E. Taylor,
Wenyuan Tao,
Zhen Wang,
Fazl Barez
Abstract:
Learning to collaborate is critical in Multi-Agent Reinforcement Learning (MARL). Previous works promote collaboration by maximizing the correlation of agents' behaviors, which is typically characterized by Mutual Information (MI) in different forms. However, we reveal sub-optimal collaborative behaviors also emerge with strong correlations, and simply maximizing the MI can, surprisingly, hinder t…
▽ More
Learning to collaborate is critical in Multi-Agent Reinforcement Learning (MARL). Previous works promote collaboration by maximizing the correlation of agents' behaviors, which is typically characterized by Mutual Information (MI) in different forms. However, we reveal sub-optimal collaborative behaviors also emerge with strong correlations, and simply maximizing the MI can, surprisingly, hinder the learning towards better collaboration. To address this issue, we propose a novel MARL framework, called Progressive Mutual Information Collaboration (PMIC), for more effective MI-driven collaboration. PMIC uses a new collaboration criterion measured by the MI between global states and joint actions. Based on this criterion, the key idea of PMIC is maximizing the MI associated with superior collaborative behaviors and minimizing the MI associated with inferior ones. The two MI objectives play complementary roles by facilitating better collaborations while avoiding falling into sub-optimal ones. Experiments on a wide range of MARL benchmarks show the superior performance of PMIC compared with other algorithms.
△ Less
Submitted 21 February, 2023; v1 submitted 16 March, 2022;
originally announced March 2022.
-
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning
Authors:
Tong Sang,
Hongyao Tang,
Jianye Hao,
Yan Zheng,
Zhaopeng Meng
Abstract:
Value estimation is one key problem in Reinforcement Learning. Albeit many successes have been achieved by Deep Reinforcement Learning (DRL) in different fields, the underlying structure and learning dynamics of value function, especially with complex function approximation, are not fully understood. In this paper, we report that decreasing rank of $Q$-matrix widely exists during learning process…
▽ More
Value estimation is one key problem in Reinforcement Learning. Albeit many successes have been achieved by Deep Reinforcement Learning (DRL) in different fields, the underlying structure and learning dynamics of value function, especially with complex function approximation, are not fully understood. In this paper, we report that decreasing rank of $Q$-matrix widely exists during learning process across a series of continuous control tasks for different popular algorithms. We hypothesize that the low-rank phenomenon indicates the common learning dynamics of $Q$-matrix from stochastic high dimensional space to smooth low dimensional space. Moreover, we reveal a positive correlation between value matrix rank and value estimation uncertainty. Inspired by above evidence, we propose a novel Uncertainty-Aware Low-rank Q-matrix Estimation (UA-LQE) algorithm as a general framework to facilitate the learning of value function. Through quantifying the uncertainty of state-action value estimation, we selectively erase the entries of highly uncertain values in state-action value matrix and conduct low-rank matrix reconstruction for them to recover their values. Such a reconstruction exploits the underlying structure of value matrix to improve the value approximation, thus leading to a more efficient learning process of value function. In the experiments, we evaluate the efficacy of UA-LQE in several representative OpenAI MuJoCo continuous control tasks.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
Attosecond delays of high harmonic emissions from isotopes of molecular hydrogen measured by Gouy phase XUV interferometer
Authors:
Mumta Hena Mustary,
Liang Xu,
Wanyang Wu,
Nida Haram,
Dane E. Laban,
Han Xu,
Feng He,
Igor V. Litvinyuk,
R. T. Sang
Abstract:
High harmonic spectroscopy can access structural and dynamical information on molecular systems encoded in amplitude and phase of high harmonic generation (HHG) signals4. However, measurement of the harmonic phase is a daunting task. Here we present a precise measurement of HHG phase difference between two isotopes of molecular hydrogen using the advanced extreme-ultraviolet (XUV) Gouy phase inter…
▽ More
High harmonic spectroscopy can access structural and dynamical information on molecular systems encoded in amplitude and phase of high harmonic generation (HHG) signals4. However, measurement of the harmonic phase is a daunting task. Here we present a precise measurement of HHG phase difference between two isotopes of molecular hydrogen using the advanced extreme-ultraviolet (XUV) Gouy phase interferometer. The measured phase difference is about 200 mrad, corresponding to 3 attoseconds (1 as = 10^-18 s) time delay which is nearly independent of harmonic order. The measurements agree very well with numerical calculations of a four-dimensional time-dependent Schroedinger equation. Numerical simulations also reveal the effects of molecular orientation and intra-molecular two-centre interference on the measured phase difference. This technique opens a new avenue for measuring the phase of harmonic emission for different atoms and molecules. Together with isomeric or isotopic comparisons it also enables the observation of subtle effects of molecular structures and nuclear motion on electron dynamics in strong laser fields.
△ Less
Submitted 9 November, 2021;
originally announced November 2021.
-
Carrier-Envelope-Phase Dependent Strong-Field Excitation
Authors:
D. Chetty,
R. D. Glover,
X. M. Tong,
B. A. deHarak,
H. Xu,
N. Haram,
K. Bartschat,
A. J. Palmer,
A. N. Luiten,
P. S. Light,
I. V. Litvinyuk,
R. T. Sang
Abstract:
We present a joint experimental-theoretical study on the effect of the carrier-envelope phase (CEP) of a few-cycle pulse on the atomic excitation process. We focus on the excitation rates of argon as a function of CEP in the intensity range from 50-300 TW/cm$^2$, which covers the transition between the multiphoton and tunneling regimes. Through numerical simulations based on solving the time-depen…
▽ More
We present a joint experimental-theoretical study on the effect of the carrier-envelope phase (CEP) of a few-cycle pulse on the atomic excitation process. We focus on the excitation rates of argon as a function of CEP in the intensity range from 50-300 TW/cm$^2$, which covers the transition between the multiphoton and tunneling regimes. Through numerical simulations based on solving the time-dependent Schrödinger equation (TDSE), we show that the resulting bound-state population is highly sensitive to both the intensity and the CEP. Because the intensity varies over the interaction region, the CEP effect is considerably reduced in the experiment. Nevertheless, the data clearly agree with the theoretical prediction, and the results encourage the use of precisely tailored laser fields to coherently control the strong-field excitation process. We find a markedly different behavior for the CEP-dependent bound-state population at low and high intensities with a clear boundary, which we attribute to the transition from the multiphoton to the tunneling regime.
△ Less
Submitted 15 August, 2021;
originally announced August 2021.
-
Strong field ionisation of Argon: Electron momentum spectra and nondipole effects
Authors:
Nida Haram,
Han Xu,
Igor Ivanov,
Dashavir Chetty,
Igor Litvinyuk,
R. T. Sang
Abstract:
We investigate the influence of relativistic nondipole effects on the photoelectron spectra of argon, particularly in the low kinetic energy region (0 eV - 5 eV). In our experiment, we use intense linearly polarised 800 nm laser pulse to ionise Ar from a jet and we record photoelectron energy and momentum distributions using a reaction microscope (REMI). Our measurements show that nondipole effect…
▽ More
We investigate the influence of relativistic nondipole effects on the photoelectron spectra of argon, particularly in the low kinetic energy region (0 eV - 5 eV). In our experiment, we use intense linearly polarised 800 nm laser pulse to ionise Ar from a jet and we record photoelectron energy and momentum distributions using a reaction microscope (REMI). Our measurements show that nondipole effect can cause an energy-dependent asymmetry along the laser propagation direction in the photoelectron energy and momentum spectra. Model simulation based on time-dependent Dirac equation (TDDE) can reproduce our measurement results. The electron trajectory analysis based on classical model reveals that the photoelectron which obtains negative momentum shift along laser propagation direction is caused by the interplay between the Lorenz force induced radiation pressure during its free propagation in continuum and re-scattering by Coulomb potential of the parent ion when it is driven back by the laser field.
△ Less
Submitted 29 July, 2021;
originally announced July 2021.
-
Dutch General Public Reaction on Governmental COVID-19 Measures and Announcements in Twitter Data
Authors:
Shihan Wang,
Marijn Schraagen,
Erik Tjong Kim Sang,
Mehdi Dastani
Abstract:
Public sentiment (the opinions, attitudes or feelings expressed by the public) is a factor of interest for government, as it directly influences the implementation of policies. Given the unprecedented nature of the COVID-19 crisis, having an up-to-date representation of public sentiment on governmental measures and announcements is crucial. While the 'staying-at-home' policy makes face-to-face int…
▽ More
Public sentiment (the opinions, attitudes or feelings expressed by the public) is a factor of interest for government, as it directly influences the implementation of policies. Given the unprecedented nature of the COVID-19 crisis, having an up-to-date representation of public sentiment on governmental measures and announcements is crucial. While the 'staying-at-home' policy makes face-to-face interactions and interviews challenging, analysing real-time Twitter data that reflects public opinion toward policy measures is a cost-effective way to access public sentiment. In this context, we collect streaming data using the Twitter API starting from the COVID-19 outbreak in the Netherlands in February 2020, and track Dutch general public reactions on governmental measures and announcements. We provide temporal analysis of tweet frequency and public sentiment over the past seven months. We also identify public attitudes towards two Dutch policies in case studies: one regarding social distancing and one regarding wearing face masks. By presenting those preliminary results, we aim to provide visibility into the social media discussions around COVID-19 to the general public, scientists and policy makers. The data collection and analysis will be updated and expanded over time.
△ Less
Submitted 21 December, 2020; v1 submitted 12 June, 2020;
originally announced June 2020.
-
Observation of Dynamic Stark Resonances in Strong-Field Excitation
Authors:
Dashavir Chetty,
Rohan D. Glover,
Bruno A. deHarak,
Xiao-Min Tong,
Han Xu,
Tom Pauly,
Noah Smith,
Kathryn R. Hamilton,
Klaus Bartschat,
Joseph P. Ziegel,
Nicolas Douguet,
Andre N. Luiten,
Philip S. Light,
Igor V. Litvinyuk,
Robert T. Sang
Abstract:
We investigate AC Stark-shifted resonances in argon with ultrashort near-infrared pulses. Using 30 fs pulses we observe periodic enhancements of the excitation yield in the intensity regions corresponding to the absorption of 13 and 14 photons. By reducing the pulse duration to 6 fs with only a few optical cycles, we also demonstrate that the enhancements are significantly reduced beyond what is m…
▽ More
We investigate AC Stark-shifted resonances in argon with ultrashort near-infrared pulses. Using 30 fs pulses we observe periodic enhancements of the excitation yield in the intensity regions corresponding to the absorption of 13 and 14 photons. By reducing the pulse duration to 6 fs with only a few optical cycles, we also demonstrate that the enhancements are significantly reduced beyond what is measurable in the experiment. Comparing these to numerical predictions, which are in quantitative agreement with experimental results, we find that even though the quantum-state distribution can be broad, the enhancements are largely due to efficient population of a select few AC Stark-shifted resonant states rather than the closing of an ionization channel. Because these resonances are dependent on the frequency and intensity of the laser field, the broad bandwidth of the 6 fs pulses means that the resonance condition is fulfilled across a large range of intensities. This is further exaggerated by volume-averaging effects, resulting in excitation of the $5g$ state at almost all intensities and reducing the apparent magnitude of the enhancements. For 30 fs pulses, volume averaging also broadens the quantum state distribution but the enhancements are still large enough to survive. In this case, selectivity of excitation to a single state is reduced below 25% of the relative population. However, an analysis of TDSE simulations indicates that excitation of up to 60% into a single state is possible if volume averaging can be eliminated and the intensity can be precisely controlled.
△ Less
Submitted 2 April, 2020; v1 submitted 12 December, 2019;
originally announced December 2019.
-
Influences of Human Demographics, Brand Familiarity and Security Backgrounds on Homograph Recognition
Authors:
Tran Phuong Thao,
Yukiko Sawaya,
Hoang-Quoc Nguyen-Son,
Akira Yamada,
Ayumu Kubota,
Tran Van Sang,
Rie Shigetomi Yamaguchi
Abstract:
Homograph attack is a way that attackers deceive victims about which website domain name they are communicating with by exploiting the fact that many characters look alike. The attack becomes serious and is raising broad attention when recently many brand domains have been attacked such as Apple Inc., Adobe Inc., Lloyds Bank, etc. We first design a survey of human demographics, brand familiarity,…
▽ More
Homograph attack is a way that attackers deceive victims about which website domain name they are communicating with by exploiting the fact that many characters look alike. The attack becomes serious and is raising broad attention when recently many brand domains have been attacked such as Apple Inc., Adobe Inc., Lloyds Bank, etc. We first design a survey of human demographics, brand familiarity, and security backgrounds and apply it to 2,067 participants. We build a regression model to study which factors affect participants' ability in recognizing homograph domains. We find that for different levels of visual similarity, the participants exhibit different abilities. 13.95% of participants can recognize non-homographs while 16.60% of participants can recognize homographs whose the visual similarity with the target brand domains is under 99.9%; but when the similarity increases to 99.9%, the number of participants who can recognize homographs significantly drops down to only 0.19%; and for the homographs with 100% of visual similarity, there is no way for the participants to recognize. We also find that female participants tend to recognize homographs better the male but male participants tend to able to recognize non-homographs better than females. Security knowledge is a significant factor affecting both homographs and non-homographs; surprisingly, people who have strong security knowledge tend to be able to recognize homographs but not non-homographs. Furthermore, people who work or are educated in computer science or computer engineering do not appear as a factor affecting the ability in recognizing homographs; however, interestingly, right after they are explained about the homograph attack, people who work or are educated in computer science or computer engineering are the ones who can capture the situation the most quickly.
△ Less
Submitted 26 January, 2020; v1 submitted 23 April, 2019;
originally announced April 2019.
-
Utilizing a Transparency-driven Environment toward Trusted Automatic Genre Classification: A Case Study in Journalism History
Authors:
Aysenur Bilgin,
Laura Hollink,
Jacco van Ossenbruggen,
Erik Tjong Kim Sang,
Kim Smeenk,
Frank Harbers,
Marcel Broersma
Abstract:
With the growing abundance of unlabeled data in real-world tasks, researchers have to rely on the predictions given by black-boxed computational models. However, it is an often neglected fact that these models may be scoring high on accuracy for the wrong reasons. In this paper, we present a practical impact analysis of enabling model transparency by various presentation forms. For this purpose, w…
▽ More
With the growing abundance of unlabeled data in real-world tasks, researchers have to rely on the predictions given by black-boxed computational models. However, it is an often neglected fact that these models may be scoring high on accuracy for the wrong reasons. In this paper, we present a practical impact analysis of enabling model transparency by various presentation forms. For this purpose, we developed an environment that empowers non-computer scientists to become practicing data scientists in their own research field. We demonstrate the gradually increasing understanding of journalism historians through a real-world use case study on automatic genre classification of newspaper articles. This study is a first step towards trusted usage of machine learning pipelines in a responsible way.
△ Less
Submitted 1 October, 2018;
originally announced October 2018.
-
Effect of double pulse irradiation on the morphology of a picosecond laser produced chromium plasma
Authors:
Kavya H. Rao,
N. Smijesh,
D. Chetty,
I. V. Litvinyuk,
R. T. Sang
Abstract:
We describe the measurements to control the morphology and hence the characteristics of a picosecond laser produced chromium plasma plume upon double-pulse (DP) irradiation compared to its single-pulse (SP) counterpart. DP schemes are realized by employing two geometries wherein the inter-pulse delay ($τ_p$) in the collinear geometry and the spatial separation ($Δx$) are the control parameters for…
▽ More
We describe the measurements to control the morphology and hence the characteristics of a picosecond laser produced chromium plasma plume upon double-pulse (DP) irradiation compared to its single-pulse (SP) counterpart. DP schemes are realized by employing two geometries wherein the inter-pulse delay ($τ_p$) in the collinear geometry and the spatial separation ($Δx$) are the control parameters for schemes DP$_1$ and DP$_2$ respectively. The aspect ratio (plume length/plume width) decreases upon increasing parameters such as pressure, delay between pulses and the energy of the second pulse in DP1 scheme. Interestingly, the expansion conditions of the plume which occurs at higher pressures for SP scheme could be recreated in DP1 scheme for a lower pressure $\sim$ 10$^{-6}$ Torr. This could be potentially applied for immediate applications such as high harmonic generation and quality thin film production.
△ Less
Submitted 26 August, 2018;
originally announced August 2018.
-
Relativistic non-dipole effects in strong-field atomic ionization at moderate intensities
Authors:
Nida Haram,
Igor Ivanov,
Han Xu,
Kyung T. Kim,
Atia-tul-Noor,
U. Satya Sainadh,
R. D. Glover,
D. Chetty,
Igor Litvinyuk,
R. T. Sang
Abstract:
We present a detailed experimental and theoretical study on the relativistic non-dipole effects in strong-field atomic ionisation by near-infrared linearly-polarised few-cycle laser pulses in the intensity range 1014 -1015 W/cm2. We record high-resolution photoelectron momentum distributions of argon using a reaction microscope and compare our measurements with a truly ab-initio fully relativistic…
▽ More
We present a detailed experimental and theoretical study on the relativistic non-dipole effects in strong-field atomic ionisation by near-infrared linearly-polarised few-cycle laser pulses in the intensity range 1014 -1015 W/cm2. We record high-resolution photoelectron momentum distributions of argon using a reaction microscope and compare our measurements with a truly ab-initio fully relativistic 3D model based on the time-dependent Dirac equation. We observe counter-intuitive peak shifts of the transverse electron momentum distribution in the direction opposite to that of laser propagation as a function of laser intensity and demonstrate an excellent agreement between experimental results and theoretical predictions.
△ Less
Submitted 1 April, 2019; v1 submitted 31 July, 2018;
originally announced August 2018.
-
Laser-based metastable krypton generation
Authors:
M. A. Dakka,
G. Tsiminis,
P. S. Light,
R. D. Glover,
C. Perrella,
J. Moffatt,
N. A. Spooner,
R. T. Sang,
A. N. Luiten
Abstract:
We demonstrate the generation of metastable krypton in the long-lived 1s5 state using laser excitation. The atoms are excited through a two-photon absorption process into the 2p6 state using a pulsed optical parametric oscillator laser operating near 215 nm, after which the atoms decay quickly into the metastable state with a branching ratio of 75 %. The interaction dynamics are modeled using dens…
▽ More
We demonstrate the generation of metastable krypton in the long-lived 1s5 state using laser excitation. The atoms are excited through a two-photon absorption process into the 2p6 state using a pulsed optical parametric oscillator laser operating near 215 nm, after which the atoms decay quickly into the metastable state with a branching ratio of 75 %. The interaction dynamics are modeled using density matrix formalism and, by combining this with experimental observations, we are able to calculate photo-ionization and two-photon absorption cross-sections. When compared to traditional approaches to metastable production, this new approach shows great potential for high-density metastable krypton production with minimal heating of the sample. Here, we show metastable production efficiencies of up to 2% per pulse. The new experimental results gained here, when combined with the density matrix model we have developed, suggest that fractional efficiencies up to 30% are possible under optimal conditions.
△ Less
Submitted 11 August, 2018; v1 submitted 15 May, 2018;
originally announced May 2018.
-
Time-resolved optical emission spectroscopic studies of picosecond laser produced Cr plasma
Authors:
Kavya H. Rao,
N. Smijesh,
N. Klemke,
R. Philip,
I. V. Litvinyuk,
R. T. Sang
Abstract:
Time-resolved optical emission spectroscopic measurements of a plasma generated by irradiating a Cr target using 60 picosecond (ps) and 300 ps laser pulses is carried out to investigate the variation in the linewidth ($δλ$) of emission from neutrals and ions for increasing ambient pressures. Measurements ranging from 10$^{-6}$ Torr to 10$^2$ Torr show a distinctly different variation in the $δλ$ o…
▽ More
Time-resolved optical emission spectroscopic measurements of a plasma generated by irradiating a Cr target using 60 picosecond (ps) and 300 ps laser pulses is carried out to investigate the variation in the linewidth ($δλ$) of emission from neutrals and ions for increasing ambient pressures. Measurements ranging from 10$^{-6}$ Torr to 10$^2$ Torr show a distinctly different variation in the $δλ$ of neutrals (Cr I) compared to that of singly ionized Cr (Cr II), for both irradiations. $δλ$ increases monotonously with pressure for Cr II, but an oscillation is evident at intermediate pressures for Cr I. This oscillation does not depend on the laser pulse widths used. In spite of the differences in the plasma formation mechanisms, it is experimentally found that there is an optimum intermediate background pressure for which $δλ$ of neutrals drops to a minimum. Importantly, these results underline the fact that for intermediate pressures, the usual practice of calculating the plasma number density from the $δλ$ of neutrals needs to be judiciously done, to avoid reaching inaccurate conclusions.
△ Less
Submitted 21 February, 2018;
originally announced February 2018.
-
Plasma plumes produced by laser ablation of Al with single and double pulse schemes
Authors:
N Smijesh,
Kavya H. Rao,
D. Chetty,
I. V. Litvinyuk,
R. T. Sang
Abstract:
We generated and characterized plasma with single and double picosecond laser pulses in order to study the plume dynamics and to control the plasma properties. The double-pulse scheme was found to be superior for the generation of a homogeneous plasma. The lateral expansion was prominent for irradiation schemes wherein energy of the first pulse is lower/equal to that of the second pulse. While the…
▽ More
We generated and characterized plasma with single and double picosecond laser pulses in order to study the plume dynamics and to control the plasma properties. The double-pulse scheme was found to be superior for the generation of a homogeneous plasma. The lateral expansion was prominent for irradiation schemes wherein energy of the first pulse is lower/equal to that of the second pulse. While the velocities of the fast and slow species were found to be nearly equal, the emission counts corresponding to slow species are larger for single pulse compared to the double pulse.
△ Less
Submitted 2 November, 2017;
originally announced November 2017.
-
Attosecond angular streaking and tunnelling time in atomic hydrogen
Authors:
U. Satya Sainadh,
Han Xu,
Xiaoshan Wang,
Atia-Tul-Noor,
William C. Wallace,
Nicolas Douguet,
Alexander W. Bray,
Igor Ivanov,
Klaus Bartschat,
Anatoli Kheifets,
R. T. Sang,
I. V. Litvinyuk
Abstract:
Tunnelling, one of the key features of quantum mechanics, ignited an ongoing debate about the value, meaning and interpretation of 'tunnelling time'. Until recently the debate was purely theoretical, with the process considered to be instantaneous for all practical purposes. This changed with the development of ultrafast lasers and in particular, the 'attoclock' technique that is used to probe the…
▽ More
Tunnelling, one of the key features of quantum mechanics, ignited an ongoing debate about the value, meaning and interpretation of 'tunnelling time'. Until recently the debate was purely theoretical, with the process considered to be instantaneous for all practical purposes. This changed with the development of ultrafast lasers and in particular, the 'attoclock' technique that is used to probe the attosecond dynamics of electrons. Although the initial attoclock measurements hinted at instantaneous tunnelling, later experiments contradicted those findings, claiming to have measured finite tunnelling times. In each case these measurements were performed with multi-electron atoms. Atomic hydrogen (H), the simplest atomic system with a single electron, can be 'exactly' (subject only to numerical limitations) modelled using numerical solutions of the 3D-TDSE with measured experimental parameters and acts as a convenient benchmark for both accurate experimental measurements and calculations. Here we report the first attoclock experiment performed on H and find that our experimentally determined offset angles are in excellent agreement with accurate 3D-TDSE simulations performed using our experimental pulse parameters. The same simulations with a short-range Yukawa potential result in zero offset angles for all intensities. We conclude that the offset angle measured in the attoclock experiments originates entirely from electron scattering by the long-range Coulomb potential with no contribution from tunnelling time delay. That conclusion is supported by empirical observation that the electron offset angles follow closely the simple formula for the deflection angle of electrons undergoing classical Rutherford scattering by the Coulomb potential. Thus we confirm that, in H, tunnelling is instantaneous (with an upperbound of 1.8 as) within our experimental and numerical uncertainty.
△ Less
Submitted 1 March, 2018; v1 submitted 17 July, 2017;
originally announced July 2017.
-
Outer limits of subdifferentials for min-max type functions
Authors:
Andrew Eberhard,
Vera Roshchina,
Tian Sang
Abstract:
We generalise the outer subdifferential constructon suggested by Cánovas, Henrion, López and Parra for max type functions to pointwise minima of regular Lipschitz functions. We also answer an open question about the relation between the outer subdifferential of the support of a regular function and the end set of its subdifferential posed by Li, Meng and Yang.
We generalise the outer subdifferential constructon suggested by Cánovas, Henrion, López and Parra for max type functions to pointwise minima of regular Lipschitz functions. We also answer an open question about the relation between the outer subdifferential of the support of a regular function and the end set of its subdifferential posed by Li, Meng and Yang.
△ Less
Submitted 29 August, 2017; v1 submitted 11 January, 2017;
originally announced January 2017.
-
Compact convex sets with prescribed facial dimensions
Authors:
Vera Roshchina,
Tian Sang,
David Yost
Abstract:
While faces of a polytope form a well structured lattice, in which faces of each possible dimension are present, this is not true for general compact convex sets. We address the question of what dimensional patterns are possible for the faces of general closed convex sets. We show that for any finite sequence of positive integers there exist compact convex sets which only have extreme points and f…
▽ More
While faces of a polytope form a well structured lattice, in which faces of each possible dimension are present, this is not true for general compact convex sets. We address the question of what dimensional patterns are possible for the faces of general closed convex sets. We show that for any finite sequence of positive integers there exist compact convex sets which only have extreme points and faces with dimensions from this prescribed sequence. We also discuss another approach to dimensionality, considering the dimension of the union of all faces of the same dimension. We show that the questions arising from this approach are highly nontrivial and give examples of convex sets for which the sets of extreme points have fractal dimension.
△ Less
Submitted 21 March, 2017; v1 submitted 9 October, 2016;
originally announced October 2016.
-
On the conjecture by Demyanov-Ryabova in converting finite exhausters
Authors:
Tian Sang
Abstract:
In this paper, we prove the conjecture of Demyanov and Ryabova on the length of cycles in converting exhausters in an affinely independent setting and obtain a combinatorial reformulation of the conjecture.
Given a finite collection of polyhedra, we can obtain its "dual" collection by forming another collection of polyhedra, which are obtained as the convex hull of all support faces of all polyh…
▽ More
In this paper, we prove the conjecture of Demyanov and Ryabova on the length of cycles in converting exhausters in an affinely independent setting and obtain a combinatorial reformulation of the conjecture.
Given a finite collection of polyhedra, we can obtain its "dual" collection by forming another collection of polyhedra, which are obtained as the convex hull of all support faces of all polyhedra for a given direction in space. If we keep applying this process, we will eventually cycle due to the finiteness of the problem. Demyanov and Ryabova claim that this cycle will eventually reach a length of at most two.
We prove that the conjecture is true in the special case, that is, when we have affinely independent number of vertices in the given space. We also obtain an equivalent combinatorial reformulation for the problem, which should advance insight for the future work on this problem.
△ Less
Submitted 3 April, 2016; v1 submitted 24 January, 2016;
originally announced January 2016.
-
Precise and accurate measurements of strong-field photoionisation and a transferrable laser intensity calibration standard
Authors:
W. C. Wallace,
O. Ghafur,
C. Khurmi,
Satya Sainadh U.,
J. E. Calvert,
D. E. Laban,
M. G. Pullen,
K. Bartschat,
A. N. Grum-Grzhimailo,
D. Wells,
H. M. Quiney,
X. M. Tong,
I. V. Litvinyuk,
R. T. Sang,
D. Kielpinski
Abstract:
Ionization of atoms and molecules in strong laser fields is a fundamental process in many fields of research, especially in the emerging field of attosecond science. So far, demonstrably accurate data have only been acquired for atomic hydrogen (H), a species that is accessible to few investigators. Here we present measurements of the ionization yield for argon, krypton, and xenon with percentleve…
▽ More
Ionization of atoms and molecules in strong laser fields is a fundamental process in many fields of research, especially in the emerging field of attosecond science. So far, demonstrably accurate data have only been acquired for atomic hydrogen (H), a species that is accessible to few investigators. Here we present measurements of the ionization yield for argon, krypton, and xenon with percentlevel accuracy, calibrated using H, in a laser regime widely used in attosecond science. We derive a transferrable calibration standard for laser peak intensity, accurate to 1.3%, that is based on a simple reference curve. In addition, our measurements provide a much-needed benchmark for testing models of ionisation in noble-gas atoms, such as the widely employed single-active electron approximation.
△ Less
Submitted 17 January, 2016;
originally announced January 2016.
-
The interaction of excited atoms and few-cycle laser pulses
Authors:
J. E. Calvert,
Han Xu,
A. J. Palmer,
R. D. Glover,
D. E. Laban,
X. M. Tong,
V. K. Dolmatov,
A. S. Kheifets,
K. Bartschat,
I. V. Litvinyuk,
D. Kielpinski,
R. T. Sang
Abstract:
This work describes the first observations of the ionisation of neon in a metastable atomic state utilising a strong-field, few-cycle light pulse. We compare the observations to theoretical predictions based on the Ammosov-Delone-Krainov (ADK) theory and a solution to the time-dependent Schrodinger equation (TDSE). The TDSE provides better agreement with the experimental data than the ADK theory.…
▽ More
This work describes the first observations of the ionisation of neon in a metastable atomic state utilising a strong-field, few-cycle light pulse. We compare the observations to theoretical predictions based on the Ammosov-Delone-Krainov (ADK) theory and a solution to the time-dependent Schrodinger equation (TDSE). The TDSE provides better agreement with the experimental data than the ADK theory. We optically pump the target atomic species and demonstrate that the ionisation rate depends on the spin state of the target atoms and provide physically transparent interpretation of such a spin dependence in the frameworks of the spin-polarised Hartree-Fock and random-phase approximations.
△ Less
Submitted 18 January, 2016; v1 submitted 14 January, 2016;
originally announced January 2016.
-
Measuring laser carrier-envelope phase effects in the noble gases with an atomic hydrogen calibration standard
Authors:
Champak Khurmi,
W. C. Wallace,
Satya Sainadh U,
I. A. Ivanov,
A. S. Kheifets,
X. M. Tong,
I. V. Litvinyuk,
R. T. Sang,
D. Kielpinski
Abstract:
We present accurate measurements of carrier-envelope phase effects on ionisation of the noble gases with few-cycle laser pulses. The experimental apparatus is calibrated by using atomic hydrogen data to remove any systematic offsets and thereby obtain accurate CEP data on other generally used noble gases such as Ar, Kr and Xe. Experimental results for H are well supported by exact TDSE theoretical…
▽ More
We present accurate measurements of carrier-envelope phase effects on ionisation of the noble gases with few-cycle laser pulses. The experimental apparatus is calibrated by using atomic hydrogen data to remove any systematic offsets and thereby obtain accurate CEP data on other generally used noble gases such as Ar, Kr and Xe. Experimental results for H are well supported by exact TDSE theoretical simulations however significant differences are observed in case of noble gases.
△ Less
Submitted 10 January, 2016;
originally announced January 2016.
-
Isotope effect in tunnelling ionization of neutral hydrogen molecules
Authors:
X. Wang,
H. Xu,
A. Atia-Tul-Noor,
B. T. Hu,
D. Kielpinski,
R. T. Sang,
I. V. Litvinyuk
Abstract:
It has been recently predicted theoretically that due to nuclear motion light and heavy hydrogen molecules exposed to strong electric field should exhibit substantially different tunneling ionization rates (O.I. Tolstikhin, H.J. Worner and T. Morishita, Phys. Rev. A 87, 041401(R) (2013) [1]). We studied that isotope effect experimentally by measuring relative ionization yields for each species in…
▽ More
It has been recently predicted theoretically that due to nuclear motion light and heavy hydrogen molecules exposed to strong electric field should exhibit substantially different tunneling ionization rates (O.I. Tolstikhin, H.J. Worner and T. Morishita, Phys. Rev. A 87, 041401(R) (2013) [1]). We studied that isotope effect experimentally by measuring relative ionization yields for each species in a mixed H2/D2 gas jet interacting with intense femtosecond laser pulses. In a reaction microscope apparatus we detected ionic fragments from all contributing channels (single ionization, dissociation, and sequential double ionization) and determined the ratio of total single ionization yields for H2 and D2. The measured ratio agrees quantitatively with the prediction of the generalized weak-field asymptotic theory in an apparent failure of the frozen-nuclei approximation.
△ Less
Submitted 21 June, 2015;
originally announced June 2015.
-
Experimental observation of the elusive double-peak structure in R-dependent strong-field ionization rate of H2+
Authors:
Han Xu,
Feng He,
D. Kielpinski,
R. T. Sang,
I. V. Litvinyuk
Abstract:
When a diatomic molecule is ionized by an intense laser field, the ionization rate depends very strongly on the inter-nuclear separation. That dependence exhibits a pronounced maximum at the inter-nuclear separation known as the critical distance. This phenomenon was first demonstrated theoretically in H2+ and became known as charge-resonance enhanced ionization (CREI, in reference to a proposed p…
▽ More
When a diatomic molecule is ionized by an intense laser field, the ionization rate depends very strongly on the inter-nuclear separation. That dependence exhibits a pronounced maximum at the inter-nuclear separation known as the critical distance. This phenomenon was first demonstrated theoretically in H2+ and became known as charge-resonance enhanced ionization (CREI, in reference to a proposed physical mechanism) or simply enhanced ionisation (EI). All theoretical models of this phenomenon predict a double-peak structure in the R-dependent ionization rate of H2+. However, such double-peak structure has never been observed experimentally. It was even suggested that it is impossible to observe due to fast motion of the nuclear wavepackets. Here we report a few-cycle pump-probe experiment which clearly resolves that elusive double-peak structure. In the experiment, an expanding H2+ ion produced by an intense pump pulse is probed by a much weaker probe pulse. The predicted double-peak structure is clearly seen in delay-dependent kinetic energy spectra of protons when pump and probe pulses are polarized parallel to each other. No structure is seen when the probe is polarized perpendicular to the pump.
△ Less
Submitted 17 April, 2015;
originally announced April 2015.
-
Transverse electron momentum distribution in tunneling and over the barrier ionization by laser pulses with varying ellipticity
Authors:
I. A. Ivanov,
A. S. Kheifets,
J. E. Calvert,
S. Goodall,
X. Wang,
Han Xu,
A. J. Palmer,
D. Kielpinski,
I. V. Litvinyuk,
R. T. Sang
Abstract:
We study transverse electron momentum distribution (TEMD) in strong field atomic ionization driven by laser pulses with varying ellipticity. We show, both experimentally and theoretically, that the TEMD in the tunneling and over the barrier ionization regimes evolves in a qualitatively different way when the ellipticity parameter describing polarization state of the driving laser pulse increases.
We study transverse electron momentum distribution (TEMD) in strong field atomic ionization driven by laser pulses with varying ellipticity. We show, both experimentally and theoretically, that the TEMD in the tunneling and over the barrier ionization regimes evolves in a qualitatively different way when the ellipticity parameter describing polarization state of the driving laser pulse increases.
△ Less
Submitted 29 June, 2015; v1 submitted 16 March, 2015;
originally announced March 2015.
-
Benchmarking strong-field ionisation with atomic hydrogen
Authors:
D. Kielpinski,
R. T. Sang,
I. V. Litvinyuk
Abstract:
As the simplest atomic system, the hydrogen atom plays a key benchmarking role in laser and quantum physics. Atomic hydrogen is a widely used atomic test system for theoretical calculations of strong-field ionization, since approximate theories can be directly compared to numerical solutions of the time-dependent Schrödinger equation. However, relatively little experimental data is available for c…
▽ More
As the simplest atomic system, the hydrogen atom plays a key benchmarking role in laser and quantum physics. Atomic hydrogen is a widely used atomic test system for theoretical calculations of strong-field ionization, since approximate theories can be directly compared to numerical solutions of the time-dependent Schrödinger equation. However, relatively little experimental data is available for comparison to these calculations, since atomic hydrogen sources are difficult to construct and use. We review the existing experimental results on strong-field ionization of atomic hydrogen in multi-cycle and few-cycle laser pulses. Quantitative agreement has been achieved between experiment and theoretical predictions at the 10% uncertainty level, and has been used to develop an intensity calibration method with 1% uncertainty. Such quantitative agreement can be used to certify experimental techniques as being free from systematic errors, guaranteeing the accuracy of data obtained on species other than H. We review the experimental and theoretical techniques that enable these results.
△ Less
Submitted 25 March, 2014;
originally announced March 2014.
-
Carrier-Envelope-Phase Dependent Dissociation of Hydrogen
Authors:
Han Xu,
J - P Maclean,
D E Laban,
W C Wallace,
D Kielpinski,
R T Sang,
I V Litvinyuk
Abstract:
We studied dependence of dissociative ionization in H2 on carrier-envelope phase (CEP) of few-cycle (6fs) near-infrared (NIR) laser pulses. For low-energy channels, we present the first experimental observation of CEP dependence for total dissociation yield and the highest dwgree of asymmetry reported to date (40%). The observed modulations in both asymmetry and total yield could be understood in…
▽ More
We studied dependence of dissociative ionization in H2 on carrier-envelope phase (CEP) of few-cycle (6fs) near-infrared (NIR) laser pulses. For low-energy channels, we present the first experimental observation of CEP dependence for total dissociation yield and the highest dwgree of asymmetry reported to date (40%). The observed modulations in both asymmetry and total yield could be understood in terms of interference between different n-photon dissociation pathways - n and (n+1) photon channels for asymmetry, n and (n+2) photon channels for yield - as suggested by the general theory of CEP effects (Roudnev and Esry, Phys. Rev. Lett. 99, 220406 (2007), [1]). The yield modulation is found to be Pi-periodic in CEP, with its phase strongly dependent on fragment kinetic energy (and reversing its sign within the studied energy range), indicating that the dissociation does not simply follow the CEP dependence of maximum electric field, as a naive intuition might suggest. We also find that a positively chirped pulse can lead to a higher dissociation probability than a transform limited pulse.
△ Less
Submitted 12 November, 2012;
originally announced November 2012.
-
Experimental ionization of atomic hydrogen with few-cycle pulses
Authors:
M. G. Pullen,
W. C. Wallace,
D. E. Laban,
A. J. Palmer,
G. F. Hanne,
A. N. Grum-Grzhimailo,
B. Abeln,
K. Bartschat,
D. Weflen,
I. Ivanov,
A. Kheifets,
H. M. Quiney,
I. V. Litvinyuk,
R. T. Sang,
D. Kielpinski
Abstract:
We present the first experimental data on strong-field ionization of atomic hydrogen by few-cycle laser pulses. We obtain quantitative agreement at the 10% level between the data and an {\it ab initio} simulation over a wide range of laser intensities and electron energies.
We present the first experimental data on strong-field ionization of atomic hydrogen by few-cycle laser pulses. We obtain quantitative agreement at the 10% level between the data and an {\it ab initio} simulation over a wide range of laser intensities and electron energies.
△ Less
Submitted 3 October, 2011;
originally announced October 2011.
-
Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition
Authors:
Erik F. Tjong Kim Sang,
Fien De Meulder
Abstract:
We describe the CoNLL-2003 shared task: language-independent named entity recognition. We give background information on the data sets (English and German) and the evaluation method, present a general overview of the systems that have taken part in the task and discuss their performance.
We describe the CoNLL-2003 shared task: language-independent named entity recognition. We give background information on the data sets (English and German) and the evaluation method, present a general overview of the systems that have taken part in the task and discuss their performance.
△ Less
Submitted 12 June, 2003;
originally announced June 2003.
-
Introduction to the CoNLL-2002 Shared Task: Language-Independent Named Entity Recognition
Authors:
Erik F. Tjong Kim Sang
Abstract:
We describe the CoNLL-2002 shared task: language-independent named entity recognition. We give background information on the data sets and the evaluation method, present a general overview of the systems that have taken part in the task and discuss their performance.
We describe the CoNLL-2002 shared task: language-independent named entity recognition. We give background information on the data sets and the evaluation method, present a general overview of the systems that have taken part in the task and discuss their performance.
△ Less
Submitted 5 September, 2002;
originally announced September 2002.
-
Memory-Based Shallow Parsing
Authors:
Erik F. Tjong Kim Sang
Abstract:
We present memory-based learning approaches to shallow parsing and apply these to five tasks: base noun phrase identification, arbitrary base phrase recognition, clause detection, noun phrase parsing and full parsing. We use feature selection techniques and system combination methods for improving the performance of the memory-based learner. Our approach is evaluated on standard data sets and th…
▽ More
We present memory-based learning approaches to shallow parsing and apply these to five tasks: base noun phrase identification, arbitrary base phrase recognition, clause detection, noun phrase parsing and full parsing. We use feature selection techniques and system combination methods for improving the performance of the memory-based learner. Our approach is evaluated on standard data sets and the results are compared with that of other systems. This reveals that our approach works well for base phrase identification while its application towards recognizing embedded structures leaves some room for improvement.
△ Less
Submitted 24 April, 2002;
originally announced April 2002.
-
Combining a self-organising map with memory-based learning
Authors:
James Hammerton,
Erik F. Tjong Kim Sang
Abstract:
Memory-based learning (MBL) has enjoyed considerable success in corpus-based natural language processing (NLP) tasks and is thus a reliable method of getting a high-level of performance when building corpus-based NLP systems. However there is a bottleneck in MBL whereby any novel testing item has to be compared against all the training items in memory base. For this reason there has been some in…
▽ More
Memory-based learning (MBL) has enjoyed considerable success in corpus-based natural language processing (NLP) tasks and is thus a reliable method of getting a high-level of performance when building corpus-based NLP systems. However there is a bottleneck in MBL whereby any novel testing item has to be compared against all the training items in memory base. For this reason there has been some interest in various forms of memory editing whereby some method of selecting a subset of the memory base is employed to reduce the number of comparisons. This paper investigates the use of a modified self-organising map (SOM) to select a subset of the memory items for comparison. This method involves reducing the number of comparisons to a value proportional to the square root of the number of training items. The method is tested on the identification of base noun-phrases in the Wall Street Journal corpus, using sections 15 to 18 for training and section 20 for testing.
△ Less
Submitted 15 July, 2001;
originally announced July 2001.
-
Learning Computational Grammars
Authors:
John Nerbonne,
Anja Belz,
Nicola Cancedda,
Herve Dejean,
James Hammerton,
Rob Koeling,
Stasinos Konstantopoulos,
Miles Osborne,
Franck Thollard,
Erik F. Tjong Kim Sang
Abstract:
This paper reports on the "Learning Computational Grammars" (LCG) project, a postdoc network devoted to studying the application of machine learning techniques to grammars suitable for computational use. We were interested in a more systematic survey to understand the relevance of many factors to the success of learning, esp. the availability of annotated data, the kind of dependencies in the da…
▽ More
This paper reports on the "Learning Computational Grammars" (LCG) project, a postdoc network devoted to studying the application of machine learning techniques to grammars suitable for computational use. We were interested in a more systematic survey to understand the relevance of many factors to the success of learning, esp. the availability of annotated data, the kind of dependencies in the data, and the availability of knowledge bases (grammars). We focused on syntax, esp. noun phrase (NP) syntax.
△ Less
Submitted 15 July, 2001;
originally announced July 2001.
-
Introduction to the CoNLL-2001 Shared Task: Clause Identification
Authors:
Erik F. Tjong Kim Sang,
Herve Dejean
Abstract:
We describe the CoNLL-2001 shared task: dividing text into clauses. We give background information on the data sets, present a general overview of the systems that have taken part in the shared task and briefly discuss their performance.
We describe the CoNLL-2001 shared task: dividing text into clauses. We give background information on the data sets, present a general overview of the systems that have taken part in the shared task and briefly discuss their performance.
△ Less
Submitted 15 July, 2001;
originally announced July 2001.
-
Introduction to the CoNLL-2000 Shared Task: Chunking
Authors:
Erik F. Tjong Kim Sang,
Sabine Buchholz
Abstract:
We describe the CoNLL-2000 shared task: dividing text into syntactically related non-overlap** groups of words, so-called text chunking. We give background information on the data sets, present a general overview of the systems that have taken part in the shared task and briefly discuss their performance.
We describe the CoNLL-2000 shared task: dividing text into syntactically related non-overlap** groups of words, so-called text chunking. We give background information on the data sets, present a general overview of the systems that have taken part in the shared task and briefly discuss their performance.
△ Less
Submitted 18 September, 2000;
originally announced September 2000.
-
Meta-Learning for Phonemic Annotation of Corpora
Authors:
Veronique Hoste,
Walter Daelemans,
Erik Tjong Kim Sang,
Steven Gillis
Abstract:
We apply rule induction, classifier combination and meta-learning (stacked classifiers) to the problem of bootstrap** high accuracy automatic annotation of corpora with pronunciation information. The task we address in this paper consists of generating phonemic representations reflecting the Flemish and Dutch pronunciations of a word on the basis of its orthographic representation (which in tu…
▽ More
We apply rule induction, classifier combination and meta-learning (stacked classifiers) to the problem of bootstrap** high accuracy automatic annotation of corpora with pronunciation information. The task we address in this paper consists of generating phonemic representations reflecting the Flemish and Dutch pronunciations of a word on the basis of its orthographic representation (which in turn is based on the actual speech recordings). We compare several possible approaches to achieve the text-to-pronunciation map** task: memory-based learning, transformation-based learning, rule induction, maximum entropy modeling, combination of classifiers in stacked learning, and stacking of meta-learners. We are interested both in optimal accuracy and in obtaining insight into the linguistic regularities involved. As far as accuracy is concerned, an already high accuracy level (93% for Celex and 86% for Fonilex at word level) for single classifiers is boosted significantly with additional error reductions of 31% and 38% respectively using combination of classifiers, and a further 5% using combination of meta-learners, bringing overall word level accuracy to 96% for the Dutch variant and 92% for the Flemish variant. We also show that the application of machine learning methods indeed leads to increased insight into the linguistic regularities determining the variation between the two pronunciation variants studied.
△ Less
Submitted 18 August, 2000;
originally announced August 2000.
-
Applying System Combination to Base Noun Phrase Identification
Authors:
Erik F. Tjong Kim Sang,
Walter Daelemans,
Herve Dejean,
Rob Koeling,
Yuval Krymolowski,
Vasin Punyakanok,
Dan Roth
Abstract:
We use seven machine learning algorithms for one task: identifying base noun phrases. The results have been processed by different system combination methods and all of these outperformed the best individual result. We have applied the seven learners with the best combinator, a majority vote of the top five systems, to a standard data set and managed to improve the best published result for this…
▽ More
We use seven machine learning algorithms for one task: identifying base noun phrases. The results have been processed by different system combination methods and all of these outperformed the best individual result. We have applied the seven learners with the best combinator, a majority vote of the top five systems, to a standard data set and managed to improve the best published result for this data set.
△ Less
Submitted 17 August, 2000;
originally announced August 2000.
-
Noun Phrase Recognition by System Combination
Authors:
Erik F. Tjong Kim Sang
Abstract:
The performance of machine learning algorithms can be improved by combining the output of different systems. In this paper we apply this idea to the recognition of noun phrases.We generate different classifiers by using different representations of the data. By combining the results with voting techniques described in (Van Halteren et.al. 1998) we manage to improve the best reported performances…
▽ More
The performance of machine learning algorithms can be improved by combining the output of different systems. In this paper we apply this idea to the recognition of noun phrases.We generate different classifiers by using different representations of the data. By combining the results with voting techniques described in (Van Halteren et.al. 1998) we manage to improve the best reported performances on standard data sets for base noun phrases and arbitrary noun phrases.
△ Less
Submitted 10 May, 2000;
originally announced May 2000.
-
Representing Text Chunks
Authors:
Erik F. Tjong Kim Sang,
Jorn Veenstra
Abstract:
Dividing sentences in chunks of words is a useful preprocessing step for parsing, information extraction and information retrieval. (Ramshaw and Marcus, 1995) have introduced a "convenient" data representation for chunking by converting it to a tagging task. In this paper we will examine seven different data representations for the problem of recognizing noun phrase chunks. We will show that the…
▽ More
Dividing sentences in chunks of words is a useful preprocessing step for parsing, information extraction and information retrieval. (Ramshaw and Marcus, 1995) have introduced a "convenient" data representation for chunking by converting it to a tagging task. In this paper we will examine seven different data representations for the problem of recognizing noun phrase chunks. We will show that the the data representation choice has a minor influence on chunking performance. However, equipped with the most suitable data representation, our memory-based learning chunker was able to improve the best published chunking results for a standard data set.
△ Less
Submitted 6 July, 1999;
originally announced July 1999.