Skip to main content

Showing 1–50 of 175 results for author: Carbonell, J

.
  1. arXiv:2405.02407  [pdf, other

    hep-ph cond-mat.mtrl-sci nucl-th physics.atom-ph quant-ph

    Lepton-neutron interaction and S-wave low energy parameters

    Authors: Jaume Carbonell, Tobias Frederico

    Abstract: A lepton-neutron potential in configuration space is obtained. It is based on the Coulomb plus hyperfine interaction Hamiltonian integrated over the neutron charge and magnetic densities. Different parametrisations of the neutron electromagnetic form factors are compared. It is given in the operator form with a central, spin-spin, tensor and spin-orbit terms. The potentials for lowest partial wave… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 16 pages, 10 figures, To appear in Phys. Rev. C

  2. arXiv:2309.14831  [pdf, other

    nucl-th hep-ex hep-ph nucl-ex physics.atom-ph

    Comparison of $\bar{\hbox{N}}\hbox{N}$ optical models

    Authors: Jaume Carbonell, Guillaume Hupin, Sławomir Wycech

    Abstract: We compare the strong part of the $\bar{\hbox{N}}\hbox{N}$ interaction obtained by the Nijmegen partial wave analysis and the results of some of the most popular $\bar{\hbox{N}}\hbox{N}$ optical potentials in configuration space. We have found severe discrepancies in most of the partial waves, especially above $p_{Lab}$=400 MeV/c where the partial wave analysis displays a resonant-like structure i… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  3. Scaling of the $^{19}$B two-neutron halo properties close to unitarity

    Authors: Emiko Hiyama, Rimantas Lazauskas, Jaume Carbonell, Tobias Frederico

    Abstract: We explore the description of the bound $^{19}$B isotope in terms of a $^{17}$B+n+n three-body system where the two-body subsystems $^{17}$B+n and neutron-neutron (nn) have virtual states close to the continuum. Dimensionless scaling functions for the root-mean-square (rms) radii are defined and studied for different parameters of the neutron-core potential and considering three different models f… ▽ More

    Submitted 25 August, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: 9 pages, 7 figures

  4. Low energy structures in nuclear reactions with 4n in the final state

    Authors: Rimantas Lazauskas, Emiko Hiyama, Jaume Carbonell

    Abstract: We present a reaction model to describe the fast removal of the $α$-particle core in $^8$He nucleus with eventual emission of four neutrons. The obtained four neutron energy distributions allows to explain the sharp low energy peak observed by studying the missing mass spectra of four neutrons in [Nature Vol. 606, p. 678], as a consequence of dineutron-dineutron correlations.

    Submitted 14 February, 2023; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted to PRL

  5. arXiv:2207.04634  [pdf, other

    nucl-th hep-ph nucl-ex

    $^7$H ground state as a $^3$H+4n resonance

    Authors: Emiko Hiyama, Rimantas Lazauskas, Jaume Carbonell

    Abstract: We have investigated the possible existence of a $^7$H resonant state, considered as a five-body system consisting of a $^3$H core with four valence neutrons. To this aim, an effective n-$^3$H potential is constructed in order to reproduce the low energy elastic neutron scattering on $^3$H phase shifts and the $^5$H resonant ground state in terms of $^3$H-n-n system. The variational Gaussian Expan… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: 15 pages, 9 figures

    Journal ref: Physics Letters B833 (2022) 137367

  6. arXiv:2110.08628  [pdf, other

    nucl-th hep-ph nucl-ex

    Protonium annihilation densities in a unitary coupled channel model

    Authors: Emanuel Ydrefors, Jaume Carbonell

    Abstract: We consider a unitary coupled channel model to describe the low energy proton-antiproton scattering and the lower Coulomb-like protonium states. The existence of deeper quasi-bound states of nuclear nature is found to be a consequence of the experimental data. The properties of these states as well as the protonium annihilation densities are described and the difference with respect to the optical… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: 14 pages

    Journal ref: Eur. Phys. J. A (2021) 57:303

  7. Antiproton-deuteron hydrogenic states in optical models

    Authors: Rimantas Lazauskas, Jaume Carbonell

    Abstract: By solving the Faddeev equations for the ppn system, we compute the antiproton-deuteron level shifts and widths for the lowest hydrogenic states as well as the corresponding pd scattering lengths and volumes. The pd annihilation densities are obtained and compared to the nuclear density of deuterium. The validity of the Trueman relation for composite particles is studied. The strong part of NN int… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: To appear in Physics Letters B (2021)

    Journal ref: Physics Letters B 820 (2021) 136573

  8. The quest for light multineutron systems

    Authors: F. Miguel Marques, Jaume Carbonell

    Abstract: The long history of the research concerning the possible existence of bound or resonant states in light multineutron systems, essentially $^3$n and $^4$n, is reviewed. Both the experimental and the theoretical points of view have been considered, with the aim of showing a clear picture of all the different detection and calculation techniques that have been used, with particular emphasis in the is… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in European Physical Journal A

  9. Hybrid nature of the abnormal solutions of the Bethe-Salpeter equation in the Wick-Cutkosky model

    Authors: J. Carbonell, V. A. Karmanov ans H. Sazdjian

    Abstract: In the Wick-Cutkosky model, where two scalar massive constituents interact by means of the exchange of a scalar massless particle, the Bethe-Salpeter equation has solutions of two types, called "normal" and "abnormal". In the non-relativistic limit, the normal solutions correspond to the usual Coulomb spectrum, whereas the abnormal ones do not have non-relativistic counterparts -- they are absent… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: 22 pages, 20 figures, 2 tables. Accepted in Eur. Phys. J. C

    Journal ref: Eur. Phys. J. C (2021) 81:50

  10. 19B isotope as a 17B-n-n three-body cluster close to unitary limit

    Authors: J. Carbonell, E. Hiyama, R. Lazauskas, F. M. Marqués

    Abstract: We describe 19B in terms of a 17B-n-n three-body system, where the two-body subsystems 17B-n and n-n are unbound (virtual) states close to the unitary limit. The energy of 19B ground state is well reproduced and two low-lying resonances are predicted. Their eventual link with the Efimov physics is discussed. This model can be extended to describe the recently discovered resonant states in 20,21B.

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 27th International Nuclear Physics Conference (INPC2019). arXiv admin note: substantial text overlap with arXiv:1912.05427

    Journal ref: Journal of Physics: Conference Series 1643 (2020) 012120

  11. arXiv:2010.02500  [pdf, other

    cs.CL cs.LG

    Efficient Meta Lifelong-Learning with Limited Memory

    Authors: Zirui Wang, Sanket Vaibhav Mehta, Barnabás Póczos, Jaime Carbonell

    Abstract: Current natural language processing models work well on a single task, yet they often fail to continuously learn new tasks without forgetting previous ones as they are re-trained throughout their lifetime, a challenge known as lifelong learning. State-of-the-art lifelong language learning methods store past examples in episodic memory and replay them at both training and inference time. However, a… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Published as a main conference paper at EMNLP 2020

  12. arXiv:2005.01866  [pdf, other

    cs.CL

    Soft Gazetteers for Low-Resource Named Entity Recognition

    Authors: Shruti Rijhwani, Shuyan Zhou, Graham Neubig, Jaime Carbonell

    Abstract: Traditional named entity recognition models use gazetteers (lists of entities) as features to improve performance. Although modern neural network models do not require such hand-crafted features for strong performance, recent work has demonstrated their utility for named entity recognition on English data. However, designing such features for low-resource languages is challenging, because exhausti… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: Accepted at ACL 2020

  13. arXiv:2003.01343  [pdf

    cs.CL

    Improving Candidate Generation for Low-resource Cross-lingual Entity Linking

    Authors: Shuyan Zhou, Shruti Rijhwani, John Wieting, Jaime Carbonell, Graham Neubig

    Abstract: Cross-lingual entity linking (XEL) is the task of finding referents in a target-language knowledge base (KB) for mentions extracted from source-language texts. The first step of (X)EL is candidate generation, which retrieves a list of plausible candidate entities from the target-language KB for each mention. Approaches based on resources from Wikipedia have proven successful in the realm of relati… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: Accepted to TACL 2020

  14. arXiv:2003.00576  [pdf, other

    cs.CL

    StructSum: Summarization via Structured Representations

    Authors: Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee, Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov

    Abstract: Abstractive text summarization aims at compressing the information of a long source document into a rephrased, condensed summary. Despite advances in modeling techniques, abstractive summarization models still suffer from several key challenges: (i) layout bias: they overfit to the style of training corpora; (ii) limited abstractiveness: they are optimized to copying n-grams from the source rather… ▽ More

    Submitted 16 February, 2021; v1 submitted 1 March, 2020; originally announced March 2020.

  15. arXiv:2002.05876  [pdf

    nucl-th physics.atom-ph

    Description of Four- and Five-Nucleon Systems by Solving Faddeev-Yakubovsky Equations in Configuration Space

    Authors: Rimantas Lazauskas, Jaume Carbonell

    Abstract: The Faddeev Yakubovsky equations constitute a rigorous formulation of the quantum mechanical N body problem in the framework of non relativistic dynamics. They allow the exact solutions of the Schrodinger equation for bound and scattering states to be obtained. In this review, we will present the general formalism as well as the numerical tools we use to solve Faddeev Yakubovsky equations in confi… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published in Frontiers in Physics 7:251 (2020)

    Journal ref: Frontiers in Physics 7:251 (2020)

  16. arXiv:2001.11258  [pdf, ps, other

    cs.CL cs.CY cs.LG

    Harnessing Code Switching to Transcend the Linguistic Barrier

    Authors: Ashiqur R. KhudaBukhsh, Shriphani Palakodety, Jaime G. Carbonell

    Abstract: Code mixing (or code switching) is a common phenomenon observed in social-media content generated by a linguistically diverse user-base. Studies show that in the Indian sub-continent, a substantial fraction of social media posts exhibit code switching. While the difficulties posed by code mixed documents to further downstream analyses are well-understood, lending visibility to code mixed documents… ▽ More

    Submitted 15 June, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

  17. arXiv:2001.00401  [pdf, ps, other

    hep-ph hep-th nucl-th

    Structure and EM form factors of purely relativistic systems

    Authors: V. A. Karmanov, J. Carbonell, H. Sazdjian

    Abstract: The Bethe-Salpeter equation for two massive scalar particles interacting by scalar massless exchange has solutions of two types, which differ from each other by their behavior in the non-relativistic limit: the normal solutions which turn into the Coulomb ones and the "abnormal" solutions. The latter ones have no non-relativistic counterparts and disappear in the non-relativistic limit. We studied… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

    Comments: 5 pages, 3 figures, Contribution in proceedings of the conference: Light Cone 2019, 16-20 September 2019, Ecole Polytechnique, Palaiseau, France

  18. arXiv:1912.05427  [pdf

    nucl-th nucl-ex quant-ph

    Low-energy neutron scattering on light nuclei and $^{19}$B as a $^{17}$B-$n$-$n$ three-body system in the unitary limit

    Authors: Jaume Carbonell, Emiko Hiyama, Rimantas Lazauskas, F. Miguel Marqués

    Abstract: We consider the evolution of the neutron-nucleus scattering length for the lightest nuclei. We show that, when increasing the number of neutrons in the target nucleus, the strong Pauli repulsion is weakened and the balance with the attractive nucleon-nucleon interaction results into a resonant virtual state in $^{18}$B. We describe $^{19}$B in terms of a $^{17}$B-$n$-$n$ three-body system where… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Proceedings for the 24th edition of European Few Body Conference, Surrey, UK, 2-4 September 2019

    Journal ref: SciPost Physics Proceedings 2019

  19. arXiv:1911.10088  [pdf, other

    cs.LG cs.CL stat.ML

    Optimizing Data Usage via Differentiable Rewards

    Authors: Xinyi Wang, Hieu Pham, Paul Michel, Antonios Anastasopoulos, Jaime Carbonell, Graham Neubig

    Abstract: To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems. Similarly, a machine learning model could potentially be trained better with a scorer that "adapts" to its current learning state and estimates the importance of each training data instance. Trainin… ▽ More

    Submitted 16 June, 2021; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted at ICML 2020

  20. arXiv:1910.04708  [pdf, other

    cs.CL cs.LG

    Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework

    Authors: Zirui Wang, Jiateng Xie, Ruochen Xu, Yiming Yang, Graham Neubig, Jaime Carbonell

    Abstract: Learning multilingual representations of text has proven a successful method for many cross-lingual transfer learning tasks. There are two main paradigms for learning such representations: (1) alignment, which maps different independently trained monolingual representations into a shared space, and (2) joint training, which directly learns unified multilingual representations using monolingual and… ▽ More

    Submitted 17 February, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: Published as a conference paper at ICLR 2020. First two authors contributed equally. Source code is available at https://github.com/thespectrewithin/joint-align

  21. arXiv:1910.03206  [pdf, ps, other

    cs.CY cs.CL cs.IR cs.LG

    Voice for the Voiceless: Active Sampling to Detect Comments Supporting the Rohingyas

    Authors: Shriphani Palakodety, Ashiqur R. KhudaBukhsh, Jaime G. Carbonell

    Abstract: The Rohingya refugee crisis is one of the biggest humanitarian crises of modern times with more than 600,000 Rohingyas rendered homeless according to the United Nations High Commissioner for Refugees. While it has received sustained press attention globally, no comprehensive research has been performed on social media pertaining to this large evolving crisis. In this work, we construct a substanti… ▽ More

    Submitted 6 January, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

  22. arXiv:1909.12940  [pdf, ps, other

    cs.CY cs.CL cs.LG

    Hope Speech Detection: A Computational Analysis of the Voice of Peace

    Authors: Shriphani Palakodety, Ashiqur R. KhudaBukhsh, Jaime G. Carbonell

    Abstract: The recent Pulwama terror attack (February 14, 2019, Pulwama, Kashmir) triggered a chain of escalating events between India and Pakistan adding another episode to their 70-year-old dispute over Kashmir. The present era of ubiquitious social media has never seen nuclear powers closer to war. In this paper, we analyze this evolving international crisis via a substantial corpus constructed using comm… ▽ More

    Submitted 24 February, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: Minor edits

  23. arXiv:1909.06743  [pdf, other

    cs.CL cs.LG

    Learning Rhyming Constraints using Structured Adversaries

    Authors: Harsh Jhamtani, Sanket Vaibhav Mehta, Jaime Carbonell, Taylor Berg-Kirkpatrick

    Abstract: Existing recurrent neural language models often fail to capture higher-level structure present in text: for example, rhyming patterns present in poetry. Much prior work on poetry generation uses manually defined constraints which are satisfied during decoding using either specialized decoding procedures or rejection sampling. The rhyming constraints themselves are typically not learned by the gene… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: EMNLP-IJCNLP 2019 Short Paper

  24. arXiv:1908.08983  [pdf, other

    cs.CL

    A Little Annotation does a Lot of Good: A Study in Bootstrap** Low-resource Named Entity Recognizers

    Authors: Aditi Chaudhary, Jiateng Xie, Zaid Sheikh, Graham Neubig, Jaime G. Carbonell

    Abstract: Most state-of-the-art models for named entity recognition (NER) rely on the availability of large amounts of labeled data, making them challenging to extend to new, lower-resourced languages. However, there are now several proposed approaches involving either cross-lingual transfer learning, which learns from other highly resourced languages, or active learning, which efficiently selects effective… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Accepted at EMNLP 2019

  25. The Faddeev-Yakubovsky symphony

    Authors: Rimantas Lazauskas, Jaume Carbonell

    Abstract: We briefly summarize the main steps leading to the Faddeev-Yakubovsky equations in configuration space for N=3, 4 and 5 interacting particles.

    Submitted 13 August, 2019; originally announced August 2019.

  26. arXiv:1907.10129  [pdf, other

    cs.CL

    CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology

    Authors: Aditi Chaudhary, Elizabeth Salesky, Gayatri Bhat, David R. Mortensen, Jaime G. Carbonell, Yulia Tsvetkov

    Abstract: This paper presents the submission by the CMU-01 team to the SIGMORPHON 2019 task 2 of Morphological Analysis and Lemmatization in Context. This task requires us to produce the lemma and morpho-syntactic description of each token in a sequence, for 107 treebanks. We approach this task with a hierarchical neural conditional random field (CRF) model which predicts each coarse-grained feature (eg. PO… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: In Proceedings of the ACL-SIGMORPHON 2019 Shared Task: Crosslinguality and Context in Morphology

  27. Modeling $^{19}$B as a $^{17}$B-n-n three-body system in the unitary limit

    Authors: Emiko Hiyama, Rimantas Lazauskas, F. Miguel Marqués, Jaume Carbonell

    Abstract: We present a model description of the bound $^{17}$B isotope in terms of a $^{17}$B-n-n three-body system where the two-body subsystems $^{17}$B-n and n-n are unbound (virtual) states close to the unitary limit. The $^{17}$B ground state is well described in terms of two-body potentials only, and two low-lying resonances are predicted. Their eventual link with the Efimov physics is discussed. Thi… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

    Comments: Accepted as a Rapid Communication in Physical Review C

    Journal ref: Phys. Rev. C 100, 011603 (2019)

  28. arXiv:1906.08237  [pdf, other

    cs.CL cs.LG

    XLNet: Generalized Autoregressive Pretraining for Language Understanding

    Authors: Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le

    Abstract: With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting the input with masks, BERT neglects dependency between the masked positions and suffers from a pretrain-finetune discrepancy. In light of these pros and cons, we p… ▽ More

    Submitted 2 January, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: Pretrained models and code are available at https://github.com/zihangdai/xlnet

  29. arXiv:1906.00376  [pdf, other

    cs.CL

    Domain Adaptation of Neural Machine Translation by Lexicon Induction

    Authors: Junjie Hu, Mengzhou Xia, Graham Neubig, Jaime Carbonell

    Abstract: It has been previously noted that neural machine translation (NMT) is very sensitive to domain shift. In this paper, we argue that this is a dual effect of the highly lexicalized nature of NMT, resulting in failure for sentences with large numbers of unknown words, and lack of supervision for domain-specific words. To remedy this problem, we propose an unsupervised adaptation method which fine-tun… ▽ More

    Submitted 2 June, 2019; originally announced June 2019.

    Journal ref: published at the 57th Annual Meeting of the Association for Computational Linguistics (ACL). July 2019

  30. arXiv:1903.02892  [pdf, ps, other

    hep-ph hep-th nucl-th

    Bound states of relativistic nature

    Authors: V. A. Karmanov, J. Carbonell, H. Sazdjian

    Abstract: Bethe-Salpeter equation, for massless exchange and large fine structure constant $α>π/4$, in addition to the Balmer series, provides another (abnormal) series of energy levels which are not given by the Schrödinger equation. So strong field can be created by a point-like charge $Z>107$. The nuclei with this charge, though available, they are far from to be point-like that weakens the field. Theref… ▽ More

    Submitted 7 March, 2019; originally announced March 2019.

    Comments: 13 pages, 10 figures, to be published in Proceedings of the International Conference: Nuclear Theory in the Supercomputing Era-2018 (NTSE-2018), Daejeon, South Korea, October 29 - November 2, 2018; eds. A. M. Shirokov and A. I. Mazur. Pacific National University, Khabarovsk, Russia, 2019

  31. arXiv:1902.10553  [pdf, other

    nucl-th nucl-ex physics.comp-ph

    Ab initio calculations of 5H resonant states

    Authors: R. Lazauskas, E. Hiyama, J. Carbonell

    Abstract: By solving the 5-body Faddeev-Yakubovsky equations in configuration space with realistic nuclear Hamiltonians we have studied the resonant states of $^5$H isotope. Two different methods, allowing to bypass the exponentially diverging boundary conditions, have been employed providing consistent results. The existence of $^5$H broad J$^π$=1/2$^+$,3/2$^+$,5/2$^+$ states as S-matrix poles has been con… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: to appear in Physics Letters B (2019)

  32. arXiv:1902.08899  [pdf, other

    cs.CL

    The ARIEL-CMU Systems for LoReHLT18

    Authors: Aditi Chaudhary, Siddharth Dalmia, Junjie Hu, Xinjian Li, Austin Matthews, Aldrian Obaja Muis, Naoki Otani, Shruti Rijhwani, Zaid Sheikh, Nidhi Vyas, Xinyi Wang, Jiateng Xie, Ruochen Xu, Chunting Zhou, Peter J. Jansen, Yiming Yang, Lori Levin, Florian Metze, Teruko Mitamura, David R. Mortensen, Graham Neubig, Eduard Hovy, Alan W Black, Jaime Carbonell, Graham V. Horwood , et al. (5 additional authors not shown)

    Abstract: This paper describes the ARIEL-CMU submissions to the Low Resource Human Language Technologies (LoReHLT) 2018 evaluations for the tasks Machine Translation (MT), Entity Discovery and Linking (EDL), and detection of Situation Frames in Text and Speech (SF Text and Speech).

    Submitted 24 February, 2019; originally announced February 2019.

  33. arXiv:1901.02860  [pdf, other

    cs.LG cs.CL stat.ML

    Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

    Authors: Zihang Dai, Zhilin Yang, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov

    Abstract: Transformers have a potential of learning longer-term dependency, but are limited by a fixed-length context in the setting of language modeling. We propose a novel neural architecture Transformer-XL that enables learning dependency beyond a fixed length without disrupting temporal coherence. It consists of a segment-level recurrence mechanism and a novel positional encoding scheme. Our method not… ▽ More

    Submitted 2 June, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Comments: ACL 2019 long paper. Code and pretrained models are available at https://github.com/kimiyoung/transformer-xl

  34. arXiv:1811.09751  [pdf, other

    cs.LG stat.ML

    Characterizing and Avoiding Negative Transfer

    Authors: Zirui Wang, Zihang Dai, Barnabás Póczos, Jaime Carbonell

    Abstract: When labeled data is scarce for a specific target task, transfer learning often offers an effective solution by utilizing data from a related source task. However, when transferring knowledge from a less related source, it may inversely hurt the target performance, a phenomenon known as negative transfer. Despite its pervasiveness, negative transfer is usually described in an informal manner, lack… ▽ More

    Submitted 4 October, 2019; v1 submitted 23 November, 2018; originally announced November 2018.

    Comments: Published at CVPR 2019

  35. arXiv:1811.04154  [pdf, other

    cs.CL

    Zero-shot Neural Transfer for Cross-lingual Entity Linking

    Authors: Shruti Rijhwani, Jiateng Xie, Graham Neubig, Jaime Carbonell

    Abstract: Cross-lingual entity linking maps an entity mention in a source language to its corresponding entry in a structured knowledge base that is in a different (target) language. While previous work relies heavily on bilingual lexical resources to bridge the gap between the source and the target languages, these resources are scarce or unavailable for many low-resource languages. To address this problem… ▽ More

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: To appear in AAAI 2019

  36. arXiv:1808.09861  [pdf, other

    cs.CL

    Neural Cross-Lingual Named Entity Recognition with Minimal Resources

    Authors: Jiateng Xie, Zhilin Yang, Graham Neubig, Noah A. Smith, Jaime Carbonell

    Abstract: For languages with no annotated resources, unsupervised transfer of natural language processing models such as named-entity recognition (NER) from resource-rich languages would be an appealing capability. However, differences in words and word order across languages make it a challenging problem. To improve map** of lexical items across languages, we propose a method that finds translations base… ▽ More

    Submitted 11 September, 2018; v1 submitted 29 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018 long paper

  37. arXiv:1808.09543  [pdf, ps, other

    cs.CL

    Towards Semi-Supervised Learning for Deep Semantic Role Labeling

    Authors: Sanket Vaibhav Mehta, Jay Yoon Lee, Jaime Carbonell

    Abstract: Neural models have shown several state-of-the-art performances on Semantic Role Labeling (SRL). However, the neural models require an immense amount of semantic-role corpora and are thus not well suited for low-resource languages or domains. The paper proposes a semi-supervised semantic role labeling method that outperforms the state-of-the-art in limited SRL training corpora. The method is based… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  38. arXiv:1808.09500  [pdf

    cs.CL

    Adapting Word Embeddings to New Languages with Morphological and Phonological Subword Representations

    Authors: Aditi Chaudhary, Chunting Zhou, Lori Levin, Graham Neubig, David R. Mortensen, Jaime G. Carbonell

    Abstract: Much work in Natural Language Processing (NLP) has been for resource-rich languages, making generalization to new, less-resourced languages challenging. We present two approaches for improving generalization to low-resourced languages by adapting continuous word representations using linguistically motivated subword units: phonemes, morphemes and graphemes. Our method requires neither parallel cor… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: Accepted at EMNLP 2018

  39. arXiv:1807.02235  [pdf, other

    cs.LG stat.ML

    Towards more Reliable Transfer Learning

    Authors: Zirui Wang, Jaime Carbonell

    Abstract: Multi-source transfer learning has been proven effective when within-target labeled data is scarce. Previous work focuses primarily on exploiting domain similarities and assumes that source domains are richly or at least comparably labeled. While this strong assumption is never true in practice, this paper relaxes it and addresses challenges related to sources with diverse labeling volume and dive… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: ECML-PKDD 2018

  40. arXiv:1806.00179  [pdf, other

    cs.LG cs.CV stat.ML

    The Nonlinearity Coefficient - Predicting Generalization in Deep Neural Networks

    Authors: George Philipp, Jaime G. Carbonell

    Abstract: For a long time, designing neural architectures that exhibit high performance was considered a dark art that required expert hand-tuning. One of the few well-known guidelines for architecture design is the avoidance of exploding gradients, though even this guideline has remained relatively vague and circumstantial. We introduce the nonlinearity coefficient (NLC), a measurement of the complexity of… ▽ More

    Submitted 30 January, 2019; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: Previous name: The Nonlinearity Coefficient - Predicting Overfitting in Deep Neural Networks

  41. Equation for the Nakanishi weight function using the inverse Stieltjes transform

    Authors: V. A. Karmanov, J. Carbonell, T. Frederico

    Abstract: The bound state Bethe-Salpeter amplitude was expressed by Nakanishi in terms of a smooth weight function g. By using the generalized Stieltjes transform, we derive an integral equation for the Nakanishi function g for a bound state case. It has the standard form g= Vg, where V is a two-dimensional integral operator. The prescription for obtaining the kernel V starting with the kernel K of the Beth… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: 7 pages

  42. arXiv:1712.05577  [pdf, other

    cs.LG cs.CV

    The exploding gradient problem demystified - definition, prevalence, impact, origin, tradeoffs, and solutions

    Authors: George Philipp, Dawn Song, Jaime G. Carbonell

    Abstract: Whereas it is believed that techniques such as Adam, batch normalization and, more recently, SeLU nonlinearities "solve" the exploding gradient problem, we show that this is not the case in general and that in a range of popular MLP architectures, exploding gradients exist and that they limit the depth to which networks can be effectively trained, both in theory and in practice. We explain why exp… ▽ More

    Submitted 6 April, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: An earlier version of this paper was named "Gradients explode - Deep Networks are shallow - ResNet explained" and presented at the ICLR 2018 workshop (https://openreview.net/forum?id=rJjcdFkPM)

  43. arXiv:1712.05440  [pdf, other

    cs.LG cs.GT

    Nonparametric Neural Networks

    Authors: George Philipp, Jaime G. Carbonell

    Abstract: Automatically determining the optimal size of a neural network for a given task without prior information currently requires an expensive global search and training many networks from scratch. In this paper, we address the problem of automatically finding a good network size during a single training cycle. We introduce *nonparametric neural networks*, a non-probabilistic framework for conducting o… ▽ More

    Submitted 14 December, 2017; originally announced December 2017.

    Comments: ICLR 2017

  44. arXiv:1711.08352  [pdf, other

    cs.LG stat.ML

    Asymmetric Variational Autoencoders

    Authors: Guoqing Zheng, Yiming Yang, Jaime Carbonell

    Abstract: Variational inference for latent variable models is prevalent in various machine learning problems, typically solved by maximizing the Evidence Lower Bound (ELBO) of the true data likelihood with respect to a variational distribution. However, freely enriching the family of variational distribution is challenging since the ELBO requires variational likelihood evaluations of the latent variables. I… ▽ More

    Submitted 9 July, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: ICML 2018 Workshop on Theoretical Foundations and Applications of Deep Generative Models

  45. arXiv:1711.02255  [pdf, other

    cs.LG

    Convolutional Normalizing Flows

    Authors: Guoqing Zheng, Yiming Yang, Jaime Carbonell

    Abstract: Bayesian posterior inference is prevalent in various machine learning problems. Variational inference provides one way to approximate the posterior distribution, however its expressive power is limited and so is the accuracy of resulting approximation. Recently, there has a trend of using neural networks to approximate the variational posterior distribution due to the flexibility of neural network… ▽ More

    Submitted 9 July, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: ICML 2018 Workshop on Theoretical Foundations and Applications of Deep Generative Models

  46. arXiv:1707.08608  [pdf, ps, other

    cs.CL

    Gradient-based Inference for Networks with Output Constraints

    Authors: Jay Yoon Lee, Sanket Vaibhav Mehta, Michael Wick, Jean-Baptiste Tristan, Jaime Carbonell

    Abstract: Practitioners apply neural networks to increasingly complex problems in natural language processing, such as syntactic parsing and semantic role labeling that have rich output structures. Many such structured-prediction problems require deterministic constraints on the output values; for example, in sequence-to-sequence syntactic parsing, we require that the sequential outputs encode valid trees.… ▽ More

    Submitted 22 April, 2019; v1 submitted 26 July, 2017; originally announced July 2017.

    Comments: AAAI 2019

  47. arXiv:1707.04822  [pdf, other

    cs.LG cs.AI

    Block-Normalized Gradient Method: An Empirical Study for Training Deep Neural Network

    Authors: Adams Wei Yu, Lei Huang, Qihang Lin, Ruslan Salakhutdinov, Jaime Carbonell

    Abstract: In this paper, we propose a generic and simple strategy for utilizing stochastic gradient information in optimization. The technique essentially contains two consecutive steps in each iteration: 1) computing and normalizing each block (layer) of the mini-batch stochastic gradient; 2) selecting appropriate step size to update the decision variable (parameter) towards the negative of the block-norma… ▽ More

    Submitted 23 April, 2018; v1 submitted 16 July, 2017; originally announced July 2017.

  48. Modelling double charge exchange response function for tetraneutron system

    Authors: Rimantas Lazauskas, Emiko Hiyama, Jaume Carbonell

    Abstract: This work is an attempt to model the $4n$ response function of a recent RIKEN experimental study of the double charge exchange $^4$He($^8$He,$^8$Be)$^4$n reaction in order to put in evidence an eventual enhancement mechanism of the zero energy cross section, including a near-threshold resonance. This resonance can indeed be reproduced only by adding to the standard nuclear Hamiltonian an unphysica… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

    Journal ref: Progress of Theoretical and Experimental Physics, Volume 2017, Issue 7, July 2017, 073D03

  49. arXiv:1704.04160  [pdf, ps, other

    hep-ph hep-th nucl-th

    Bound state equation for the Nakanishi weight function

    Authors: J. Carbonell, T. Frederico, V. A. Karmanov

    Abstract: The bound state Bethe-Salpeter amplitude was expressed by Nakanishi using a two-dimensional integral representation, in terms of a smooth weight function $g$, which carries the detailed dynamical information. A similar, but one-dimensional, integral representation can be obtained for the Light-Front wave function in terms of the same weight function $g$. By using the generalized Stieltjes transfor… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: 12 pages, 1 figure, to appear in Phys. Lett. B

  50. arXiv:1703.00994  [pdf, ps, other

    stat.ML cs.LG

    Co-Clustering for Multitask Learning

    Authors: Keerthiram Murugesan, Jaime Carbonell, Yiming Yang

    Abstract: This paper presents a new multitask learning framework that learns a shared representation among the tasks, incorporating both task and feature clusters. The jointly-induced clusters yield a shared latent subspace where task relationships are learned more effectively and more generally than in state-of-the-art multitask learning methods. The proposed general framework enables the derivation of mor… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.