Skip to main content

Showing 1–17 of 17 results for author: Lee, M W

.
  1. arXiv:2403.06880  [pdf, other

    cs.LG cs.AI

    Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning

    Authors: Junseok Park, Yoonsung Kim, Hee Bin Yoo, Min Whoo Lee, Kibeom Kim, Won-Seok Choi, Minsu Lee, Byoung-Tak Zhang

    Abstract: Toddlers evolve from free exploration with sparse feedback to exploiting prior experiences for goal-directed learning with denser rewards. Drawing inspiration from this Toddler-Inspired Reward Transition, we set out to explore the implications of varying reward transitions when incorporated into Reinforcement Learning (RL) tasks. Central to our inquiry is the transition from sparse to potential-ba… ▽ More

    Submitted 18 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted as a full paper at AAAI 2024 (Oral presentation): 7 pages (main paper), 2 pages (references), 17 pages (appendix) each

  2. Visual Hindsight Self-Imitation Learning for Interactive Navigation

    Authors: Kibeom Kim, Kisung Shin, Min Whoo Lee, Moonhoen Lee, Minsu Lee, Byoung-Tak Zhang

    Abstract: Interactive visual navigation tasks, which involve following instructions to reach and interact with specific targets, are challenging not only because successful experiences are very rare but also because the complex visual inputs require a substantial number of samples. Previous methods for these tasks often rely on intricately designed dense rewards or the use of expensive expert data for imita… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 9 figures and under-review

  3. arXiv:2305.13741  [pdf, other

    cs.LG cs.AI

    L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning

    Authors: Kibeom Kim, Hyundo Lee, Min Whoo Lee, Moonheon Lee, Minsu Lee, Byoung-Tak Zhang

    Abstract: Tasks that involve interaction with various targets are called multi-target tasks. When applying general reinforcement learning approaches for such tasks, certain targets that are difficult to access or interact with may be neglected throughout the course of training - a predicament we call Under-explored Target Problem (UTP). To address this problem, we propose L-SA (Learning by adaptive Sampling… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages include appendices, it is under-review

  4. arXiv:2208.09663  [pdf, other

    physics.flu-dyn nlin.CD

    Feature Identification in Complex Fluid Flows by Convolutional Neural Networks

    Authors: Shizheng Wen, Michael W. Lee, Kai M. Kruger Bastos, Earl H. Dowell

    Abstract: Recent efforts have shown machine learning to be useful for the prediction of nonlinear fluid dynamics. Predictive accuracy is often a central motivation for employing neural networks, but the pattern recognition central to the network function is equally valuable for purposes of enhancing our dynamical insight into confounding dynamics. In this paper, convolutional neural networks (CNNs) were tra… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

  5. arXiv:2208.04832  [pdf, other

    cs.AI cs.LG cs.NE

    On the Importance of Critical Period in Multi-stage Reinforcement Learning

    Authors: Junseok Park, Inwoo Hwang, Min Whoo Lee, Hyunseok Oh, Minsu Lee, Youngki Lee, Byoung-Tak Zhang

    Abstract: The initial years of an infant's life are known as the critical period, during which the overall development of learning performance is significantly impacted due to neural plasticity. In recent studies, an AI agent, with a deep neural network mimicking mechanisms of actual neurons, exhibited a learning period similar to human's critical period. Especially during this initial period, the appropria… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted by the ICML Complex Feedback in Online Learning Workshop (Open Problems) 2022

  6. arXiv:2203.11987  [pdf, other

    cs.CV

    PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers

    Authors: Ryan Grainger, Thomas Paniagua, Xi Song, Naresh Cuntoor, Mun Wai Lee, Tianfu Wu

    Abstract: Vision Transformers (ViTs) are built on the assumption of treating image patches as ``visual tokens" and learn patch-to-patch attention. The patch embedding based tokenizer has a semantic gap with respect to its counterpart, the textual tokenizer. The patch-to-patch attention suffers from the quadratic complexity issue, and also makes it non-trivial to explain learned ViTs. To address these issues… ▽ More

    Submitted 6 April, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: CVPR 2023

  7. arXiv:2110.12985  [pdf, other

    cs.LG cs.AI cs.RO

    Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

    Authors: Kibeom Kim, Min Whoo Lee, Yoonsung Kim, Je-Hwan Ryu, Minsu Lee, Byoung-Tak Zhang

    Abstract: Learning in a multi-target environment without prior knowledge about the targets requires a large amount of samples and makes generalization difficult. To solve this problem, it is important to be able to discriminate targets through semantic understanding. In this paper, we propose goal-aware cross-entropy (GACE) loss, that can be utilized in a self-supervised way using auto-labeled goal states a… ▽ More

    Submitted 26 October, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 accepted, 19 pages including appendix and reference, 8 figures

  8. arXiv:1909.05057  [pdf, other

    nucl-ex nucl-th

    Quenching of gamma0 transition results from p-wave neutron inducing doorway mechanism

    Authors: T. F. Wang, X. T. Yang, T. Katabuchi, Z. M. Li, L. H. Zhu, M. W. Lee, G. N. Kim, T. I. Ro, Y. R. Kang, M. Igashira

    Abstract: Gamma-strength function essentially distinguishes the reaction mechanisms of charged particle inelastic and neutron capture reactions, reflecting from the ratios of transition of neutron capture to low-lying states. The extraordinary quenching of gamma_0 transition of p-wave neutron resonance reaction in 3s-region nucleus 57Fe is observed, for the first time, due to the non-forming 2p-1h doorway s… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: 6 pages, 6 figures

  9. arXiv:1402.1248  [pdf, other

    physics.atom-ph physics.optics quant-ph

    Frequency Stabilization of a 369 nm Diode Laser by Nonlinear Spectroscopy of Ytterbium Ions in a Discharge

    Authors: Michael W Lee, Marie Claire Jarratt, Christian Marciniak, Michael J Biercuk

    Abstract: We demonstrate stabilisation of an ultraviolet diode laser via Doppler free spectroscopy of Ytterbium ions in a discharge. Our technique employs polarization spectroscopy, which produces a natural dispersive lineshape whose zero-crossing is largely immune to environmental drifts, making this signal an ideal absolute frequency reference for Yb$^+$ ion trap** experiments. We stabilise an external-… ▽ More

    Submitted 6 February, 2014; originally announced February 2014.

    Comments: Related papers available at http://www.physics.usyd.edu.au/~mbiercuk/Publications.html

  10. arXiv:1308.6628  [pdf, other

    cs.CV cs.CL cs.MM

    Joint Video and Text Parsing for Understanding Events and Answering Queries

    Authors: Kewei Tu, Meng Meng, Mun Wai Lee, Tae Eun Choe, Song-Chun Zhu

    Abstract: We propose a framework for parsing video and text jointly for understanding events and answering user queries. Our framework produces a parse graph that represents the compositional structures of spatial information (objects and scenes), temporal information (actions and events) and causal information (causalities between events and fluents) in the video and text. The knowledge representation of o… ▽ More

    Submitted 21 February, 2014; v1 submitted 29 August, 2013; originally announced August 2013.

  11. arXiv:1304.1947  [pdf, other

    physics.optics physics.atom-ph physics.ins-det quant-ph

    A high-power 626 nm diode laser system for Beryllium ion trap**

    Authors: H. Ball, M. W. Lee, S. D. Gensemer, M. J. Biercuk

    Abstract: We describe a high-power, frequency-tunable, external cavity diode laser (ECDL) system near 626 nm useful for laser cooling of trapped $^9$Be$^+$ ions. A commercial single-mode laser diode with rated power output of 170 mW at 635 nm is cooled to $\approx - 31$ C, and a single longitudinal mode is selected via the Littrow configuration. In our setup, involving multiple stages of thermoelectric cool… ▽ More

    Submitted 6 April, 2013; originally announced April 2013.

    Comments: Related manuscripts available at http://www.physics.usyd.edu.au/~mbiercuk/Publications.html

  12. arXiv:1011.0352  [pdf

    physics.ins-det hep-ex

    Belle II Technical Design Report

    Authors: T. Abe, I. Adachi, K. Adamczyk, S. Ahn, H. Aihara, K. Akai, M. Aloi, L. Andricek, K. Aoki, Y. Arai, A. Arefiev, K. Arinstein, Y. Arita, D. M. Asner, V. Aulchenko, T. Aushev, T. Aziz, A. M. Bakich, V. Balagura, Y. Ban, E. Barberio, T. Barvich, K. Belous, T. Bergauer, V. Bhardwaj , et al. (387 additional authors not shown)

    Abstract: The Belle detector at the KEKB electron-positron collider has collected almost 1 billion Y(4S) events in its decade of operation. Super-KEKB, an upgrade of KEKB is under construction, to increase the luminosity by two orders of magnitude during a three-year shutdown, with an ultimate goal of 8E35 /cm^2 /s luminosity. To exploit the increased luminosity, an upgrade of the Belle detector has been pr… ▽ More

    Submitted 1 November, 2010; originally announced November 2010.

    Comments: Edited by: Z. Doležal and S. Uno

    Report number: KEK Report 2010-1

  13. Excitation functions of proton induced nuclear reactions on natW up to 40 MeV

    Authors: M. U. Khandake, M. S. Uddin, K. S. Kim, M. W. Lee, Y. S. Lee, G. N. Kim

    Abstract: Excitation functions for the production of the 181,182m,182g,183,184g,186Re and 183,184Ta radionuclides from proton bombardment on natural tungsten were measured using the stacked-foil activation technique for the proton energies up to 40 MeV. A new data set has been given for the formation of the investigated radionuclides. Results are in good agreement with the earlier reported experimental da… ▽ More

    Submitted 23 March, 2007; originally announced March 2007.

    Comments: 21papes, 14 figures

    Journal ref: Nucl.Instrum.Meth.B267:23-31,2009

  14. arXiv:physics/0411209  [pdf, ps, other

    physics.chem-ph

    Electronic Quantum Monte Carlo Calculations of Atomic Forces, Vibrations, and Anharmonicities

    Authors: Myung Won Lee, Massimo Mella, Andrew M. Rappe

    Abstract: Atomic forces are calculated for first-row monohydrides and carbon monoxide within electronic quantum Monte Carlo (QMC). Accurate and efficient forces are achieved by using an improved method for moving variational parameters in variational QMC. Newton's method with singular value decomposition (SVD) is combined with steepest descent (SD) updates along directions rejected by the SVD, after initi… ▽ More

    Submitted 8 April, 2005; v1 submitted 22 November, 2004; originally announced November 2004.

    Comments: 6 pages, 2 figures; updated content

  15. arXiv:cond-mat/9911421  [pdf, ps, other

    cond-mat physics.geo-ph

    Artifactual log-periodicity in finite size data: Relevance for earthquake aftershocks

    Authors: Y. Huang, A. Johansen, M. W. Lee, H. Saleur, D. Sornette

    Abstract: The recently proposed discrete scale invariance and its associated log-periodicity are an elaboration of the concept of scale invariance in which the system is scale invariant only under powers of specific values of the magnification factor. We report on the discovery of a novel mechanism for such log-periodicity relying solely on the manipulation of data. This ``synthetic'' scenario for log-per… ▽ More

    Submitted 4 August, 2000; v1 submitted 25 November, 1999; originally announced November 1999.

    Comments: LaTeX, JGR preprint with AGU++ v16.b and AGUTeX 5.0, use packages graphicx, psfrag and latexsym, 41 eps figures, 26 pages. In press J. Geophys. Res

    Journal ref: J. Geophys. Res. 105, pp. 25451-25471 (2000)

  16. Persistence and Quiescence of Seismicity on Fault Systems

    Authors: M. W. Lee, D. Sornette, L. Knopoff

    Abstract: We study the statistics of simulated earthquakes in a quasistatic model of two parallel heterogeneous faults within a slowly driven elastic tectonic plate. The probability that one fault remains dormant while the other is active for a time Dt following the previous activity shift is proportional to the inverse of Dt to the power 1+x, a result that is robust in the presence of annealed noise and… ▽ More

    Submitted 22 April, 1999; originally announced April 1999.

    Comments: 4 pages, 3 figures, Revtex

    Journal ref: Physical Review Letters 83 N20:4219-4222 (1999)

  17. arXiv:cond-mat/9903402  [pdf, ps, other

    cond-mat.stat-mech

    Novel Mechanism for Discrete Scale Invariance in Sandpile Models

    Authors: M. W. Lee, D. Sornette

    Abstract: Numerical simulations and a mean-field analysis of a sandpile model of earthquake aftershocks in 1d, 2d and 3d euclidean lattices determine that the average stress decays in a punctuated fashion after a main shock, with events occurring at characteristic times increasing as a geometrical series with a well-defined multiplicative factor which is a function of the stress corrosion exponent, the st… ▽ More

    Submitted 26 March, 1999; originally announced March 1999.

    Comments: 5 pages, 6 figures (revtex)

    Journal ref: European Physical Journal B 15, 193-197 (2000)