Skip to main content

Showing 1–36 of 36 results for author: Won, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01742  [pdf, other

    cs.PL

    The Continuous Tensor Abstraction: Where Indices are Real

    Authors: Jaeyeon Won, Willow Ahrens, Joel S. Emer, Saman Amarasinghe

    Abstract: This paper introduces the continuous tensor abstraction, allowing indices to take real-number values (e.g., A[3.14]), and provides a continuous loop construct that iterates over the infinitely large set of real numbers. This paper expands the existing tensor abstraction to include continuous tensors that exhibit a piecewise-constant property, enabling the transformation of an infinite amount of co… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. A Spatiotemporal Illumination Model for 3D Image Fusion in Optical Coherence Tomography

    Authors: Stefan Ploner, Jungeun Won, Julia Schottenhamml, Jessica Girgis, Kenneth Lam, Nadia Waheed, James Fujimoto, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a non-invasive, micrometer-scale imaging modality that has become a clinical standard in ophthalmology. By raster-scanning the retina, sequential cross-sectional image slices are acquired to generate volumetric data. In-vivo imaging suffers from discontinuities between slices that show up as motion and illumination artifacts. We present a new illumination mode… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Presented orally & as poster on 20th April 2023 at the IEEE International Symposium on Biomedical Imaging (ISBI) in Cartagena, Colombia. 6 pages, 3 figures. You can find the official version with broken equations and bad contrast figures under https://ieeexplore.ieee.org/document/10230526

  3. arXiv:2401.17736  [pdf, other

    cs.CV

    Leveraging Human-Machine Interactions for Computer Vision Dataset Quality Enhancement

    Authors: Esla Timothy Anzaku, Hyesoo Hong, **-Woo Park, Wonjun Yang, Kangmin Kim, JongBum Won, Deshika Vinoshani Kumari Herath, Arnout Van Messem, Wesley De Neve

    Abstract: Large-scale datasets for single-label multi-class classification, such as \emph{ImageNet-1k}, have been instrumental in advancing deep learning and computer vision. However, a critical and often understudied aspect is the comprehensive quality assessment of these datasets, especially regarding potential multi-label annotation errors. In this paper, we introduce a lightweight, user-friendly, and sc… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  4. arXiv:2401.04847  [pdf, other

    stat.ML cs.LG math.OC

    On the Correctness of the Generalized Isotonic Recursive Partitioning Algorithm

    Authors: Joong-Ho Won, Jihan Jung

    Abstract: This paper presents an in-depth analysis of the generalized isotonic recursive partitioning (GIRP) algorithm for fitting isotonic models under separable convex losses, proposed by Luss and Rosset [J. Comput. Graph. Statist., 23 (2014), pp. 192--201] for differentiable losses and extended by Painsky and Rosset [IEEE Trans. Pattern Anal. Mach. Intell., 38 (2016), pp. 308-321] for nondifferentiable l… ▽ More

    Submitted 10 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: 19 pages, 1 figure

  5. arXiv:2312.01133  [pdf, other

    stat.ML cs.LG

    $t^3$-Variational Autoencoder: Learning Heavy-tailed Data with Student's t and Power Divergence

    Authors: Juno Kim, Jaehyuk Kwon, Mincheol Cho, Hyunjong Lee, Joong-Ho Won

    Abstract: The variational autoencoder (VAE) typically employs a standard normal prior as a regularizer for the probabilistic latent encoder. However, the Gaussian tail often decays too quickly to effectively accommodate the encoded points, failing to preserve crucial structures hidden in the data. In this paper, we explore the use of heavy-tailed models to combat over-regularization. Drawing upon insights f… ▽ More

    Submitted 3 March, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: ICLR 2024; 27 pages, 7 figures, 8 tables

  6. MOCHA: Real-Time Motion Characterization via Context Matching

    Authors: Deok-Kyeong Jang, Yuting Ye, Jungdam Won, Sung-Hee Lee

    Abstract: Transforming neutral, characterless input motions to embody the distinct style of a notable character in real time is highly compelling for character animation. This paper introduces MOCHA, a novel online motion characterization framework that transfers both motion styles and body proportions from a target character to an input source motion. MOCHA begins by encoding the input motion into a motion… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: presented at Siggraph Asia 2023

  7. arXiv:2309.13742  [pdf, other

    cs.GR cs.CV

    DROP: Dynamics Responses from Human Motion Prior and Projective Dynamics

    Authors: Yifeng Jiang, Jungdam Won, Yuting Ye, C. Karen Liu

    Abstract: Synthesizing realistic human movements, dynamically responsive to the environment, is a long-standing objective in character animation, with applications in computer vision, sports, and healthcare, for motion prediction and data augmentation. Recent kinematics-based generative motion models offer impressive scalability in modeling extensive motion data, albeit without an interface to reason about… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH Asia 2023, Video https://youtu.be/tF5WW7qNMLI, Website: https://stanford-tml.github.io/drop/

  8. arXiv:2308.10145  [pdf, other

    stat.ML cs.LG

    Wasserstein Geodesic Generator for Conditional Distributions

    Authors: Young-geun Kim, Kyungbok Lee, Youngwon Choi, Joong-Ho Won, Myunghee Cho Paik

    Abstract: Generating samples given a specific label requires estimating conditional distributions. We derive a tractable upper bound of the Wasserstein distance between conditional distributions to lay the theoretical groundwork to learn conditional distributions. Based on this result, we propose a novel conditional generation algorithm where conditional distributions are fully characterized by a metric spa… ▽ More

    Submitted 28 August, 2023; v1 submitted 19 August, 2023; originally announced August 2023.

  9. arXiv:2307.01938  [pdf, other

    cs.CV

    Physics-based Motion Retargeting from Sparse Inputs

    Authors: Daniele Reda, Jungdam Won, Yuting Ye, Michiel van de Panne, Alexander Winkler

    Abstract: Avatars are important to create interactive and immersive experiences in virtual worlds. One challenge in animating these characters to mimic a user's motion is that commercial AR/VR products consist only of a headset and controllers, providing very limited sensor data of the user's pose. Another challenge is that an avatar might have a different skeleton structure than a human and the map** bet… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: More info at: https://www.cs.ubc.ca/~dreda/retargeting.html

  10. arXiv:2306.13264  [pdf, other

    cs.LG cs.AI

    FedSelect: Customized Selection of Parameters for Fine-Tuning during Personalized Federated Learning

    Authors: Rishub Tamirisa, John Won, Chengjun Lu, Ron Arel, Andy Zhou

    Abstract: Recent advancements in federated learning (FL) seek to increase client-level performance by fine-tuning client parameters on local data or personalizing architectures for the local task. Existing methods for such personalization either prune a global model or fine-tune a global model on a local client distribution. However, these existing methods either personalize at the expense of retaining impo… ▽ More

    Submitted 8 June, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Journal ref: International Workshop on Federated Learning and Analytics in Practice: Algorithms, Systems, Applications, and Opportunities in Conjunction with ICML 2023

  11. arXiv:2306.05666  [pdf, other

    cs.GR cs.LG cs.RO

    QuestEnvSim: Environment-Aware Simulated Motion Tracking from Sparse Sensors

    Authors: Sunmin Lee, Sebastian Starke, Yuting Ye, Jungdam Won, Alexander Winkler

    Abstract: Replicating a user's pose from only wearable sensors is important for many AR/VR applications. Most existing methods for motion tracking avoid environment interaction apart from foot-floor contact due to their complex dynamics and hard constraints. However, in daily life people regularly interact with their environment, e.g. by sitting on a couch or leaning on a desk. Using Reinforcement Learning,… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    ACM Class: I.3.6

    Journal ref: SIGGRAPH 23 Conference Proceedings, August 6-10, 2023, Los Angeles, CA, USA

  12. Bidirectional GaitNet: A Bidirectional Prediction Model of Human Gait and Anatomical Conditions

    Authors: Jungnam Park, Moon Seok Park, Jehee Lee, Jungdam Won

    Abstract: We present a novel generative model, called Bidirectional GaitNet, that learns the relationship between human anatomy and its gait. The simulation model of human anatomy is a comprehensive, full-body, simulation-ready, musculoskeletal model with 304 Hill-type musculotendon units. The Bidirectional GaitNet consists of forward and backward models. The forward model predicts a gait pattern of a perso… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    ACM Class: I.3; I.6

  13. arXiv:2305.20041  [pdf, other

    cs.GR cs.RO

    Simulation and Retargeting of Complex Multi-Character Interactions

    Authors: Yunbo Zhang, Deepak Gopinath, Yuting Ye, Jessica Hodgins, Greg Turk, Jungdam Won

    Abstract: We present a method for reproducing complex multi-character interactions for physically simulated humanoid characters using deep reinforcement learning. Our method learns control policies for characters that imitate not only individual motions, but also the interactions between characters, while maintaining balance and matching the complexity of reference data. Our approach uses a novel reward for… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 11 pages. Accepted to SIGGRAPH 2023

  14. arXiv:2305.14792  [pdf, other

    cs.RO cs.GR

    ACE: Adversarial Correspondence Embedding for Cross Morphology Motion Retargeting from Human to Nonhuman Characters

    Authors: Tianyu Li, Jungdam Won, Alexander Clegg, Jeonghwan Kim, Akshara Rai, Sehoon Ha

    Abstract: Motion retargeting is a promising approach for generating natural and compelling animations for nonhuman characters. However, it is challenging to translate human movements into semantically equivalent motions for target characters with different morphologies due to the ambiguous nature of the problem. This work presents a novel learning-based motion retargeting framework, Adversarial Corresponden… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  15. arXiv:2305.03249  [pdf, other

    cs.GR cs.LG

    PMP: Learning to Physically Interact with Environments using Part-wise Motion Priors

    Authors: **seok Bae, Jungdam Won, Donggeun Lim, Cheol-Hui Min, Young Min Kim

    Abstract: We present a method to animate a character incorporating multiple part-wise motion priors (PMP). While previous works allow creating realistic articulated motions from reference data, the range of motion is largely limited by the available samples. Especially for the interaction-rich scenarios, it is impractical to attempt acquiring every possible interacting motion, as the combination of physical… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 13 pages, 11 figures

  16. arXiv:2303.14711  [pdf, other

    eess.IV cs.CV

    Unsupervised detection of small hyperreflective features in ultrahigh resolution optical coherence tomography

    Authors: Marcel Reimann, Jungeun Won, Hiroyuki Takahashi, Antonio Yaghy, Yunchan Hwang, Stefan Ploner, Junhong Lin, Jessica Girgis, Kenneth Lam, Siyu Chen, Nadia K. Waheed, Andreas Maier, James G. Fujimoto

    Abstract: Recent advances in optical coherence tomography such as the development of high speed ultrahigh resolution scanners and corresponding signal processing techniques may reveal new potential biomarkers in retinal diseases. Newly visible features are, for example, small hyperreflective specks in age-related macular degeneration. Identifying these new markers is crucial to investigate potential associa… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted as poster at BVM workshop 2023 (https://www.bvm-workshop.org/). The arXiv version provides full quality figures. 6 pages content (2 figures)

  17. arXiv:2210.16598  [pdf, other

    cs.LG

    Self-Supervised Predictive Coding with Multimodal Fusion for Patient Deterioration Prediction in Fine-grained Time Resolution

    Authors: Kwanhyung Lee, John Won, Heejung Hyun, Sangchul Hahn, Edward Choi, Joohyung Lee

    Abstract: Accurate time prediction of patients' critical events is crucial in urgent scenarios where timely decision-making is important. Though many studies have proposed automatic prediction methods using Electronic Health Records (EHR), their coarse-grained time resolutions limit their practical usage in urgent environments such as the emergency department (ED) and intensive care unit (ICU). Therefore, i… ▽ More

    Submitted 13 April, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: Accepted as oral contribution at Trustworthy Machine Learning for Healthcare Workshop, ICLR 2023

  18. arXiv:2210.14685  [pdf, other

    cs.LG cs.AI cs.RO

    Leveraging Demonstrations with Latent Space Priors

    Authors: Jonas Gehring, Deepak Gopinath, Jungdam Won, Andreas Krause, Gabriel Synnaeve, Nicolas Usunier

    Abstract: Demonstrations provide insight into relevant state or action space regions, bearing great potential to boost the efficiency and practicality of reinforcement learning agents. In this work, we propose to leverage demonstration datasets by combining skill learning and sequence modeling. Starting with a learned joint latent space, we separately train a generative model of demonstration sequences and… ▽ More

    Submitted 13 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Published in Transactions on Machine Learning Research (03/2023)

  19. QuestSim: Human Motion Tracking from Sparse Sensors with Simulated Avatars

    Authors: Alexander Winkler, Jungdam Won, Yuting Ye

    Abstract: Real-time tracking of human body motion is crucial for interactive and immersive experiences in AR/VR. However, very limited sensor data about the body is available from standalone wearable devices such as HMDs (Head Mounted Devices) or AR glasses. In this work, we present a reinforcement learning framework that takes in sparse signals from an HMD and two controllers, and simulates plausible and p… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Journal ref: SIGGRAPH Asia 2022 Conference Papers, December 6 to 9, 2022, Daegu, Republic of Korea

  20. arXiv:2209.07232  [pdf, other

    eess.IV cs.CV

    A Spatiotemporal Model for Precise and Efficient Fully-automatic 3D Motion Correction in OCT

    Authors: Stefan Ploner, Siyu Chen, Jungeun Won, Lennart Husvogt, Katharina Breininger, Julia Schottenhamml, James Fujimoto, Andreas Maier

    Abstract: Optical coherence tomography (OCT) is a micrometer-scale, volumetric imaging modality that has become a clinical standard in ophthalmology. OCT instruments image by raster-scanning a focused light spot across the retina, acquiring sequential cross-sectional images to generate volumetric data. Patient eye motion during the acquisition poses unique challenges: Non-rigid, discontinuous distortions ca… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Presented at MICCAI 2022 (main conference). The arXiv version provides full quality figures. 9 pages content (5 figures) + 2 pages references + 2 pages supplementary material (2 figures)

  21. arXiv:2209.07115  [pdf

    cs.CV

    LAVOLUTION: Measurement of Non-target Structural Displacement Calibrated by Structured Light

    Authors: Jongbin Won, Minhyuk Song, Gunhee Kim, Jong-Woong Park, Haemin Jeon

    Abstract: Displacement is an important measurement for the assessment of structural conditions, but its field measurement is often hindered by difficulties associated with sensor installation and measurement accuracy. To overcome the disadvantages of conventional displacement measurement, computer vision (CV)-based methods have been implemented due to their remote sensing capabilities and accuracy. This pap… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 27 pages, 12 figures, 3 tables

  22. The Sparse Abstract Machine

    Authors: Olivia Hsu, Maxwell Strange, Ritvik Sharma, Jaeyeon Won, Kunle Olukotun, Joel Emer, Mark Horowitz, Fredrik Kjolstad

    Abstract: We propose the Sparse Abstract Machine (SAM), an abstract machine model for targeting sparse tensor algebra to reconfigurable and fixed-function spatial dataflow accelerators. SAM defines a streaming dataflow abstraction with sparse primitives that encompass a large space of scheduled tensor algebra expressions. SAM dataflow graphs naturally separate tensor formats from algorithms and are expressi… ▽ More

    Submitted 23 March, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: 18 pages, 17 figures, 3 tables

    Journal ref: ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems Volume 3 (2023) 710-726

  23. arXiv:2208.12813  [pdf

    cs.LG cs.AI

    Abnormal Local Clustering in Federated Learning

    Authors: Jihwan Won

    Abstract: Federated learning is a model for privacy without revealing private data by transfer models instead of personal and private data from local client devices. While, in the global model, it's crucial to recognize each local data is normal. This paper suggests one method to separate normal locals and abnormal locals by Euclidean similarity clustering of vectors extracted by inputting dummy data in loc… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 3 pages

  24. arXiv:2206.12663  [pdf, other

    stat.ML cs.LG stat.CO

    Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert

    Authors: Yoonhyung Lee, Sungdong Lee, Joong-Ho Won

    Abstract: The implicit stochastic gradient descent (ISGD), a proximal version of SGD, is gaining interest in the literature due to its stability over (explicit) SGD. In this paper, we conduct an in-depth analysis of the two modes of ISGD for smooth convex functions, namely proximal Robbins-Monro (proxRM) and proximal Poylak-Ruppert (proxPR) procedures, for their use in statistical inference on model paramet… ▽ More

    Submitted 28 June, 2022; v1 submitted 25 June, 2022; originally announced June 2022.

    Comments: Accepted to the 39 th International Conference on Machine Learning. This version contains corrections to typos found after submitting the camera-ready version

  25. arXiv:2204.07135  [pdf, other

    cs.LG cs.AI cs.CL cs.HC

    Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems

    Authors: Mohammad Kachuee, **seok Nam, Sarthak Ahuja, **-Myung Won, Sung** Lee

    Abstract: Skill routing is an important component in large-scale conversational systems. In contrast to traditional rule-based skill routing, state-of-the-art systems use a model-based approach to enable natural conversations. To provide supervision signal required to train such models, ideas such as human annotation, replication of a rule-based system, relabeling based on user paraphrases, and bandit-based… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: NAACL 2022

  26. Transformer Inertial Poser: Real-time Human Motion Reconstruction from Sparse IMUs with Simultaneous Terrain Generation

    Authors: Yifeng Jiang, Yuting Ye, Deepak Gopinath, Jungdam Won, Alexander W. Winkler, C. Karen Liu

    Abstract: Real-time human motion reconstruction from a sparse set of (e.g. six) wearable IMUs provides a non-intrusive and economic approach to motion capture. Without the ability to acquire position information directly from IMUs, recent works took data-driven approaches that utilize large human motion datasets to tackle this under-determined problem. Still, challenges remain such as temporal consistency,… ▽ More

    Submitted 8 December, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: SIGGRAPH Asia 2022. Video: https://youtu.be/rXb6SaXsnc0. Code: https://github.com/jyf588/transformer-inertial-poser

  27. Conditional Motion In-betweening

    Authors: Jihoon Kim, Taehyun Byun, Seungyoun Shin, Jungdam Won, Sungjoon Choi

    Abstract: Motion in-betweening (MIB) is a process of generating intermediate skeletal movement between the given start and target poses while preserving the naturalness of the motion, such as periodic footstep motion while walking. Although state-of-the-art MIB methods are capable of producing plausible motions given sparse key-poses, they often lack the controllability to generate motions satisfying the se… ▽ More

    Submitted 6 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Journal ref: Pattern Recognition, Volume 132, December 2022, 108894

  28. arXiv:2109.13362  [pdf, other

    cs.RO

    FastMimic: Model-based Motion Imitation for Agile, Diverse and Generalizable Quadrupedal Locomotion

    Authors: Tianyu Li, Jungdam Won, Sehoon Ha, Akshara Rai

    Abstract: Robots operating in human environments need various skills, like slow and fast walking, turning, side-step**, and many more. However, building robot controllers that can exhibit such a large range of behaviors is a challenging problem that requires tedious investigation for every task. We present a unified model-based control algorithm for imitating different animal gaits without expensive simul… ▽ More

    Submitted 25 February, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: Submitted to RA-L. Video Link: https://youtu.be/Z-1YfpaFO_g

  29. arXiv:2010.16114  [pdf, ps, other

    stat.CO cs.MS

    DistStat.jl: Towards Unified Programming for High-Performance Statistical Computing Environments in Julia

    Authors: Seyoon Ko, Hua Zhou, ** Zhou, Joong-Ho Won

    Abstract: The demand for high-performance computing (HPC) is ever-increasing for everyday statistical computing purposes. The downside is that we need to write specialized code for each HPC environment. CPU-level parallelization needs to be explicitly coded for effective use of multiple nodes in cluster supercomputing environments. Acceleration via graphics processing units (GPUs) requires to write kernel c… ▽ More

    Submitted 30 October, 2020; originally announced October 2020.

  30. arXiv:2006.10318  [pdf, other

    cs.CR cs.RO

    Drift with Devil: Security of Multi-Sensor Fusion based Localization in High-Level Autonomous Driving under GPS Spoofing (Extended Version)

    Authors: Junjie Shen, Jun Yeon Won, Zeyuan Chen, Qi Alfred Chen

    Abstract: For high-level Autonomous Vehicles (AV), localization is highly security and safety critical. One direct threat to it is GPS spoofing, but fortunately, AV systems today predominantly use Multi-Sensor Fusion (MSF) algorithms that are generally believed to have the potential to practically defeat GPS spoofing. However, no prior work has studied whether today's MSF algorithms are indeed sufficiently… ▽ More

    Submitted 12 August, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: This is an extended version of our paper, which appears in USENIX Security 2020. For attack demos, see our project website: https://sites.google.com/view/cav-sec/fusionripper

  31. arXiv:2006.03333  [pdf, other

    stat.ML cs.LG

    Principled learning method for Wasserstein distributionally robust optimization with local perturbations

    Authors: Yongchan Kwon, Wonyoung Kim, Joong-Ho Won, Myunghee Cho Paik

    Abstract: Wasserstein distributionally robust optimization (WDRO) attempts to learn a model that minimizes the local worst-case risk in the vicinity of the empirical data distribution defined by Wasserstein ball. While WDRO has received attention as a promising tool for inference since its introduction, its theoretical understanding has not been fully matured. Gao et al. (2017) proposed a minimizer based on… ▽ More

    Submitted 22 June, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted for ICML 2020

  32. arXiv:1903.08831  [pdf

    cs.CV

    Non-target Structural Displacement Measurement Using Reference Frame Based Deepflow

    Authors: Jongbin Won, Jong-Woong Park, Do-Soo Moon

    Abstract: Structural displacement is crucial for structural health monitoring, although it is very challenging to measure in field conditions. Most existing displacement measurement methods are costly, labor intensive, and insufficiently accurate for measuring small dynamic displacements. Computer vision (CV) based methods incorporate optical devices with advanced image processing algorithms to accurately,… ▽ More

    Submitted 21 March, 2019; originally announced March 2019.

  33. arXiv:1802.06964  [pdf, other

    cs.CV

    Co-occurrence matrix analysis-based semi-supervised training for object detection

    Authors: Min-Kook Choi, Jaehyeong Park, Jihun Jung, Heechul Jung, **-Hee Lee, Woong Jae Won, Woo Young Jung, **cheol Kim, Soon Kwon

    Abstract: One of the most important factors in training object recognition networks using convolutional neural networks (CNNs) is the provision of annotated data accompanying human judgment. Particularly, in object detection or semantic segmentation, the annotation process requires considerable human effort. In this paper, we propose a semi-supervised learning (SSL)-based training methodology for object det… ▽ More

    Submitted 19 February, 2018; originally announced February 2018.

    Comments: Submitted to International Conference on Image Processing (ICIP) 2018

  34. arXiv:1503.04250  [pdf, other

    cs.MM cs.CL

    The YLI-MED Corpus: Characteristics, Procedures, and Plans

    Authors: Julia Bernd, Damian Borth, Benjamin Elizalde, Gerald Friedland, Heather Gallagher, Luke Gottlieb, Adam Janin, Sara Karabashlieva, Jocelyn Takahashi, Jennifer Won

    Abstract: The YLI Multimedia Event Detection corpus is a public-domain index of videos with annotations and computed features, specialized for research in multimedia event detection (MED), i.e., automatically identifying what's happening in a video by analyzing the audio and visual content. The videos indexed in the YLI-MED corpus are a subset of the larger YLI feature corpus, which is being developed by th… ▽ More

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: 47 pages; 3 figures; 25 tables. Also published as ICSI Technical Report TR-15-001

    Report number: TR-15-001

  35. arXiv:1411.6365  [pdf

    cs.CV

    On the mathematic modeling of non-parametric curves based on cubic Bézier curves

    Authors: Ha Jong Won, Choe Chun Hwa, Li Kum Song

    Abstract: Bézier splines are widely available in various systems with the curves and surface designs. In general, the Bézier spline can be specified with the Bézier curve segments and a Bézier curve segment can be fitted to any number of control points. The number of control points determines the degree of the Bézier polynomial. This paper presents a method which determines control points for Bézier curves… ▽ More

    Submitted 24 November, 2014; originally announced November 2014.

  36. arXiv:1411.4114  [pdf

    cs.CL cs.CV cs.LG

    Definition of Visual Speech Element and Research on a Method of Extracting Feature Vector for Korean Lip-Reading

    Authors: Ha Jong Won, Li Gwang Chol, Kim Hyok Chol, Li Kum Song

    Abstract: In this paper, we defined the viseme (visual speech element) and described about the method of extracting visual feature vector. We defined the 10 visemes based on vowel by analyzing of Korean utterance and proposed the method of extracting the 20-dimensional visual feature vector, combination of static features and dynamic features. Lastly, we took an experiment in recognizing words based on 3-vi… ▽ More

    Submitted 15 November, 2014; originally announced November 2014.