Showing 1–45 of 45 results for author: Park, H S

Searching in archive cs.
  1. arXiv:2404.13215  [pdf, other]

    physics.app-ph cs.LG

    Machine Learning-Guided Design of Non-Reciprocal and Asymmetric Elastic Chiral Metamaterials

    Authors: Lingxiao Yuan, Emma Lejeune, Harold S. Park

    Abstract: There has been significant recent interest in the mechanics community to design structures that can either violate reciprocity, or exhibit elastic asymmetry or odd elasticity. While these properties are highly desirable to enable mechanical metamaterials to exhibit novel wave propagation phenomena, it remains an open question as to how to design passive structures that exhibit both significant non…

    Submitted 19 April, 2024; originally announced April 2024.

  2. arXiv:2402.11909  [pdf, other]

    cs.CV

    One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation

    Authors: Zhixuan Yu, Ziqian Bai, Abhimitra Meka, Feitong Tan, Qiangeng Xu, Rohit Pandey, Sean Fanello, Hyun Soo Park, Yinda Zhang

    Abstract: Traditional methods for constructing high-quality, personalized head avatars from monocular videos demand extensive face captures and training time, posing a significant challenge for scalability. This paper introduces a novel approach to create high quality head avatar utilizing only a single or a few images per user. We learn a generative model for 3D animatable photo-realistic head avatar from…

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2311.18259  [pdf, other]

    cs.CV cs.AI

    Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

    Authors: Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, et al. (76 additional authors not shown)

    Abstract: We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from…

    Submitted 29 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: updated baseline results and dataset statistics to match the released v2 data; added table to appendix comparing stats of Ego-Exo4D alongside other datasets

  4. arXiv:2311.05828  [pdf, other]

    cs.CV

    Diffusion Shape Prior for Wrinkle-Accurate Cloth Registration

    Authors: Jingfan Guo, Fabian Prada, Donglai Xiang, Javier Romero, Chenglei Wu, Hyun Soo Park, Takaaki Shiratori, Shunsuke Saito

    Abstract: Registering clothes from 4D scans with vertex-accurate correspondence is challenging, yet important for dynamic appearance modeling and physics parameter estimation from real-world data. However, previous methods either rely on texture information, which is not always reliable, or achieve only coarse-level alignment. In this work, we present a novel approach to enabling accurate surface registrati…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Project page: https://www-users.cse.umn.edu/~guo00109/projects/3dv2024/

  5. arXiv:2307.14579  [pdf, other]

    cs.CV

    Neural Representation-Based Method for Metal-induced Artifact Reduction in Dental CBCT Imaging

    Authors: Hyoung Suk Park, Kiwan Jeon, Jin Keun Seo

    Abstract: This study introduces a novel reconstruction method for dental cone-beam computed tomography (CBCT), focusing on effectively reducing metal-induced artifacts commonly encountered in the presence of prevalent metallic implants. Despite significant progress in metal artifact reduction techniques, challenges persist owing to the intricate physical interactions between polychromatic X-ray beams and me…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 10 pages, 5 figures

  6. arXiv:2305.10132  [pdf, other]

    cs.CV

    Automatic 3D Registration of Dental CBCT and Face Scan Data using 2D Projection Images

    Authors: Hyoung Suk Park, Chang Min Hyun, Sang-Hwy Lee, Jin Keun Seo, Kiwan Jeon

    Abstract: This paper presents a fully automatic registration method of dental cone-beam computed tomography (CBCT) and face scan data. It can be used for a digital platform of 3D jaw-teeth-face models in a variety of applications, including 3D digital treatment planning and orthognathic surgery. Difficulties in accurately merging facial scans and CBCT images are due to the different image acquisition method…

    Submitted 26 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures, 2 tables

    MSC Class: 92C55; 15A04; 62F10

  7. arXiv:2305.09986  [pdf, other]

    eess.IV cs.CV cs.LG

    A robust multi-domain network for short-scanning amyloid PET reconstruction

    Authors: Hyoung Suk Park, Young Jin Jeong, Kiwan Jeon

    Abstract: This paper presents a robust multi-domain network designed to restore low-quality amyloid PET images acquired in a short period of time. The proposed method is trained on pairs of PET images from short (2 minutes) and standard (20 minutes) scanning times, sourced from multiple domains. Learning relevant image features between these domains with a single network is challenging. Our key contribution…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 21 pages, 7 figures, 3 tables

    MSC Class: 92C55; 68T05; 15A29; 65F22

  8. arXiv:2303.06504  [pdf, other]

    cs.CV

    Normal-guided Garment UV Prediction for Human Re-texturing

    Authors: Yasamin Jafarian, Tuanfeng Y. Wang, Duygu Ceylan, Jimei Yang, Nathan Carr, Yi Zhou, Hyun Soo Park

    Abstract: Clothes undergo complex geometric deformations, which lead to appearance changes. To edit human videos in a physically plausible way, a texture map must take into account not only the garment transformation induced by the body movements and clothes fitting, but also its 3D fine-grained surface geometry. This poses, however, a new challenge of 3D reconstruction of dynamic clothes from an image or a…

    Submitted 11 March, 2023; originally announced March 2023.

  9. arXiv:2303.01678  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Nonlinear ill-posed problem in low-dose dental cone-beam computed tomography

    Authors: Hyoung Suk Park, Chang Min Hyun, Jin Keun Seo

    Abstract: This paper describes the mathematical structure of the ill-posed nonlinear inverse problem of low-dose dental cone-beam computed tomography (CBCT) and explains the advantages of a deep learning-based approach to the reconstruction of computed tomography images over conventional regularization methods. This paper explains the underlying reasons why dental CBCT is more ill-posed than standard comput…

    Submitted 2 March, 2023; originally announced March 2023.

  10. arXiv:2209.05432  [pdf, other]

    cs.RO cs.AI cs.CV

    Self-supervised Wide Baseline Visual Servoing via 3D Equivariance

    Authors: Jinwook Huh, Jungseok Hong, Suveer Garg, Hyun Soo Park, Volkan Isler

    Abstract: One of the challenging input settings for visual servoing is when the initial and goal camera views are far apart. Such settings are difficult because the wide baseline can cause drastic changes in object appearance and cause occlusions. This paper presents a novel self-supervised visual servoing method for wide baseline images which does not require 3D ground truth supervision. Existing approache…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  11. arXiv:2207.07077  [pdf, other]

    cs.CV cs.AI

    Egocentric Scene Understanding via Multimodal Spatial Rectifier

    Authors: Tien Do, Khiem Vuong, Hyun Soo Park

    Abstract: In this paper, we study a problem of egocentric scene understanding, i.e., predicting depths and surface normals from an egocentric image. Egocentric scene understanding poses unprecedented challenges: (1) due to large head movements, the images are taken from non-canonical viewpoints (i.e., tilted images) where existing models of geometry prediction do not apply; (2) dynamic foreground objects in…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Appearing in the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

  12. arXiv:2206.14917  [pdf, other]

    stat.ML cs.CE cs.LG physics.data-an

    Towards out of distribution generalization for problems in mechanics

    Authors: Lingxiao Yuan, Harold S. Park, Emma Lejeune

    Abstract: There has been a massive increase in research interest towards applying data driven methods to problems in mechanics. While traditional machine learning (ML) methods have enabled many breakthroughs, they rely on the assumption that the training (observed) data and testing (unseen) data are independent and identically distributed (i.i.d). Thus, traditional ML approaches often break down when applie…

    Submitted 13 August, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Journal ref: Comput. Methods Appl. Mech. Engrg. 400 (2022) 115569

  13. arXiv:2203.12780  [pdf, other]

    cs.CV

    Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera

    Authors: Jae Shin Yoon, Duygu Ceylan, Tuanfeng Y. Wang, Jingwan Lu, Jimei Yang, Zhixin Shu, Hyun Soo Park

    Abstract: Appearance of dressed humans undergoes a complex geometric transformation induced not only by the static pose but also by its dynamics, i.e., there exists a number of cloth geometric configurations given a pose depending on the way it has moved. Such appearance modeling conditioned on motion has been largely neglected in existing human rendering methods, resulting in rendering of physically implau…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: CVPR accepted. 15 pages. 17 figures, 5 tables

    Journal ref: IEEE Computer Vision and Pattern Recognition (CVPR) 2022

  14. arXiv:2202.03571  [pdf, other]

    eess.IV cs.CV

    Metal Artifact Reduction with Intra-Oral Scan Data for 3D Low Dose Maxillofacial CBCT Modeling

    Authors: Chang Min Hyun, Taigyntuya Bayaraa, Hye Sun Yun, Tae Jun Jang, Hyoung Suk Park, Jin Keun Seo

    Abstract: Low-dose dental cone beam computed tomography (CBCT) has been increasingly used for maxillofacial modeling. However, the presence of metallic inserts, such as implants, crowns, and dental filling, causes severe streaking and shading artifacts in a CBCT image and loss of the morphological structures of the teeth, which consequently prevents accurate segmentation of bones. A two-stage metal artifact…

    Submitted 7 February, 2022; originally announced February 2022.

  15. arXiv:2112.00216  [pdf, other]

    cs.CV cs.SD eess.AS

    PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound

    Authors: Zhijian Yang, Xiaoran Fan, Volkan Isler, Hyun Soo Park

    Abstract: Reconstructing the 3D pose of a person in metric scale from a single view image is a geometrically ill-posed problem. For example, we can not measure the exact distance of a person to the camera from a single view image without additional scene assumptions (e.g., known height). Existing learning based approaches circumvent this issue by reconstructing the 3D pose up to scale. However, there are ma…

    Submitted 2 December, 2021; v1 submitted 30 November, 2021; originally announced December 2021.

  16. arXiv:2110.07058  [pdf, other]

    cs.CV cs.AI

    Ego4D: Around the World in 3,000 Hours of Egocentric Video

    Authors: Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, et al. (60 additional authors not shown)

    Abstract: We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with cons…

    Submitted 11 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: To appear in the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. This version updates the baseline result numbers for the Hands and Objects benchmark (appendix)

  17. arXiv:2110.00543  [pdf, other]

    cs.CV

    Self-supervised Secondary Landmark Detection via 3D Representation Learning

    Authors: Praneet C. Bala, Jan Zimmermann, Hyun Soo Park, Benjamin Y. Hayden

    Abstract: Recent technological developments have spurred great advances in the computerized tracking of joints and other landmarks in moving animals, including humans. Such tracking promises important advances in biology and biomedicine. Modern tracking models depend critically on labor-intensive annotated datasets of primary landmarks by non-expert humans. However, such annotation approaches can be costly…

    Submitted 1 October, 2021; originally announced October 2021.

  18. arXiv:2110.00119  [pdf, other]

    cs.CV

    HUMBI: A Large Multiview Dataset of Human Body Expressions and Benchmark Challenge

    Authors: Jae Shin Yoon, Zhixuan Yu, Jaesik Park, Hyun Soo Park

    Abstract: This paper presents a new large multiview dataset called HUMBI for human body expressions with natural clothing. The goal of HUMBI is to facilitate modeling view-specific appearance and geometry of five primary body signals including gaze, face, hand, body, and garment from assorted people. 107 synchronized HD cameras are used to capture 772 distinctive subjects across gender, ethnicity, age, and…

    Submitted 20 December, 2021; v1 submitted 30 September, 2021; originally announced October 2021.

    Comments: 18 pages; Accepted to TPAMI

  19. arXiv:2109.11248  [pdf]

    physics.app-ph cs.CR

    Encryption Device Based on Wave-Chaos for Enhanced Physical Security of Wireless Wave Transmission

    Authors: Hong Soo Park, Sun K. Hong

    Abstract: We introduce an encryption device based on wave-chaos to enhance the physical security of wireless wave transmission. The proposed encryption device is composed of a compact quasi-2D disordered cavity, where transmit signals pass through to be distorted in time before transmission. On the receiving end, the signals can only be decrypted when they pass through an identical cavity. In the absence of…

    Submitted 23 September, 2021; originally announced September 2021.

  20. arXiv:2109.09299  [pdf, other]

    cs.CV

    Semi-supervised Dense Keypoints Using Unlabeled Multiview Images

    Authors: Zhixuan Yu, Haozheng Yu, Long Sha, Sujoy Ganguly, Hyun Soo Park

    Abstract: This paper presents a new end-to-end semi-supervised framework to learn a dense keypoint detector using unlabeled multiview images. A key challenge lies in finding the exact correspondences between the dense keypoints in multiple views since the inverse of the keypoint mapping can be neither analytically derived nor differentiated. This limits applying existing multiview supervision approaches use…

    Submitted 19 February, 2024; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Published as a conference paper at NeurIPS 2021

  21. arXiv:2103.03319  [pdf, other]

    cs.CV

    Self-supervised 3D Representation Learning of Dressed Humans from Social Media Videos

    Authors: Yasamin Jafarian, Hyun Soo Park

    Abstract: A key challenge of learning a visual representation for the 3D high fidelity geometry of dressed humans lies in the limited availability of the ground truth data (e.g., 3D scanned models), which results in the performance degradation of 3D human reconstruction when applying to real-world imagery. We address this challenge by leveraging a new data resource: a number of social media dance videos tha…

    Submitted 27 December, 2022; v1 submitted 4 March, 2021; originally announced March 2021.

  22. arXiv:2102.00062  [pdf, other]

    cs.CV

    Neural 3D Clothes Retargeting from a Single Image

    Authors: Jae Shin Yoon, Kihwan Kim, Jan Kautz, Hyun Soo Park

    Abstract: In this paper, we present a method of clothes retargeting; generating the potential poses and deformations of a given 3D clothing template model to fit onto a person in a single RGB image. The problem is fundamentally ill-posed as attaining the ground truth data is impossible, i.e., images of people wearing the different 3D clothing template model at exact same pose. We address this challenge by u…

    Submitted 29 January, 2021; originally announced February 2021.

    Comments: 20 pages, 21 figures

  23. arXiv:2012.03796  [pdf, other]

    cs.CV

    Pose-Guided Human Animation from a Single Image in the Wild

    Authors: Jae Shin Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, Hyun Soo Park, Christian Theobalt

    Abstract: We present a new pose transfer method for synthesizing a human animation from a single image of a person controlled by a sequence of body poses. Existing pose transfer methods exhibit significant visual artifacts when applying to a novel scene, resulting in temporal inconsistency and failures in preserving the identity and textures of the person. To address these limitations, we design a compositi…

    Submitted 21 November, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: 14 pages including Appendix

  24. arXiv:2007.09264  [pdf, other]

    cs.CV

    Surface Normal Estimation of Tilted Images via Spatial Rectifier

    Authors: Tien Do, Khiem Vuong, Stergios I. Roumeliotis, Hyun Soo Park

    Abstract: In this paper, we present a spatial rectifier to estimate surface normals of tilted images. Tilted images are of particular interest as more visual data are captured by arbitrarily oriented sensors such as body-/robot-mounted cameras. Existing approaches exhibit bounded performance on predicting surface normals because they were trained using gravity-aligned images. Our two main hypotheses are: (1…

    Submitted 14 July, 2022; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: Appearing in the European Conference on Computer Vision 2020. This version fixes a typo on the L2 loss function

  25. arXiv:2004.01294  [pdf, other]

    cs.CV

    Novel View Synthesis of Dynamic Scenes with Globally Coherent Depths from a Monocular Camera

    Authors: Jae Shin Yoon, Kihwan Kim, Orazio Gallo, Hyun Soo Park, Jan Kautz

    Abstract: This paper presents a new method to synthesize an image from arbitrary views and times given a collection of images of a dynamic scene. A key challenge for the novel view synthesis arises from dynamic scene reconstruction where epipolar geometry does not apply to the local motion of dynamic contents. To address this challenge, we propose to combine the depth from single view (DSV) and the depth fr…

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: This paper is accepted to CVPR 2020

  26. arXiv:1907.10815  [pdf, other]

    cs.CV

    Self-Supervised Adaptation of High-Fidelity Face Models for Monocular Performance Tracking

    Authors: Jae Shin Yoon, Takaaki Shiratori, Shoou-I Yu, Hyun Soo Park

    Abstract: Improvements in data-capture and face modeling techniques have enabled us to create high-fidelity realistic face models. However, driving these realistic face models requires special input data, e.g. 3D meshes and unwrapped textures. Also, these face models expect clean input data taken under controlled lab environments, which is very different from data collected in the wild. All these constraint…

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: This work is accepted by CVPR 2019

  27. Unpaired image denoising using a generative adversarial network in X-ray CT

    Authors: Hyoung Suk Park, Jineon Baek, Sun Kyoung You, Jae Kyu Choi, Jin Keun Seo

    Abstract: This paper proposes a deep learning-based denoising method for noisy low-dose computerized tomography (CT) images in the absence of paired training data. The proposed method uses a fidelity-embedded generative adversarial network (GAN) to learn a denoising function from unpaired training data of low-dose CT (LDCT) and standard-dose CT (SDCT) images, where the denoising function is the optimal gene…

    Submitted 8 August, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Journal ref: IEEE Access, 2019

  28. arXiv:1901.09478  [pdf, other]

    quant-ph cs.CR

    Efficient High-dimensional Quantum Key Distribution with Hybrid Encoding

    Authors: Yonggi Jo, Hee Su Park, Seung-Woo Lee, Wonmin Son

    Abstract: We propose a schematic setup of quantum key distribution (QKD) with an improved secret key rate based on high-dimensional quantum states. Two degrees-of-freedom of a single photon, orbital angular momentum modes, and multi-path modes, are used to encode secret key information. Its practical implementation consists of optical elements that are within the reach of current technologies such as a mult…

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: 10 pages, 6 figures

    Journal ref: Entropy 2019, 21(1), 80

  29. arXiv:1812.01738  [pdf, other]

    cs.CV

    Multiview Cross-supervision for Semantic Segmentation

    Authors: Yuan Yao, Hyun Soo Park

    Abstract: This paper presents a semi-supervised learning framework for a customized semantic segmentation task using multiview image streams. A key challenge of the customized task lies in the limited accessibility of the labeled data due to the requirement of prohibitive manual annotation effort. We hypothesize that it is possible to leverage multiview image streams that are linked through the underlying 3…

    Submitted 4 December, 2018; originally announced December 2018.

  30. arXiv:1812.00312  [pdf, other]

    cs.CV

    ECO: Egocentric Cognitive Mapping

    Authors: Jayant Sharma, Zixing Wang, Alberto Speranzon, Vijay Venkataraman, Hyun Soo Park

    Abstract: We present a new method to localize a camera within a previously unseen environment perceived from an egocentric point of view. Although this is, in general, an ill-posed problem, humans can effortlessly and efficiently determine their relative location and orientation and navigate into a previously unseen environments, e.g., finding a specific item in a new grocery store. To enable such a capabil…

    Submitted 1 December, 2018; originally announced December 2018.

  31. arXiv:1812.00281  [pdf, other]

    cs.CV

    HUMBI: A Large Multiview Dataset of Human Body Expressions

    Authors: Zhixuan Yu, Jae Shin Yoon, In Kyu Lee, Prashanth Venkatesh, Jaesik Park, Jihun Yu, Hyun Soo Park

    Abstract: This paper presents a new large multiview dataset called HUMBI for human body expressions with natural clothing. The goal of HUMBI is to facilitate modeling view-specific appearance and geometry of gaze, face, hand, body, and garment from assorted people. 107 synchronized HD cameras are used to capture 772 distinctive subjects across gender, ethnicity, age, and physical condition. With the multivi…

    Submitted 22 May, 2020; v1 submitted 1 December, 2018; originally announced December 2018.

  32. arXiv:1811.11251  [pdf, other]

    cs.CV

    Multiview Supervision By Registration

    Authors: Yilun Zhang, Hyun Soo Park

    Abstract: This paper presents a semi-supervised learning framework to train a keypoint detector using multiview image streams given the limited labeled data (typically <4%). We leverage the complementary relationship between multiview geometry and visual tracking to provide three types of supervisionary signals to utilize the unlabeled data: (1) keypoint detection in one view can be supervised by other v…

    Submitted 23 March, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

  33. arXiv:1806.00104  [pdf, other]

    cs.CV

    MONET: Multiview Semi-supervised Keypoint Detection via Epipolar Divergence

    Authors: Yuan Yao, Yasamin Jafarian, Hyun Soo Park

    Abstract: This paper presents MONET -- an end-to-end semi-supervised learning framework for a keypoint detector using multiview image streams. In particular, we consider general subjects such as non-human species where attaining a large scale annotated dataset is challenging. While multiview geometry can be used to self-supervise the unlabeled data, integrating the geometry into learning a keypoint detector…

    Submitted 16 August, 2019; v1 submitted 31 May, 2018; originally announced June 2018.

  34. arXiv:1802.02987  [pdf, ps, other]

    cs.NE cs.AI math.CV

    A Generalization Method of Partitioned Activation Function for Complex Number

    Authors: HyeonSeok Lee, Hyo Seon Park

    Abstract: A method to convert real number partitioned activation function into complex number one is provided. The method has 4 variations; 1 has potential to get holomorphic activation, 2 has potential to conserve complex angle, and the last 1 guarantees interaction between real and imaginary parts. The method has been applied to LReLU and SELU as examples. The complex number activation function is an bu…

    Submitted 8 February, 2018; originally announced February 2018.

    Comments: Complex Activation Function, Holomorphic, Phase-preserving, real-complex interaction

  35. arXiv:1712.01359  [pdf, other]

    cs.CV

    3D Semantic Trajectory Reconstruction from 3D Pixel Continuum

    Authors: Jae Shin Yoon, Ziwei Li, Hyun Soo Park

    Abstract: This paper presents a method to reconstruct dense semantic trajectory stream of human interactions in 3D from synchronized multiple videos. The interactions inherently introduce self-occlusion and illumination/appearance/shape changes, resulting in highly fragmented trajectory reconstruction with noisy and coarse semantic labels. Our conjecture is that among many views, there exists a set of views…

    Submitted 4 December, 2017; originally announced December 2017.

  36. arXiv:1708.00607  [pdf, other]

    physics.med-ph cs.CV

    CT sinogram-consistency learning for metal-induced beam hardening correction

    Authors: Hyung Suk Park, Sung Min Lee, Hwa Pyung Kim, Jin Keun Seo

    Abstract: This paper proposes a sinogram consistency learning method to deal with beam-hardening related artifacts in polychromatic computerized tomography (CT). The presence of highly attenuating materials in the scan field causes an inconsistent sinogram, that does not match the range space of the Radon transform. When the mismatched data are entered into the range space during CT reconstruction, streakin…

    Submitted 12 January, 2018; v1 submitted 2 August, 2017; originally announced August 2017.

    Comments: 17 pages, 8 figures

  37. arXiv:1704.00098  [pdf, other]

    cs.CV

    Customizing First Person Image Through Desired Actions

    Authors: Shan Su, Jianbo Shi, Hyun Soo Park

    Abstract: This paper studies a problem of inverse visual path planning: creating a visual scene from a first person action. Our conjecture is that the spatial arrangement of a first person visual scene is deployed to afford an action, and therefore, the action can be inversely used to synthesize a new scene such that the action is feasible. As a proof-of-concept, we focus on linking visual experiences induc…

    Submitted 31 March, 2017; originally announced April 2017.

  38. arXiv:1611.09464  [pdf, other]

    cs.CV

    Social Behavior Prediction from First Person Videos

    Authors: Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park

    Abstract: This paper presents a method to predict the future movements (location and gaze direction) of basketball players as a whole from their first person videos. The predicted behaviors reflect an individual physical space that affords to take the next actions while conforming to social behaviors by engaging to joint attention. Our key innovation is to use the 3D reconstruction of multiple first person…

    Submitted 28 November, 2016; originally announced November 2016.

  39. arXiv:1611.05365  [pdf, other]

    cs.CV

    Am I a Baller? Basketball Performance Assessment from First-Person Videos

    Authors: Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi

    Abstract: This paper presents a method to assess a basketball player's performance from his/her first-person video. A key challenge lies in the fact that the evaluation metric is highly subjective and specific to a particular evaluator. We leverage the first-person camera to address this challenge. The spatiotemporal visual semantics provided by a first-person view allows us to reason about the camera weare…

    Submitted 2 August, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

  40. arXiv:1611.05335  [pdf, other]

    cs.CV

    Unsupervised Learning of Important Objects from First-Person Videos

    Authors: Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi

    Abstract: A first-person camera, placed at a person's head, captures, which objects are important to the camera wearer. Most prior methods for this task learn to detect such important objects from the manually labeled first-person data in a supervised fashion. However, important objects are strongly related to the camera wearer's internal state such as his intentions and attention, and thus, only the person…

    Submitted 2 August, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

  41. arXiv:1603.04908  [pdf, other]

    cs.CV

    First Person Action-Object Detection with EgoNet

    Authors: Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi

    Abstract: Unlike traditional third-person cameras mounted on robots, a first-person camera, captures a person's visual sensorimotor object interactions from up close. In this paper, we study the tight interplay between our momentary visual attention and motor action with objects from a first-person camera. We propose a concept of action-objects---the objects that capture person's conscious visual (watching…

    Submitted 10 June, 2017; v1 submitted 15 March, 2016; originally announced March 2016.

  42. arXiv:1511.02682  [pdf, other]

    cs.CV

    Exploiting Egocentric Object Prior for 3D Saliency Detection

    Authors: Gedas Bertasius, Hyun Soo Park, Jianbo Shi

    Abstract: On a minute-to-minute basis people undergo numerous fluid interactions with objects that barely register on a conscious level. Recent neuroscientific research demonstrates that humans have a fixed size prior for salient objects. This suggests that a salient object in 3D undergoes a consistent transformation such that people's visual system perceives it with an approximately fixed size. This findin…

    Submitted 9 November, 2015; originally announced November 2015.

  43. arXiv:1509.02094  [pdf, other]

    cs.CV

    Future Localization from an Egocentric Depth Image

    Authors: Hyun Soo Park, Yedong Niu, Jianbo Shi

    Abstract: This paper presents a method for future localization: to predict a set of plausible trajectories of ego-motion given a depth image. We predict paths avoiding obstacles, between objects, even paths turning around a corner into space behind objects. As a byproduct of the predicted trajectories of ego-motion, we discover in the image the empty space occluded by foreground objects. We use no image bas…

    Submitted 7 September, 2015; originally announced September 2015.

    Comments: 9 pages

  44. arXiv:cmp-lg/9505029  [pdf, ps]

    cs.CL

    Mapping Scrambled Korean Sentences into English Using Synchronous TAGs

    Authors: Hyun S. Park

    Abstract: Synchronous Tree Adjoining Grammars can be used for Machine Translation. However, translating a free order language such as Korean to English is complicated. I present a mechanism to translate scrambled Korean sentences into English by combining the concepts of Multi-Component TAGs (MC-TAGs) and Synchronous TAGs (STAGs).

    Submitted 13 May, 1995; originally announced May 1995.

    Comments: uuencoded compressed ps file. 3 pages. To appear ACL95

  45. arXiv:cmp-lg/9410023  [pdf, ps]

    cs.CL

    Korean to English Translation Using Synchronous TAGs

    Authors: Dania Egedi, Martha Palmer, Hyun S. Park, Aravind K. Joshi

    Abstract: It is often argued that accurate machine translation requires reference to contextual knowledge for the correct treatment of linguistic phenomena such as dropped arguments and accurate lexical selection. One of the historical arguments in favor of the interlingua approach has been that, since it revolves around a deep semantic representation, it is better able to handle the types of linguistic p…

    Submitted 24 October, 1994; originally announced October 1994.

    Comments: ps file. 8 pages

    Journal ref: Proceedings of AMTA 94