Search | arXiv e-print repository

CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning

Authors: Tianyu Li, Hyunyoung Jung, Matthew Gombolay, Yong Kwon Cho, Sehoon Ha

Abstract: Human motion driven control (HMDC) is an effective approach for generating natural and compelling robot motions while preserving high-level semantics. However, establishing the correspondence between humans and robots with different body structures is not straightforward due to the mismatches in kinematics and dynamics properties, which causes intrinsic ambiguity to the problem. Many previous algo… ▽ More Human motion driven control (HMDC) is an effective approach for generating natural and compelling robot motions while preserving high-level semantics. However, establishing the correspondence between humans and robots with different body structures is not straightforward due to the mismatches in kinematics and dynamics properties, which causes intrinsic ambiguity to the problem. Many previous algorithms approach this motion retargeting problem with unsupervised learning, which requires the prerequisite skill sets. However, it will be extremely costly to learn all the skills without understanding the given human motions, particularly for high-dimensional robots. In this work, we introduce CrossLoco, a guided unsupervised reinforcement learning framework that simultaneously learns robot skills and their correspondence to human motions. Our key innovation is to introduce a cycle-consistency-based reward term designed to maximize the mutual information between human motions and robot states. We demonstrate that the proposed framework can generate compelling robot motions by translating diverse human motions, such as running, hop**, and dancing. We quantitatively compare our CrossLoco against the manually engineered and unsupervised baseline algorithms along with the ablated versions of our framework and demonstrate that our method translates human motions with better accuracy, diversity, and user preference. We also showcase its utility in other applications, such as synthesizing robot movements from language input and enabling interactive robot control. △ Less

Submitted 29 September, 2023; originally announced September 2023.

arXiv:2305.15420 [pdf]

A Hybrid Semantic-Geometric Approach for Clutter-Resistant Floorplan Generation from Building Point Clouds

Authors: Seongyong Kim, Yosuke Yajima, Jisoo Park, **gdao Chen, Yong K. Cho

Abstract: Building Information Modeling (BIM) technology is a key component of modern construction engineering and project management workflows. As-is BIM models that represent the spatial reality of a project site can offer crucial information to stakeholders for construction progress monitoring, error checking, and building maintenance purposes. Geometric methods for automatically converting raw scan data… ▽ More Building Information Modeling (BIM) technology is a key component of modern construction engineering and project management workflows. As-is BIM models that represent the spatial reality of a project site can offer crucial information to stakeholders for construction progress monitoring, error checking, and building maintenance purposes. Geometric methods for automatically converting raw scan data into BIM models (Scan-to-BIM) often fail to make use of higher-level semantic information in the data. Whereas, semantic segmentation methods only output labels at the point level without creating object level models that is necessary for BIM. To address these issues, this research proposes a hybrid semantic-geometric approach for clutter-resistant floorplan generation from laser-scanned building point clouds. The input point clouds are first pre-processed by normalizing the coordinate system and removing outliers. Then, a semantic segmentation network based on PointNet++ is used to label each point as ceiling, floor, wall, door, stair, and clutter. The clutter points are removed whereas the wall, door, and stair points are used for 2D floorplan generation. A region-growing segmentation algorithm paired with geometric reasoning rules is applied to group the points together into individual building elements. Finally, a 2-fold Random Sample Consensus (RANSAC) algorithm is applied to parameterize the building elements into 2D lines which are used to create the output floorplan. The proposed method is evaluated using the metrics of precision, recall, Intersection-over-Union (IOU), Betti error, and war** error. △ Less

Submitted 15 May, 2023; originally announced May 2023.

arXiv:2103.09160 [pdf, other]

doi 10.1109/LRA.2021.3062607

LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation

Authors: **gdao Chen, Zsolt Kira, Yong K. Cho

Abstract: 3D point cloud segmentation is an important function that helps robots understand the layout of their surrounding environment and perform tasks such as gras** objects, avoiding obstacles, and finding landmarks. Current segmentation methods are mostly class-specific, many of which are tuned to work with specific object categories and may not be generalizable to different types of scenes. This res… ▽ More 3D point cloud segmentation is an important function that helps robots understand the layout of their surrounding environment and perform tasks such as gras** objects, avoiding obstacles, and finding landmarks. Current segmentation methods are mostly class-specific, many of which are tuned to work with specific object categories and may not be generalizable to different types of scenes. This research proposes a learnable region growing method for class-agnostic point cloud segmentation, specifically for the task of instance label prediction. The proposed method is able to segment any class of objects using a single deep neural network without any assumptions about their shapes and sizes. The deep neural network is trained to predict how to add or remove points from a point cloud region to morph it into incrementally more complete regions of an object instance. Segmentation results on the S3DIS and ScanNet datasets show that the proposed method outperforms competing methods by 1%-9% on 6 different evaluation metrics. △ Less

Submitted 16 March, 2021; originally announced March 2021.

Journal ref: IEEE Robotics and Automation Letters 2021

arXiv:1902.06768 [pdf, other]

doi 10.1109/LRA.2019.2894915

Multi-view Incremental Segmentation of 3D Point Clouds for Mobile Robots

Authors: **gdao Chen, Yong K. Cho, Zsolt Kira

Abstract: Mobile robots need to create high-definition 3D maps of the environment for applications such as remote surveillance and infrastructure map**. Accurate semantic processing of the acquired 3D point cloud is critical for allowing the robot to obtain a high-level understanding of the surrounding objects and perform context-aware decision making. Existing techniques for point cloud semantic segmenta… ▽ More Mobile robots need to create high-definition 3D maps of the environment for applications such as remote surveillance and infrastructure map**. Accurate semantic processing of the acquired 3D point cloud is critical for allowing the robot to obtain a high-level understanding of the surrounding objects and perform context-aware decision making. Existing techniques for point cloud semantic segmentation are mostly applied on a single-frame or offline basis, with no way to integrate the segmentation results over time. This paper proposes an online method for mobile robots to incrementally build a semantically-rich 3D point cloud of the environment. The proposed deep neural network, MCPNet, is trained to predict class labels and object instance labels for each point in the scanned point cloud in an incremental fashion. A multi-view context pooling (MCP) operator is used to combine point features obtained from multiple viewpoints to improve the classification accuracy. The proposed architecture was trained and evaluated on ray-traced scans derived from the Stanford 3D Indoor Spaces dataset. Results show that the proposed approach led to 15% improvement in point-wise accuracy and 7% improvement in NMI compared to the next best online method, with only a 6% drop in accuracy compared to the PointNet-based offline approach. △ Less

Submitted 18 February, 2019; originally announced February 2019.

arXiv:1805.03993 [pdf, other]

doi 10.1103/PhysRevLett.121.104102

Experimental observation of current reversal in a rocking Brownian motor

Authors: Christian Schwemmer, Stefan Fringes, Urs Duerig, Yu Kyoung Ryu Cho, Armin W. Knoll

Abstract: A reversal of the particle current in rocking Brownian motors was predicted more than 20 years ago; however, an experimental verification and a deeper insight into the underlying mechanisms remained elusive. Here, we investigate the high frequency behaviour of a rocking Brownian motor for charged nanoparticles based on electrostatic interactions in a 3D shaped nanofluidic slit and electro-osmotic… ▽ More A reversal of the particle current in rocking Brownian motors was predicted more than 20 years ago; however, an experimental verification and a deeper insight into the underlying mechanisms remained elusive. Here, we investigate the high frequency behaviour of a rocking Brownian motor for charged nanoparticles based on electrostatic interactions in a 3D shaped nanofluidic slit and electro-osmotic forcing of the particles. A sub ms temporal and $\approx\,10\,$nm spatial resolution of the 60 nm gold spheres allows us to measure the time-resolved and frequency dependent evolution of the particle probability density in-situ. At 250 Hz the particle current changes sign, in agreement with a theoretical model based on the time-dependent Fokker-Planck equation. From this fit-parameter free description and its excellent agreement with the observed behaviour, we trace the origin of the current reversal to the asymmetric and increasingly static probability density at high frequencies. △ Less

Submitted 23 July, 2018; v1 submitted 9 May, 2018; originally announced May 2018.

Comments: 6 pages, 4 figures, SM: 14 pages, 14 figures

Journal ref: Phys. Rev. Lett. 121, 104102 (2018)

arXiv:1707.03039 [pdf]

doi 10.1364/OL.42.003379

Rapid focus map surveying for whole slide imaging with continues sample motion

Authors: Jun Liao, Yutong Jiang, Zichao Bian, Bahareh Mahrou, Aparna Nambiar, Alexander W. Magsam, Kaikai Guo, Yong Ku Cho, Guoan Zheng

Abstract: Whole slide imaging (WSI) has recently been cleared for primary diagnosis in the US. A critical challenge of WSI is to perform accurate focusing in high speed. Traditional systems create a focus map prior to scanning. For each focus point on the map, sample needs to be static in the x-y plane and axial scanning is needed to maximize the contrast. Here we report a novel focus map surveying method f… ▽ More Whole slide imaging (WSI) has recently been cleared for primary diagnosis in the US. A critical challenge of WSI is to perform accurate focusing in high speed. Traditional systems create a focus map prior to scanning. For each focus point on the map, sample needs to be static in the x-y plane and axial scanning is needed to maximize the contrast. Here we report a novel focus map surveying method for WSI. The reported method requires no axial scanning, no additional camera and lens, works for stained and transparent samples, and allows continuous sample motion in the surveying process. It can be used for both brightfield and fluorescence WSI. By using a 20X, 0.75 NA objective lens, we demonstrate a mean focusing error of ~0.08 microns in the static mode and ~0.17 microns in the continuous motion mode. The reported method may provide a turnkey solution for most existing WSI systems for its simplicity, robustness, accuracy, and high-speed. It may also standardize the imaging performance of WSI systems for digital pathology and find other applications in high-content microscopy such as DNA sequencing and time-lapse live-cell imaging. △ Less

Submitted 5 July, 2017; originally announced July 2017.

Journal ref: Optics Letters, 42 (17), 3379-3382 (2017)

arXiv:0902.3328 [pdf]

doi 10.1166/jctn.2009.1297

On Electrical Equivalence of Aperture-Body and Transmission-Cavity Resonance Phenomena in Subwavelength Conducting Aperture Systems from an Equivalent Circuit Point of View

Authors: Young Ki Cho, Jong-Ig Lee, Ki Young Kim

Abstract: For a narrow slit structure backed by a conducting strip which is taken as a representative example of an aperture-body resonance (ABR) problem, the transmission resonance condition (i.e., condition for maximum power transmission) and the transmission width (i.e., normalized maximum transmitted power through the slit) are found to be the same as those for narrow slit coupling problem in a thick… ▽ More For a narrow slit structure backed by a conducting strip which is taken as a representative example of an aperture-body resonance (ABR) problem, the transmission resonance condition (i.e., condition for maximum power transmission) and the transmission width (i.e., normalized maximum transmitted power through the slit) are found to be the same as those for narrow slit coupling problem in a thick conducting screen, which is designated as a transmission-cavity resonance (TCR) problem. From a viewpoint of equivalent circuit representation for the transmission resonance condition and the funneling mechanism, the ABR and the TCR problems are thought to be essentially of the same nature. △ Less

Submitted 29 December, 2009; v1 submitted 19 February, 2009; originally announced February 2009.

Comments: 14 pages, 3 figures

Journal ref: Journal of Computational and Theoretical Nanoscience, vol. 6, no. 11, pp. 2402-2406, November 2009

arXiv:0902.3309 [pdf]

doi 10.2478/s11772-006-0031-z

Optical guided dispersions and subwavelength transmissions in dispersive plasmonic circular holes

Authors: Ki Young Kim, Young Ki Cho, Heung-Sik Tae, Jeong-Hae Lee

Abstract: The light transmission through a dispersive plasmonic circular hole is numerically investigated with an emphasis on its subwavelength guidance. For a better understanding of the effect of the hole diameter on the guided dispersion characteristics, the guided modes, including both the surface plasmon polariton mode and the circular waveguide mode, are studied for several hole diameters, especiall… ▽ More The light transmission through a dispersive plasmonic circular hole is numerically investigated with an emphasis on its subwavelength guidance. For a better understanding of the effect of the hole diameter on the guided dispersion characteristics, the guided modes, including both the surface plasmon polariton mode and the circular waveguide mode, are studied for several hole diameters, especially when the metal cladding has a plasmonic frequency dependency. A brief comparison is also made with the guided dispersion characteristics of a dispersive plasmonic gap [K. Y. Kim, et al., Opt. Express 14, 320-330 (2006)], which is a planar version of the present structure, and a circular waveguide with perfect electric conductor cladding. Finally, the modal behavior of the first three TM-like principal modes with varied hole diameters is examined for the same operating mode. △ Less

Submitted 19 February, 2009; originally announced February 2009.

Comments: 20 pages, 5 figures, 1 table

Journal ref: Opto-Electronics Review, vol. 14, no. 3, pp. 233-241, Sep. 2006

arXiv:0707.2995 [pdf, ps, other]

Triple Hilbert transforms along polynomial surfaces

Authors: Yong Kum Cho, Sunggeum Hong, Joonil Kim, Chan Woo Yang

Abstract: This paper has been withdrawn since it contains some discrepancy with othe authers's recent result. We will not post this until this discrepancy is resolved. This paper has been withdrawn since it contains some discrepancy with othe authers's recent result. We will not post this until this discrepancy is resolved. △ Less

Submitted 24 July, 2007; v1 submitted 20 July, 2007; originally announced July 2007.

Comments: This paper has been withdrawn

MSC Class: 42B20

Showing 1–9 of 9 results for author: Cho, Y K