Skip to main content

Showing 1–50 of 109 results for author: Sung, M

.
  1. arXiv:2406.10853  [pdf, other

    cs.CV

    MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images

    Authors: Eunji Hong, Minh Hieu Nguyen, Mikaela Angelina Uy, Minhyuk Sung

    Abstract: We present MV2Cyl, a novel method for reconstructing 3D from 2D multi-view images, not merely as a field or raw geometry but as a sketch-extrude CAD model. Extracting extrusion cylinders from raw 3D geometry has been extensively researched in computer vision, while the processing of 3D data through neural networks has remained a bottleneck. Since 3D scans are generally accompanied by multi-view im… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 24 pages

  2. arXiv:2406.09728  [pdf, other

    cs.CV cs.GR

    Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses

    Authors: Seungwoo Yoo, Juil Koo, Kyeongmin Yeo, Minhyuk Sung

    Abstract: We propose a novel method for learning representations of poses for 3D deformable objects, which specializes in 1) disentangling pose information from the object's identity, 2) facilitating the learning of pose variations, and 3) transferring pose information to other object identities. Based on these properties, our method enables the generation of 3D deformable objects with diversity in both ide… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2405.00523  [pdf, other

    cs.AI cs.CL

    CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions

    Authors: Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang, Jihun Choi

    Abstract: This paper introduces CookingSense, a descriptive collection of knowledge assertions in the culinary domain extracted from various sources, including web data, scientific papers, and recipes, from which knowledge covering a broad range of aspects is acquired. CookingSense is constructed through a series of dictionary-based filtering and language model-based semantic filtering techniques, which res… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: LREC-COLING 2024 Accepted

  4. arXiv:2403.17422  [pdf, other

    cs.CV

    InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion

    Authors: Jihyun Lee, Shunsuke Saito, Giljoo Nam, Minhyuk Sung, Tae-Kyun Kim

    Abstract: We present InterHandGen, a novel framework that learns the generative prior of two-hand interaction. Sampling from our model yields plausible and diverse two-hand shapes in close interaction with or without an object. Our prior can be incorporated into any optimization or learning methods to reduce ambiguity in an ill-posed setup. Our key observation is that directly modeling the joint distributio… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024, project page: https://jyunlee.github.io/projects/interhandgen/

  5. arXiv:2403.14860  [pdf, other

    eess.SY cs.LG

    Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control

    Authors: Minjun Sung, Sambhu H. Karumanchi, Aditya Gahlawat, Naira Hovakimyan

    Abstract: We introduce $\mathcal{L}_1$-MBRL, a control-theoretic augmentation scheme for Model-Based Reinforcement Learning (MBRL) algorithms. Unlike model-free approaches, MBRL algorithms learn a model of the transition function using data and use it to design a control input. Our approach generates a series of approximate control-affine models of the learned transition function according to the proposed s… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  6. arXiv:2403.14370  [pdf, other

    cs.CV

    SyncTweedies: A General Generative Framework Based on Synchronized Diffusions

    Authors: Jaihoon Kim, Juil Koo, Kyeongmin Yeo, Minhyuk Sung

    Abstract: We introduce a general framework for generating diverse visual content, including ambiguous images, panorama images, mesh textures, and Gaussian splat textures, by synchronizing multiple diffusion processes. We present exhaustive investigation into all possible scenarios for synchronizing multiple diffusion processes through a canonical space and analyze their characteristics across applications.… ▽ More

    Submitted 20 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Project page: https://synctweedies.github.io/

  7. arXiv:2403.13589  [pdf, other

    cs.CV

    ReGround: Improving Textual and Spatial Grounding at No Cost

    Authors: Yuseung Lee, Minhyuk Sung

    Abstract: When an image generation process is guided by both a text prompt and spatial cues, such as a set of bounding boxes, do these elements work in harmony, or does one dominate the other? Our analysis of a pretrained image diffusion model that integrates gated self-attention into the U-Net reveals that spatial grounding often outweighs textual grounding due to the sequential flow from gated self-attent… ▽ More

    Submitted 30 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Project page: https://re-ground.github.io/

  8. arXiv:2401.15269  [pdf, other

    cs.CL cs.AI cs.IR

    Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models

    Authors: Minbyul Jeong, Jiwoong Sohn, Mujeen Sung, Jaewoo Kang

    Abstract: Recent proprietary large language models (LLMs), such as GPT-4, have achieved a milestone in tackling diverse challenges in the biomedical domain, ranging from multiple-choice questions to long-form generations. To address challenges that still cannot be handled with the encoded knowledge of LLMs, various retrieval-augmented generation (RAG) methods have been developed by searching documents from… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: ISMB 2024

  9. arXiv:2401.05906  [pdf, other

    cs.CV

    PartSTAD: 2D-to-3D Part Segmentation Task Adaptation

    Authors: Hyun** Kim, Minhyuk Sung

    Abstract: We introduce PartSTAD, a method designed for the task adaptation of 2D-to-3D segmentation lifting. Recent studies have highlighted the advantages of utilizing 2D segmentation models to achieve high-quality 3D segmentation through few-shot adaptation. However, previous approaches have focused on adapting 2D segmentation models for domain shift to rendered images and synthetic text descriptions, rat… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  10. arXiv:2311.16739  [pdf, other

    cs.CV cs.GR

    As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors

    Authors: Seungwoo Yoo, Kunho Kim, Vladimir G. Kim, Minhyuk Sung

    Abstract: We present As-Plausible-as-Possible (APAP) mesh deformation technique that leverages 2D diffusion priors to preserve the plausibility of a mesh under user-controlled deformation. Our framework uses per-face Jacobians to represent mesh deformations, where mesh vertex coordinates are computed via a differentiable Poisson Solve. The deformed mesh is rendered, and the resulting 2D image is used in the… ▽ More

    Submitted 30 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Project page: https://as-plausible-as-possible.github.io/

  11. arXiv:2311.13831  [pdf, other

    cs.CV

    Posterior Distillation Sampling

    Authors: Juil Koo, Chanho Park, Minhyuk Sung

    Abstract: We introduce Posterior Distillation Sampling (PDS), a novel optimization method for parametric image editing based on diffusion models. Existing optimization-based methods, which leverage the powerful 2D prior of diffusion models to handle various parametric images, have mainly focused on generation. Unlike generation, editing requires a balance between conforming to the target attribute and prese… ▽ More

    Submitted 31 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Project page: https://posterior-distillation-sampling.github.io/

  12. OptCtrlPoints: Finding the Optimal Control Points for Biharmonic 3D Shape Deformation

    Authors: Kunho Kim, Mikaela Angelina Uy, Despoina Paschalidou, Alec Jacobson, Leonidas J. Guibas, Minhyuk Sung

    Abstract: We propose OptCtrlPoints, a data-driven framework designed to identify the optimal sparse set of control points for reproducing target shapes using biharmonic 3D shape deformation. Control-point-based 3D deformation methods are widely utilized for interactive shape editing, and their usability is enhanced when the control points are sparse yet strategically distributed across the shape. With this… ▽ More

    Submitted 13 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: Pacific Graphics 2023 (Full Paper). Project page: https://soulmates2.github.io/publications/OptCtrlPoints/

  13. arXiv:2307.10204  [pdf, ps, other

    cs.IR cs.LG stat.ML

    An IPW-based Unbiased Ranking Metric in Two-sided Markets

    Authors: Keisho Oh, Naoki Nishimura, Minje Sung, Ken Kobayashi, Kazuhide Nakata

    Abstract: In modern recommendation systems, unbiased learning-to-rank (LTR) is crucial for prioritizing items from biased implicit user feedback, such as click data. Several techniques, such as Inverse Propensity Weighting (IPW), have been proposed for single-sided markets. However, less attention has been paid to two-sided markets, such as job platforms or dating services, where successful conversions requ… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  14. arXiv:2307.08100  [pdf, other

    cs.CV

    FourierHandFlow: Neural 4D Hand Representation Using Fourier Query Flow

    Authors: Jihyun Lee, Junbong Jang, Donghwan Kim, Minhyuk Sung, Tae-Kyun Kim

    Abstract: Recent 4D shape representations model continuous temporal evolution of implicit shapes by (1) learning query flows without leveraging shape and articulation priors or (2) decoding shape occupancies separately for each time value. Thus, they do not effectively capture implicit correspondences between articulated shapes or regularize jittery temporal deformations. In this work, we present FourierHan… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: 16 pages, 6 figures, under review

  15. arXiv:2307.07409  [pdf, other

    cs.CL cs.AI eess.IV

    KU-DMIS-MSRA at RadSum23: Pre-trained Vision-Language Model for Radiology Report Summarization

    Authors: Gangwoo Kim, Hajung Kim, Lei Ji, Seongsu Bae, Chanhwi Kim, Mujeen Sung, Hyunjae Kim, Kun Yan, Eric Chang, Jaewoo Kang

    Abstract: In this paper, we introduce CheXOFA, a new pre-trained vision-language model (VLM) for the chest X-ray domain. Our model is initially pre-trained on various multimodal datasets within the general domain before being transferred to the chest X-ray domain. Following a prominent VLM, we unify various domain-specific tasks into a simple sequence-to-sequence schema. It enables the model to effectively… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Published at BioNLP workshop @ ACL 2023

  16. arXiv:2307.02591  [pdf, other

    cs.CL cs.AI

    ODD: A Benchmark Dataset for the Natural Language Processing based Opioid Related Aberrant Behavior Detection

    Authors: Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee L. Sung, Joel I. Reisman, Wenjun Li, Robert D. Kerns, William Becker, Hong Yu

    Abstract: Opioid related aberrant behaviors (ORABs) present novel risk factors for opioid overdose. This paper introduces a novel biomedical natural language processing benchmark dataset named ODD, for ORAB Detection Dataset. ODD is an expert-annotated dataset designed to identify ORABs from patients' EHR notes and classify them into nine categories; 1) Confirmed Aberrant Behavior, 2) Suggested Aberrant Beh… ▽ More

    Submitted 22 March, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: To be appeared at NAACL 2024

  17. arXiv:2306.05178  [pdf, other

    cs.CV

    SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions

    Authors: Yuseung Lee, Kunho Kim, Hyun** Kim, Minhyuk Sung

    Abstract: The remarkable capabilities of pretrained image diffusion models have been utilized not only for generating fixed-size images but also for creating panoramas. However, naive stitching of multiple images often results in visible seams. Recent techniques have attempted to address this issue by performing joint diffusions in multiple windows and averaging latent features in overlap** regions. Howev… ▽ More

    Submitted 29 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023. Project page: https://syncdiffusion.github.io

  18. arXiv:2305.14827  [pdf, other

    cs.CL

    Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

    Authors: Mujeen Sung, James Gung, Elman Mansimov, Nikolaos Pappas, Raphael Shu, Salvatore Romeo, Yi Zhang, Vittorio Castelli

    Abstract: Intent classification (IC) plays an important role in task-oriented dialogue systems. However, IC models often generalize poorly when training without sufficient annotated examples for each user intent. We propose a novel pre-training method for text encoders that uses contrastive learning with intent psuedo-labels to produce embeddings that are well-suited for IC tasks, reducing the need for manu… ▽ More

    Submitted 13 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  19. arXiv:2304.04336  [pdf, other

    cs.CV

    Split, Merge, and Refine: Fitting Tight Bounding Boxes via Over-Segmentation and Iterative Search

    Authors: Chanhyeok Park, Minhyuk Sung

    Abstract: Achieving tight bounding boxes of a shape while guaranteeing complete boundness is an essential task for efficient geometric operations and unsupervised semantic part detection. But previous methods fail to achieve both full coverage and tightness. Neural-network-based methods are not suitable for these goals due to the non-differentiability of the objective, while classic iterative search methods… ▽ More

    Submitted 1 December, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: 3DV 2024

  20. arXiv:2303.12236  [pdf, other

    cs.CV

    SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation

    Authors: Juil Koo, Seungwoo Yoo, Minh Hieu Nguyen, Minhyuk Sung

    Abstract: We present a cascaded diffusion model based on a part-level implicit 3D representation. Our model achieves state-of-the-art generation quality and also enables part-level shape editing and manipulation without any additional training in conditional setup. Diffusion models have demonstrated impressive capabilities in data generation as well as zero-shot completion and editing via a guided reverse p… ▽ More

    Submitted 20 March, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Project page: https://salad3d.github.io

  21. arXiv:2302.14348  [pdf, other

    cs.CV cs.AI

    Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes

    Authors: Jihyun Lee, Minhyuk Sung, Honggyu Choi, Tae-Kyun Kim

    Abstract: We present Implicit Two Hands (Im2Hands), the first neural implicit representation of two interacting hands. Unlike existing methods on two-hand reconstruction that rely on a parametric hand model and/or low-resolution meshes, Im2Hands can produce fine-grained geometry of two hands with high hand-to-hand and hand-to-image coherency. To handle the shape complexity and interaction context between tw… ▽ More

    Submitted 27 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 6 figures, 14 pages, accepted to CVPR 2023, project page: https://jyunlee.github.io/projects/implicit-two-hands/

  22. arXiv:2212.05011  [pdf, other

    cs.CV cs.CL

    LADIS: Language Disentanglement for 3D Shape Editing

    Authors: Ian Huang, Panos Achlioptas, Tianyi Zhang, Sergey Tulyakov, Minhyuk Sung, Leonidas Guibas

    Abstract: Natural language interaction is a promising direction for democratizing 3D shape design. However, existing methods for text-driven 3D shape editing face challenges in producing decoupled, local edits to 3D shapes. We address this problem by learning disentangled latent representations that ground language in 3D geometry. To this end, we propose a complementary tool set including a novel network ar… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

  23. arXiv:2211.08604  [pdf, other

    cs.LG cs.SI

    PU GNN: Chargeback Fraud Detection in P2E MMORPGs via Graph Attention Networks with Imbalanced PU Labels

    Authors: Jiho Choi, Junghoon Park, Woocheol Kim, **-Hyeok Park, Yumin Suh, Minchang Sung

    Abstract: The recent advent of play-to-earn (P2E) systems in massively multiplayer online role-playing games (MMORPGs) has made in-game goods interchangeable with real-world values more than ever before. The goods in the P2E MMORPGs can be directly exchanged with cryptocurrencies such as Bitcoin, Ethereum, or Klaytn via blockchain networks. Unlike traditional in-game goods, once they had been written to the… ▽ More

    Submitted 23 June, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: ECML PKDD 2023 (Applied Data Science Track)

  24. arXiv:2211.00382  [pdf, other

    cs.CV

    Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing

    Authors: Jeonghyun Kim, Kaichun Mo, Minhyuk Sung, Woontack Woo

    Abstract: We propose Seg&Struct, a supervised learning framework leveraging the interplay between part segmentation and structure inference and demonstrating their synergy in an integrated framework. Both part segmentation and structure inference have been extensively studied in the recent deep learning literature, while the supervisions used for each task have not been fully exploited to assist the other t… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: WACV 2023 (Algorithm Track)

  25. arXiv:2205.12680  [pdf, other

    cs.CL cs.IR

    Optimizing Test-Time Query Representations for Dense Retrieval

    Authors: Mujeen Sung, Jungsoo Park, Jaewoo Kang, Danqi Chen, **hyuk Lee

    Abstract: Recent developments of dense retrieval rely on quality representations of queries and contexts from pre-trained query and context encoders. In this paper, we introduce TOUR (Test-Time Optimization of Query Representations), which further optimizes instance-level query representations guided by signals from test-time retrieval results. We leverage a cross-encoder re-ranker to provide fine-grained p… ▽ More

    Submitted 28 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Findings of ACL 2023

  26. arXiv:2203.15872  [pdf, other

    eess.SY

    Protective Mission against a Highly Maneuverable Rogue Drone Using Defense Margin Strategy

    Authors: Minjun Sung, Christophe Johannes Hiltebrandt-McIntosh, Hunmin Kim, Naira Hovakimyan

    Abstract: The current paper studies a protective mission to defend a domain called the safe zone from a rogue drone invasion. We consider a one attacker and one defender drone scenario where only a noisy observation of the attacker at every time step is accessible to the defender. Directly applying strategies used in existing problems such as pursuit-evasion games are shown to be insufficient for our missio… ▽ More

    Submitted 12 September, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

  27. arXiv:2203.15235  [pdf, other

    cs.CV

    Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian

    Authors: Jihyun Lee, Minhyuk Sung, Hyun** Kim, Tae-Kyun Kim

    Abstract: We propose a framework that can deform an object in a 2D image as it exists in 3D space. Most existing methods for 3D-aware image manipulation are limited to (1) only changing the global scene information or depth, or (2) manipulating an object of specific categories. In this paper, we present a 3D-aware image deformation method with minimal restrictions on shape category and deformation type. Whi… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 16 pages, 10 figures, accepted to CVPR 2022

  28. arXiv:2203.06457  [pdf, other

    cs.CV

    3D-GIF: 3D-Controllable Object Generation via Implicit Factorized Representations

    Authors: Minsoo Lee, Chaeyeon Chung, Hojun Cho, Minjung Kim, Sanghun Jung, Jaegul Choo, Minhyuk Sung

    Abstract: While NeRF-based 3D-aware image generation methods enable viewpoint control, limitations still remain to be adopted to various 3D applications. Due to their view-dependent and light-entangled volume representation, the 3D geometry presents unrealistic quality and the color should be re-rendered for every desired viewpoint. To broaden the 3D applicability from 3D-aware image generation to 3D-contro… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

  29. arXiv:2203.06413  [pdf, other

    cs.RO

    Implicit LiDAR Network: LiDAR Super-Resolution via Interpolation Weight Prediction

    Authors: Youngsun Kwon, Minhyuk Sung, Sung-Eui Yoon

    Abstract: Super-resolution of LiDAR range images is crucial to improving many downstream tasks such as object detection, recognition, and tracking. While deep learning has made a remarkable advances in super-resolution techniques, typical convolutional architectures limit upscaling factors to specific output resolutions in training. Recent work has shown that a continuous representation of an image and lear… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

    Comments: 7 pages, to be published in ICRA 2022

  30. BERN2: an advanced neural biomedical named entity recognition and normalization tool

    Authors: Mujeen Sung, Minbyul Jeong, Yonghwa Choi, Donghyeon Kim, **hyuk Lee, Jaewoo Kang

    Abstract: In biomedical natural language processing, named entity recognition (NER) and named entity normalization (NEN) are key tasks that enable the automatic extraction of biomedical entities (e.g. diseases and drugs) from the ever-growing biomedical literature. In this article, we present BERN2 (Advanced Biomedical Entity Recognition and Normalization), a tool that improves the previous neural network-b… ▽ More

    Submitted 6 October, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

    Comments: Published in Bioinformatics 2022. Web service available at http://bern2.korea.ac.kr. Code available at https://github.com/dmis-lab/BERN2

  31. arXiv:2112.09329  [pdf, other

    cs.CV

    Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders

    Authors: Mikaela Angelina Uy, Yen-yu Chang, Minhyuk Sung, Purvi Goel, Joseph Lambourne, Tolga Birdal, Leonidas Guibas

    Abstract: We propose Point2Cyl, a supervised network transforming a raw 3D point cloud to a set of extrusion cylinders. Reverse engineering from a raw geometry to a CAD model is an essential task to enable manipulation of the 3D data in shape editing software and thus expand their usages in many downstream applications. Particularly, the form of CAD models having a sequence of extrusion cylinders -- a 2D sk… ▽ More

    Submitted 29 May, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: CVPR 2022

  32. arXiv:2112.06390  [pdf, other

    cs.CV

    PartGlot: Learning Shape Part Segmentation from Language Reference Games

    Authors: Juil Koo, Ian Huang, Panos Achlioptas, Leonidas Guibas, Minhyuk Sung

    Abstract: We introduce PartGlot, a neural framework and associated architectures for learning semantic part segmentation of 3D shape geometry, based solely on part referential language. We exploit the fact that linguistic descriptions of a shape can provide priors on the shape's parts -- as natural language has evolved to reflect human perception of the compositional structure of objects, essential to their… ▽ More

    Submitted 30 March, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: CVPR 2022 (Oral)

  33. arXiv:2112.00584  [pdf, other

    cs.GR cs.CV cs.LG

    The Shape Part Slot Machine: Contact-based Reasoning for Generating 3D Shapes from Parts

    Authors: Kai Wang, Paul Guerrero, Vladimir Kim, Siddhartha Chaudhuri, Minhyuk Sung, Daniel Ritchie

    Abstract: We present the Shape Part Slot Machine, a new method for assembling novel 3D shapes from existing parts by performing contact-based reasoning. Our method represents each shape as a graph of ``slots,'' where each slot is a region of contact between two shape parts. Based on this representation, we design a graph-neural-network-based model for generating new slot graphs and retrieving compatible par… ▽ More

    Submitted 21 July, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: European Conference on Computer Vision (ECCV) 2022

  34. arXiv:2111.10584  [pdf

    cs.CL cs.IR

    Improving Tagging Consistency and Entity Coverage for Chemical Identification in Full-text Articles

    Authors: Hyunjae Kim, Mujeen Sung, Won** Yoon, Sungjoon Park, Jaewoo Kang

    Abstract: This paper is a technical report on our system submitted to the chemical identification task of the BioCreative VII Track 2 challenge. The main feature of this challenge is that the data consists of full-text articles, while current datasets usually consist of only titles and abstracts. To effectively address the problem, we aim to improve tagging consistency and entity coverage using various meth… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: BioCreative VII Challenge Evaluation Workshop

  35. arXiv:2111.08400  [pdf, other

    cs.CL cs.SD eess.AS

    Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition

    Authors: Yi-Chang Chen, Chun-Yen Cheng, Chien-An Chen, Ming-Chieh Sung, Yi-Ren Yeh

    Abstract: Due to the recent advances of natural language processing, several works have applied the pre-trained masked language model (MLM) of BERT to the post-correction of speech recognition. However, existing pre-trained models only consider the semantic correction while the phonetic features of words is neglected. The semantic-only post-correction will consequently decrease the performance since homopho… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  36. arXiv:2109.07154  [pdf, other

    cs.CL

    Can Language Models be Biomedical Knowledge Bases?

    Authors: Mujeen Sung, **hyuk Lee, Sean Yi, Minji Jeon, Sungdong Kim, Jaewoo Kang

    Abstract: Pre-trained language models (LMs) have become ubiquitous in solving various natural language processing (NLP) tasks. There has been increasing interest in what knowledge these LMs contain and how we can extract that knowledge, treating LMs as knowledge bases (KBs). While there has been much work on probing LMs in the general domain, there has been little attention to whether these powerful LMs can… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021. Code available at https://github.com/dmis-lab/BioLAMA

  37. arXiv:2109.02259  [pdf, other

    cs.CV

    CTRL-C: Camera calibration TRansformer with Line-Classification

    Authors: **woo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim

    Abstract: Single image camera calibration is the task of estimating the camera parameters from a single input image, such as the vanishing points, focal length, and horizon line. In this work, we propose Camera calibration TRansformer with Line-Classification (CTRL-C), an end-to-end neural network-based approach to single image camera calibration, which directly estimates the camera parameters from an image… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: Accepted to ICCV 2021

  38. arXiv:2109.00113  [pdf, other

    cs.CV

    CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point Clouds

    Authors: Eric-Tuan LĂȘ, Minhyuk Sung, Duygu Ceylan, Radomir Mech, Tamy Boubekeur, Niloy J. Mitra

    Abstract: Representing human-made objects as a collection of base primitives has a long history in computer vision and reverse engineering. In the case of high-resolution point cloud scans, the challenge is to be able to detect both large primitives as well as those explaining the detailed parts. While the classical RANSAC approach requires case-specific parameter tuning, state-of-the-art networks are limit… ▽ More

    Submitted 6 September, 2021; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: ICCV 2021: 15 pages, 8 figures

    Journal ref: ICCV 2021

  39. arXiv:2102.09105  [pdf, other

    cs.CV cs.GR

    DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes with Biharmonic Coordinates

    Authors: Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su

    Abstract: We propose DeepMetaHandles, a 3D conditional generative model based on mesh deformation. Given a collection of 3D meshes of a category and their deformation handles (control points), our method learns a set of meta-handles for each shape, which are represented as combinations of the given handles. The disentangled meta-handles factorize all the plausible deformations of the shape, while each of th… ▽ More

    Submitted 28 March, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: CVPR2021 oral

  40. arXiv:2101.07889  [pdf, other

    cs.CV

    Joint Learning of 3D Shape Retrieval and Deformation

    Authors: Mikaela Angelina Uy, Vladimir G. Kim, Minhyuk Sung, Noam Aigerman, Siddhartha Chaudhuri, Leonidas Guibas

    Abstract: We propose a novel technique for producing high-quality 3D models that match a given target object image or scan. Our method is based on retrieving an existing shape from a database of 3D models and then deforming its parts to match the target shape. Unlike previous approaches that independently focus on either shape retrieval or deformation, we propose a joint learning procedure that simultaneous… ▽ More

    Submitted 13 April, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: CVPR '21 accepted paper

  41. arXiv:2101.06605  [pdf, other

    cs.CV cs.LG

    MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

    Authors: Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas Guibas

    Abstract: We present MultiBodySync, a novel, end-to-end trainable multi-body motion segmentation and rigid registration framework for multiple input 3D point clouds. The two non-trivial challenges posed by this multi-scan multibody setting that we investigate are: (i) guaranteeing correspondence and segmentation consistency across multiple input point clouds capturing different spatial arrangements of bodie… ▽ More

    Submitted 28 March, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: Contact: huang-jh18<at>mails<dot>tsinghua<dot>edu<dot>cn

  42. arXiv:2012.12624  [pdf, other

    cs.CL

    Learning Dense Representations of Phrases at Scale

    Authors: **hyuk Lee, Mujeen Sung, Jaewoo Kang, Danqi Chen

    Abstract: Open-domain question answering can be reformulated as a phrase retrieval problem, without the need for processing documents on-demand during inference (Seo et al., 2019). However, current phrase retrieval models heavily depend on sparse representations and still underperform retriever-reader approaches. In this work, we show for the first time that we can learn dense representations of phrases alo… ▽ More

    Submitted 2 June, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: ACL 2021. Code available at https://github.com/princeton-nlp/DensePhrases

  43. arXiv:2011.14062  [pdf, other

    eess.AS cs.CL cs.SD

    Unsupervised Spoken Term Discovery Based on Re-clustering of Hypothesized Speech Segments with Siamese and Triplet Networks

    Authors: Man-Ling Sung, Tan Lee

    Abstract: Spoken term discovery from untranscribed speech audio could be achieved via a two-stage process. In the first stage, the unlabelled speech is decoded into a sequence of subword units that are learned and modelled in an unsupervised manner. In the second stage, partial sequence matching and clustering are performed on the decoded subword sequences, resulting in a set of discovered words or phrases.… ▽ More

    Submitted 2 June, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

  44. arXiv:2011.14060  [pdf, other

    eess.AS

    Unsupervised Spoken Term Discovery on Untranscribed Speech

    Authors: Man-Ling Sung

    Abstract: (Part of the abstract) In this thesis, we investigate the use of unsupervised spoken term discovery in tackling this problem. Unsupervised spoken term discovery aims to discover topic-related terminologies in a speech without knowing the phonetic properties of the language and content. It can be further divided into two parts: Acoustic segment modelling (ASM) and unsupervised pattern discovery. AS… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

    Comments: Thesis submitted in September 2019 for the M.Phil degree in Electronic Engineering at The Chinese University of Hong Kong (CUHK)

  45. Unsupervised Pattern Discovery from Thematic Speech Archives Based on Multilingual Bottleneck Features

    Authors: Man-Ling Sung, Siyuan Feng, Tan Lee

    Abstract: The present study tackles the problem of automatically discovering spoken keywords from untranscribed audio archives without requiring word-by-word speech transcription by automatic speech recognition (ASR) technology. The problem is of practical significance in many applications of speech analytics, including those concerning low-resource languages, and large amount of multilingual and multi-genr… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 8 pages, accepted and presented in APSIPA-APC 2018. This work was done when Man-Ling Sung and Siyuan Feng were postgraduate students in the Chinese University of Hong Kong

  46. Pandemics are catalysts of scientific novelty: Evidence from COVID-19

    Authors: Meijun Liu, Yi Bu, Chongyan Chen, Jian Xu, Daifeng Li, Yan Leng, Richard Barry Freeman, Eric Meyer, Won** Yoon, Mujeen Sung, Minbyul Jeong, **hyuk Lee, Jaewoo Kang, Chao Min, Min Song, Yujia Zhai, Ying Ding

    Abstract: Scientific novelty drives the efforts to invent new vaccines and solutions during the pandemic. First-time collaboration and international collaboration are two pivotal channels to expand teams' search activities for a broader scope of resources required to address the global challenge, which might facilitate the generation of novel ideas. Our analysis of 98,981 coronavirus papers suggests that sc… ▽ More

    Submitted 14 November, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: 19 pages, 3 figures

    ACM Class: J.4

  47. arXiv:2009.01456  [pdf, other

    cs.GR

    DeformSyncNet: Deformation Transfer via Synchronized Shape Deformation Spaces

    Authors: Minhyuk Sung, Zhenyu Jiang, Panos Achlioptas, Niloy J. Mitra, Leonidas J. Guibas

    Abstract: Shape deformation is an important component in any geometry processing toolbox. The goal is to enable intuitive deformations of single or multiple shapes or to transfer example deformations to new shapes while preserving the plausibility of the deformed shape(s). Existing approaches assume access to point-level or part-level correspondence or establish them in a preprocessing phase, thus limiting… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: SIGGRAPH Asia 2020

  48. arXiv:2008.07760  [pdf, other

    cs.CV

    Pix2Surf: Learning Parametric 3D Surface Models of Objects from Images

    Authors: Jiahui Lei, Srinath Sridhar, Paul Guerrero, Minhyuk Sung, Niloy Mitra, Leonidas J. Guibas

    Abstract: We investigate the problem of learning to generate 3D parametric surface representations for novel object instances, as seen from one or more views. Previous work on learning shape reconstruction from multiple views uses discrete representations such as point clouds or voxels, while continuous surface generation approaches lack multi-view consistency. We address these issues by designing neural ne… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  49. arXiv:2007.11855  [pdf, other

    cs.CV cs.LG

    Neural Geometric Parser for Single Image Camera Calibration

    Authors: **woo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim

    Abstract: We propose a neural geometric parser learning single image camera calibration for man-made scenes. Unlike previous neural approaches that rely only on semantic cues obtained from neural networks, our approach considers both semantic and geometric cues, resulting in significant accuracy improvement. The proposed framework consists of two networks. Using line segments of an image as geometric cues,… ▽ More

    Submitted 23 July, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  50. arXiv:2007.00217  [pdf, ps, other

    cs.CL

    Transferability of Natural Language Inference to Biomedical Question Answering

    Authors: Minbyul Jeong, Mujeen Sung, Gangwoo Kim, Donghyeon Kim, Won** Yoon, Jaehyo Yoo, Jaewoo Kang

    Abstract: Biomedical question answering (QA) is a challenging task due to the scarcity of data and the requirement of domain expertise. Pre-trained language models have been used to address these issues. Recently, learning relationships between sentence pairs has been proved to improve performance in general QA. In this paper, we focus on applying BioBERT to transfer the knowledge of natural language infere… ▽ More

    Submitted 17 February, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: submit for the 8th BioASQ workshop 2020