Showing 1–17 of 17 results for author: Choe, J H

Searching in archive cs.
  1. arXiv:2405.03685  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Language-Image Models with 3D Understanding

    Authors: Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone

    Abstract: Multi-modal large language models (MLLMs) have shown incredible capabilities in a variety of 2D vision and language tasks. We extend MLLMs' perceptual capabilities to ground and reason about images in 3-dimensional space. To that end, we first develop a large-scale pre-training dataset for 2D and 3D called LV3D by combining multiple existing 2D and 3D recognition datasets under a common task formu…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Project page: https://janghyuncho.github.io/Cube-LLM

  2. arXiv:2402.03277  [pdf, other]

    cs.IR

    Event-based Product Carousel Recommendation with Query-Click Graph

    Authors: Luyi Ma, Nimesh Sinha, Parth Vajge, Jason HD Cho, Sushant Kumar, Kannan Achan

    Abstract: Many current recommender systems mainly focus on the product-to-product recommendations and user-to-product recommendations even during the time of events rather than modeling the typical recommendations for the target event (e.g., festivals, seasonal activities, or social activities) without addressing the multiple aspects of the shopping demands for the target event. Product recommendations for…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 7 pages, 2 figures, 2021 IEEE International Conference on Big Data (Big Data)

  3. arXiv:2311.17902  [pdf, other]

    cs.CV

    Language-conditioned Detection Transformer

    Authors: Jang Hyun Cho, Philipp Krähenbühl

    Abstract: We present a new open-vocabulary detection framework. Our framework uses both image-level labels and detailed detection annotations when available. Our framework proceeds in three steps. We first train a language-conditioned object detector on fully-supervised detection data. This detector gets to see the presence or absence of ground truth classes during training, and conditions prediction on the…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Code is at https://github.com/janghyuncho/DECOLA

  4. arXiv:2305.09858  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG

    Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs

    Authors: Jiao Chen, Luyi Ma, Xiaohan Li, Nikhil Thakurdesai, Jianpeng Xu, Jason H. D. Cho, Kaushiki Nag, Evren Korpeoglu, Sushant Kumar, Kannan Achan

    Abstract: Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains…

    Submitted 16 May, 2023; originally announced May 2023.

  5. arXiv:2301.09724  [pdf, other]

    cs.CV cs.LG

    Long-tail Detection with Effective Class-Margins

    Authors: Jang Hyun Cho, Philipp Krähenbühl

    Abstract: Large-scale object detection and instance segmentation face a severe data imbalance. The finer-grained object classes become, the less frequent they appear in our datasets. However, at test-time, we expect a detector that performs well for all classes and not just the most frequent ones. In this paper, we provide a theoretical understanding of the long-tail detection problem. We show how the comm…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: ECCV 2022 Oral. Code is available at https://github.com/janghyuncho/ECM-Loss

  6. arXiv:2212.06137  [pdf, other]

    cs.CV

    NMS Strikes Back

    Authors: Jeffrey Ouyang-Zhang, Jang Hyun Cho, Xingyi Zhou, Philipp Krähenbühl

    Abstract: Detection Transformer (DETR) directly transforms queries to unique objects by using one-to-one bipartite matching during training and enables end-to-end object detection. Recently, these models have surpassed traditional detectors on COCO with undeniable elegance. However, they differ from traditional detectors in multiple designs, including model architecture and training schedules, and thus the…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Code is available at https://github.com/jozhang97/DETA

  7. NEAT: A Label Noise-resistant Complementary Item Recommender System with Trustworthy Evaluation

    Authors: Luyi Ma, Jianpeng Xu, Jason H. D. Cho, Evren Korpeoglu, Sushant Kumar, Kannan Achan

    Abstract: The complementary item recommender system (CIRS) recommends the complementary items for a given query item. Existing CIRS models consider the item co-purchase signal as a proxy of the complementary relationship due to the lack of human-curated labels from the huge transaction records. These methods represent items in a complementary embedding space and model the complementary relationship as a poi…

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 11 pages, 4 figures; Published in: 2021 IEEE International Conference on Big Data (Big Data)

  8. arXiv:2103.17070  [pdf, other]

    cs.CV

    PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering

    Authors: Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan

    Abstract: We present a new framework for semantic segmentation without annotations via clustering. Off-the-shelf clustering methods are limited to curated, single-label, and object-centric images yet real-world data are dominantly uncurated, multi-label, and scene-centric. We extend clustering from images to pixels and assign separate cluster membership to different instances within each image. However, sol…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  9. arXiv:2101.01003  [pdf, ps, other]

    cs.IT

    Complete solution over $\GF{p^n}$ of the equation $X^{p^k+1}+X+a=0$

    Authors: Kwang Ho Kim, Jong Hyok Choe, Sihem Mesnager

    Abstract: The problem of solving explicitly the equation $P_a(X):=X^{q+1}+X+a=0$ over the finite field $\GF{Q}$, where $Q=p^n$, $q=p^k$ and $p$ is a prime, arises in many different contexts including finite geometry, the inverse Galois problem \cite{ACZ2000}, the construction of difference sets with Singer parameters \cite{DD2004}, determining cross-correlation between $m$-sequences \cite{DOBBERTIN2006} and…

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: arXiv admin note: text overlap with arXiv:1912.12648

    MSC Class: 12E05; 12E12; 12E10

  10. arXiv:2011.10954  [pdf, ps, other]

    cs.IT cs.CR

    Preimages of $p-$Linearized Polynomials over $\GF{p}$

    Authors: Kwang Ho Kim, Sihem Mesnager, Jong Hyok Choe, Dok Nam Lee

    Abstract: Linearized polynomials over finite fields have been intensively studied over the last several decades. Interesting new applications of linearized polynomials to coding theory and finite geometry have been also highlighted in recent years. Let $p$ be any prime. Recently, preimages of the $p-$linearized polynomials $\sum_{i=0}^{\frac kl-1} X^{p^{li}}$ and…

    Submitted 22 November, 2020; originally announced November 2020.

    MSC Class: 11D04; 12E05; 12E12

  11. arXiv:2010.10986  [pdf]

    physics.app-ph cs.LG

    Highly-scalable stochastic neuron based on Ovonic Threshold Switch (OTS) and its applications in Restricted Boltzmann Machine (RBM)

    Authors: Seong-il Im, Hyejin Lee, Jaesang Lee, Jae-Seung Jeong, Joon Young Kwak, Keunsu Kim, Jeong Ho Cho, Hyunsu Ju, Suyoun Lee

    Abstract: Interest in Restricted Boltzmann Machine (RBM) is growing as a generative stochastic artificial neural network to implement a novel energy-efficient machine-learning (ML) technique. For a hardware implementation of the RBM, an essential building block is a reliable stochastic binary neuron device that generates random spikes following the Boltzmann distribution. Here, we propose a highly-scalable…

    Submitted 21 October, 2020; originally announced October 2020.

  12. arXiv:2002.04912  [pdf, ps, other]

    cs.IT math.NT

    Solving Some Affine Equations over Finite Fields

    Authors: Sihem Mesnager, Kwang Ho Kim, Jong Hyok Choe, Dok Nam Lee

    Abstract: Let $l$ and $k$ be two integers such that $l|k$. Define $T_l^k(X):=X+X^{p^l}+\cdots+X^{p^{l(k/l-2)}}+X^{p^{l(k/l-1)}}$ and $S_l^k(X):=X-X^{p^l}+\cdots+(-1)^{(k/l-1)}X^{p^{l(k/l-1)}}$, where $p$ is any prime. This paper gives explicit representations of all solutions in $\GF{p^n}$ to the affine equations $T_l^{k}(X)=a$ and $S_l^{k}(X)=a$, $a\in \GF{p^n}$. For the case $p=2$ that was solved very r…

    Submitted 12 February, 2020; originally announced February 2020.

  13. arXiv:1910.01348  [pdf, other]

    cs.LG cs.CV

    On the Efficacy of Knowledge Distillation

    Authors: Jang Hyun Cho, Bharath Hariharan

    Abstract: In this paper, we present a thorough evaluation of the efficacy of knowledge distillation and its dependence on student and teacher architectures. Starting with the observation that more accurate teachers often don't make good teachers, we attempt to tease apart the factors that affect knowledge distillation performance. We find crucially that larger models do not often make better teachers. We sh…

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: 13 pages, including Appendix

    Journal ref: ICCV 2019

  14. arXiv:1905.10579  [pdf, ps, other]

    cs.IT math.NT

    Solutions of $x^{q^k}+\cdots+x^{q}+x=a$ in $\GF{2^n}$

    Authors: Kwang Ho Kim, Jong Hyok Choe, Dok Nam Lee, Dae Song Go, Sihem Mesnager

    Abstract: Though it is well known that the roots of any affine polynomial over a finite field can be computed by a system of linear equations by using a normal base of the field, such solving approach appears to be difficult to apply when the field is fairly large. Thus, it may be of great interest to find an explicit representation of the solutions independently of the field base. This was previously done…

    Submitted 25 May, 2019; originally announced May 2019.

  15. arXiv:1405.6450  [pdf, ps, other]

    cs.IT

    Joint Transmitter and Receiver Optimization for Improper-Complex Second-Order Stationary Data Sequence

    Authors: Jeongho Yeo, Joon Ho Cho, James S. Lehnert

    Abstract: In this paper, the transmission of an improper-complex second-order stationary data sequence is considered over a strictly band-limited frequency-selective channel. It is assumed that the transmitter employs linear modulation and that the channel output is corrupted by additive proper-complex cyclostationary noise. Under the average transmit power constraint, the problem of minimizing the mean-squ…

    Submitted 25 May, 2014; originally announced May 2014.

  16. arXiv:1304.7375  [pdf, ps, other]

    cs.IT

    Asymptotic FRESH Properizer for Block Processing of Improper-Complex Second-Order Cyclostationary Random Processes

    Authors: Jeongho Yeo, Joon Ho Cho

    Abstract: In this paper, the block processing of a discrete-time (DT) improper-complex second-order cyclostationary (SOCS) random process is considered. In particular, it is of interest to find a pre-processing operation that enables computationally efficient near-optimal post-processing. An invertible linear-conjugate linear (LCL) operator named the DT FREquency Shift (FRESH) properizer is first proposed.…

    Submitted 27 April, 2013; originally announced April 2013.

    Comments: 42 pages, 13 figures

  17. arXiv:1211.6491  [pdf, ps, other]

    cs.IT

    Sum-Rate Optimal Multi-Code CDMA Systems: An Equivalence Result

    Authors: Yeo Hun Yun, Joon Ho Cho

    Abstract: In this paper, the sum rate of a multi-code CDMA system with asymmetric-power users is maximized, given a processing gain and a power profile of users. Unlike the sum-rate maximization for a single-code CDMA system, the optimization requires the joint optimal distribution of each user's power to its multiple data streams as well as the optimal design of signature sequences. The crucial step is to…

    Submitted 27 November, 2012; originally announced November 2012.

    Comments: 66 pages, 7 figures