Showing 1–7 of 7 results for author: Tam, Z

  1. arXiv:2406.08747

    cs.CL

    StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

    Authors: Cheng-Kuang Wu, Zhi Rui Tam, Chieh-Yen Lin, Yun-Nung Chen, Hung-yi Lee

    Abstract: Recent works have shown that large language model (LLM) agents are able to improve themselves from experience, which is an important ability for continuous enhancement post-deployment. However, existing benchmarks primarily evaluate their innate capabilities and do not assess their ability to improve over time. To address this gap, we introduce StreamBench, a pioneering benchmark designed to evalu…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2404.17668

    cs.RO

    Precise Object Placement Using Force-Torque Feedback

    Authors: Osher Lerner, Zachary Tam, Michael Equi

    Abstract: Precise object manipulation and placement is a common problem for household robots, surgery robots, and robots working on in-situ construction. Prior work using computer vision, depth sensors, and reinforcement learning lacks the ability to reactively recover from planning errors, execution errors, or sensor noise. This work introduces a method that uses force-torque sensing to robustly place obje…

    Submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2403.01858

    cs.CL

    An Improved Traditional Chinese Evaluation Suite for Foundation Model

    Authors: Zhi-Rui Tam, Ya-Ting Pai, Yen-Wei Lee, Sega Cheng, Hong-Han Shuai

    Abstract: We present TMMLU+, a comprehensive dataset designed for the Traditional Chinese massive multitask language understanding dataset. TMMLU+ is a multiple-choice question-answering dataset with 66 subjects from elementary to professional level. Compared to its predecessor, TMMLU, TMMLU+ is six times larger and boasts a more balanced subject distribution. We included benchmark results in TMMLU+ from cl…

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2304.07327

    cs.CL cs.AI

    OpenAssistant Conversations -- Democratizing Large Language Model Alignment

    Authors: Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick

    Abstract: Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their acce…

    Submitted 31 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Published in NeurIPS 2023 Datasets and Benchmarks

    Report number: V-02; ACM Class: I.2

  5. arXiv:2207.02347

    cs.RO

    Mechanical Search on Shelves with Efficient Stacking and Destacking of Objects

    Authors: Huang Huang, Letian Fu, Michael Danielczuk, Chung Min Kim, Zachary Tam, Jeffrey Ichnowski, Anelia Angelova, Brian Ichter, Ken Goldberg

    Abstract: Stacking increases storage efficiency in shelves, but the lack of visibility and accessibility makes the mechanical search problem of revealing and extracting target objects difficult for robots. In this paper, we extend the lateral-access mechanical search problem to shelves with stacked items and introduce two novel policies -- Distribution Area Reduction for Stacked Scenes (DARSS) and Monte Car…

    Submitted 5 July, 2022; originally announced July 2022.

  6. arXiv:2201.08968

    cs.RO cs.LG

    Mechanical Search on Shelves using a Novel "Bluction" Tool

    Authors: Huang Huang, Michael Danielczuk, Chung Min Kim, Letian Fu, Zachary Tam, Jeffrey Ichnowski, Anelia Angelova, Brian Ichter, Ken Goldberg

    Abstract: Shelves are common in homes, warehouses, and commercial settings due to their storage efficiency. However, this efficiency comes at the cost of reduced visibility and accessibility. When looking from a side (lateral) view of a shelf, most objects will be fully occluded, resulting in a constrained lateral-access mechanical search problem. To address this problem, we introduce: (1) a novel bluction…

    Submitted 22 January, 2022; originally announced January 2022.

  7. arXiv:2109.02235

    cs.LG

    Gradient Normalization for Generative Adversarial Networks

    Authors: Yi-Lun Wu, Hong-Han Shuai, Zhi-Rui Tam, Hong-Yu Chiu

    Abstract: In this paper, we propose a novel normalization method called gradient normalization (GN) to tackle the training instability of Generative Adversarial Networks (GANs) caused by the sharp gradient space. Unlike existing work such as gradient penalty and spectral normalization, the proposed GN only imposes a hard 1-Lipschitz constraint on the discriminator function, which increases the capacity of t…

    Submitted 10 October, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Published as a conference paper at ICCV 2021