Skip to main content

Showing 1–11 of 11 results for author: Cao, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.01252  [pdf, other

    cs.CL cs.AI stat.ML

    Towards Scalable Automated Alignment of LLMs: A Survey

    Authors: Boxi Cao, Keming Lu, Xinyu Lu, Jiawei Chen, Mengjie Ren, Hao Xiang, Peilin Liu, Yaojie Lu, Ben He, Xianpei Han, Le Sun, Hongyu Lin, Bowen Yu

    Abstract: Alignment is the most critical step in building large language models (LLMs) that meet human needs. With the rapid development of LLMs gradually surpassing human capabilities, traditional alignment methods based on human-annotation are increasingly unable to meet the scalability demands. Therefore, there is an urgent need to explore new sources of automated alignment signals and technical approach… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2006.10533  [pdf

    stat.ME q-bio.PE q-bio.QM

    Endpoints for randomized controlled clinical trials for COVID-19 treatments

    Authors: Lori E Dodd, Dean Follmann, **g Wang, Franz Koenig, Lisa L Korn, Christian Schoergenhofer, Michael Proschan, Sally Hunsberger, Tyler Bonnett, Mat Makowski, Drifa Belhadi, Yeming Wang, Bin Cao, France Mentre, Thomas Jaki

    Abstract: Introduction: Endpoint choice for randomized controlled trials of treatments for COVID-19 is complex. A new disease brings many uncertainties, but trials must start rapidly. COVID-19 is heterogeneous, ranging from mild disease that improves within days to critical disease that can last weeks and can end in death. While improvement in mortality would provide unquestionable evidence about clinical s… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  3. arXiv:1811.05072  [pdf, other

    cs.LG stat.ML

    Private Model Compression via Knowledge Distillation

    Authors: Ji Wang, Weidong Bao, Lichao Sun, Xiaomin Zhu, Bokai Cao, Philip S. Yu

    Abstract: The soaring demand for intelligent mobile applications calls for deploying powerful deep neural networks (DNNs) on mobile devices. However, the outstanding performance of DNNs notoriously relies on increasingly complex models, which in turn is associated with an increase in computational expense far surpassing mobile devices' capacity. What is worse, app service providers need to collect and utili… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: Conference version accepted by AAAI'19

  4. arXiv:1809.04110  [pdf, other

    cs.SI cs.LG stat.ML

    Joint Embedding of Meta-Path and Meta-Graph for Heterogeneous Information Networks

    Authors: Lichao Sun, Lifang He, Zhipeng Huang, Bokai Cao, Congying Xia, Xiaokai Wei, Philip S. Yu

    Abstract: Meta-graph is currently the most powerful tool for similarity search on heterogeneous information networks,where a meta-graph is a composition of meta-paths that captures the complex structural information. However, current relevance computing based on meta-graph only considers the complex structural information, but ignores its embedded meta-paths information. To address this problem, we proposeM… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: accepted by ICBK 18

  5. arXiv:1809.03428  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    Not Just Privacy: Improving Performance of Private Deep Learning in Mobile Cloud

    Authors: Ji Wang, Jianguo Zhang, Weidong Bao, Xiaomin Zhu, Bokai Cao, Philip S. Yu

    Abstract: The increasing demand for on-device deep learning services calls for a highly efficient manner to deploy deep neural networks (DNNs) on mobile devices with limited capacity. The cloud-based solution is a promising approach to enabling deep learning applications on mobile devices where the large portions of a DNN are offloaded to the cloud. However, revealing data to the cloud leads to potential pr… ▽ More

    Submitted 5 January, 2019; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: Conference version accepted by KDD'18

  6. arXiv:1806.07703  [pdf, other

    cs.LG stat.ML

    Multi-View Multi-Graph Embedding for Brain Network Clustering Analysis

    Authors: Ye Liu, Lifang He, Bokai Cao, Philip S. Yu, Ann B. Ragin, Alex D. Leow

    Abstract: Network analysis of human brain connectivity is critically important for understanding brain function and disease states. Embedding a brain network as a whole graph instance into a meaningful low-dimensional representation can be used to investigate disease mechanisms and inform therapeutic interventions. Moreover, by exploiting information from multiple neuroimaging modalities or views, we are ab… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

  7. arXiv:1803.08978  [pdf, other

    cs.LG stat.ML

    Broad Learning for Healthcare

    Authors: Bokai Cao

    Abstract: A broad spectrum of data from different modalities are generated in the healthcare domain every day, including scalar data (e.g., clinical measures collected at hospitals), tensor data (e.g., neuroimages analyzed by research institutes), graph data (e.g., brain connectivity networks), and sequence data (e.g., digital footprints recorded on smart sensors). Capability for modeling information from t… ▽ More

    Submitted 23 March, 2018; originally announced March 2018.

    Comments: PhD Thesis, University of Illinois at Chicago, March 2018

  8. arXiv:1508.04554  [pdf, other

    cs.LG cs.CV cs.CY stat.AP stat.ML

    Mining Brain Networks using Multiple Side Views for Neurological Disorder Identification

    Authors: Bokai Cao, Xiangnan Kong, **gyuan Zhang, Philip S. Yu, Ann B. Ragin

    Abstract: Mining discriminative subgraph patterns from graph data has attracted great interest in recent years. It has a wide variety of applications in disease diagnosis, neuroimaging, etc. Most research on subgraph mining focuses on the graph representation alone. However, in many real-world applications, the side information is available along with the graph data. For example, for neurological disorder i… ▽ More

    Submitted 19 August, 2015; originally announced August 2015.

    Comments: in Proceedings of IEEE International Conference on Data Mining (ICDM) 2015

  9. arXiv:1508.01023  [pdf, other

    cs.LG cs.CE cs.DB q-bio.NC stat.AP

    A review of heterogeneous data mining for brain disorders

    Authors: Bokai Cao, Xiangnan Kong, Philip S. Yu

    Abstract: With rapid advances in neuroimaging techniques, the research on brain disorder identification has become an emerging area in the data mining community. Brain disorder data poses many unique challenges for data mining research. For example, the raw data generated by neuroimaging experiments is in tensor representations, with typical characteristics of high dimensionality, structural complexity and… ▽ More

    Submitted 5 August, 2015; originally announced August 2015.

  10. Multi-View Factorization Machines

    Authors: Bokai Cao, Hucheng Zhou, Guoqiang Li, Philip S. Yu

    Abstract: For a learning task, data can usually be collected from different sources or be represented from multiple views. For example, laboratory results from different medical examinations are available for disease diagnosis, and each of them can only reflect the health state of a person from a particular aspect/view. Therefore, different views provide complementary information for learning tasks. An effe… ▽ More

    Submitted 23 March, 2018; v1 submitted 2 June, 2015; originally announced June 2015.

    Comments: WSDM 2016

    ACM Class: H.2.8

  11. arXiv:1305.4433  [pdf, other

    cs.LG stat.ML

    Meta Path-Based Collective Classification in Heterogeneous Information Networks

    Authors: Xiangnan Kong, Bokai Cao, Philip S. Yu, Ying Ding, David J. Wild

    Abstract: Collective classification has been intensively studied due to its impact in many important applications, such as web mining, bioinformatics and citation analysis. Collective classification approaches exploit the dependencies of a group of linked objects whose class labels are correlated and need to be predicted simultaneously. In this paper, we focus on studying the collective classification probl… ▽ More

    Submitted 20 May, 2013; originally announced May 2013.