Skip to main content

Showing 1–16 of 16 results for author: Feng, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.18922  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    DPO Meets PPO: Reinforced Token Optimization for RLHF

    Authors: Han Zhong, Guhao Feng, Wei Xiong, Li Zhao, Di He, Jiang Bian, Liwei Wang

    Abstract: In the classical Reinforcement Learning from Human Feedback (RLHF) framework, Proximal Policy Optimization (PPO) is employed to learn from sparse, sentence-level rewards -- a challenging scenario in traditional deep reinforcement learning. Despite the great successes of PPO in the alignment of state-of-the-art closed-source large language models (LLMs), its open-source implementation is still larg… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2402.13934  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Do Efficient Transformers Really Save Computation?

    Authors: Kai Yang, Jan Ackermann, Zhenyu He, Guhao Feng, Bohang Zhang, Yunzhen Feng, Qiwei Ye, Di He, Liwei Wang

    Abstract: As transformer-based language models are trained on increasingly large datasets and with vast numbers of parameters, finding more efficient alternatives to the standard Transformer has become very valuable. While many efficient Transformers and Transformer alternatives have been proposed, none provide theoretical guarantees that they are a suitable replacement for the standard Transformer. This ma… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2401.16421  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

    Authors: Zhenyu He, Guhao Feng, Shengjie Luo, Kai Yang, Liwei Wang, **g**g Xu, Zhi Zhang, Hongxia Yang, Di He

    Abstract: In this work, we leverage the intrinsic segmentation of language sequences and design a new positional encoding method called Bilevel Positional Encoding (BiPE). For each position, our BiPE blends an intra-segment encoding and an inter-segment encoding. The intra-segment encoding identifies the locations within a segment and helps the model capture the semantic information therein via absolute pos… ▽ More

    Submitted 17 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 17 pages, 7 figures, 8 tables; ICML 2024 Camera Ready version; Code: https://github.com/zhenyuhe00/BiPE

  4. arXiv:2312.17248  [pdf, other

    cs.LG cs.AI cs.CC cs.DS stat.ML

    Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity

    Authors: Guhao Feng, Han Zhong

    Abstract: Reinforcement Learning (RL) encompasses diverse paradigms, including model-based RL, policy-based RL, and value-based RL, each tailored to approximate the model, optimal policy, and optimal value function, respectively. This work investigates the potential hierarchy of representation complexity -- the complexity of functions to be represented -- among these RL paradigms. We first demonstrate that,… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  5. arXiv:2305.15408  [pdf, other

    cs.LG cs.CC cs.CL stat.ML

    Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective

    Authors: Guhao Feng, Bohang Zhang, Yuntian Gu, Haotian Ye, Di He, Liwei Wang

    Abstract: Recent studies have discovered that Chain-of-Thought prompting (CoT) can dramatically improve the performance of Large Language Models (LLMs), particularly when dealing with complex tasks involving mathematics or reasoning. Despite the enormous empirical success, the underlying mechanisms behind CoT and how it unlocks the potential of LLMs remain elusive. In this paper, we take a first step toward… ▽ More

    Submitted 22 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 42 pages; Camera-ready version for NeurIPS 2023 (Oral Presentation)

  6. arXiv:2105.13841   

    cs.LG cs.AI stat.ML

    A General Taylor Framework for Unifying and Revisiting Attribution Methods

    Authors: Huiqi Deng, Na Zou, Mengnan Du, Weifu Chen, Guocan Feng, Xia Hu

    Abstract: Attribution methods provide an insight into the decision-making process of machine learning models, especially deep neural networks, by assigning contribution scores to each individual feature. However, the attribution problem has not been well-defined, which lacks a unified guideline to the contribution assignment process. Furthermore, existing attribution methods often built upon various empiric… ▽ More

    Submitted 25 February, 2023; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: In the current version, the author information is not complete and there are some mathematical errors in the proof. We need to correct errors and add all co-authors who contribute to the paper. Therefore, we hope to withdraw the manuscript

  7. arXiv:2008.09695  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    A Unified Taylor Framework for Revisiting Attribution Methods

    Authors: Huiqi Deng, Na Zou, Mengnan Du, Weifu Chen, Guocan Feng, Xia Hu

    Abstract: Attribution methods have been developed to understand the decision-making process of machine learning models, especially deep neural networks, by assigning importance scores to individual features. Existing attribution methods often built upon empirical intuitions and heuristics. There still lacks a general and theoretical framework that not only can unify these attribution methods, but also theor… ▽ More

    Submitted 13 April, 2021; v1 submitted 21 August, 2020; originally announced August 2020.

  8. arXiv:2005.03622  [pdf, ps, other

    cs.IT cs.LG eess.SP math.ST stat.ML

    Nonparametric Estimation of the Fisher Information and Its Applications

    Authors: Wei Cao, Alex Dytso, Michael Fauß, H. Vincent Poor, Gang Feng

    Abstract: This paper considers the problem of estimation of the Fisher information for location from a random sample of size $n$. First, an estimator proposed by Bhattacharya is revisited and improved convergence rates are derived. Second, a new estimator, termed a clipped estimator, is proposed. Superior upper bounds on the rates of convergence can be shown for the new estimator compared to the Bhattachary… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

  9. The Learning of Fuzzy Cognitive Maps With Noisy Data: A Rapid and Robust Learning Method With Maximum Entropy

    Authors: Guoliang Feng, Wei Lu, Witold Pedrycz, Jianhua Yang, Xiaodong Liu

    Abstract: Numerous learning methods for fuzzy cognitive maps (FCMs), such as the Hebbian-based and the population-based learning methods, have been developed for modeling and simulating dynamic systems. However, these methods are faced with several obvious limitations. Most of these models are extremely time consuming when learning the large-scale FCMs with hundreds of nodes. Furthermore, the FCMs learned b… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: The manuscript has been published on IEEE Transactions on Cybernetics

  10. arXiv:1902.01015  [pdf, other

    econ.EM stat.AP

    Factor Investing: A Bayesian Hierarchical Approach

    Authors: Guanhao Feng, **gyu He

    Abstract: This paper investigates asset allocation problems when returns are predictable. We introduce a market-timing Bayesian hierarchical (BH) approach that adopts heterogeneous time-varying coefficients driven by lagged fundamental characteristics. Our approach includes a joint estimation of conditional expected returns and covariance matrix and considers estimation risk for portfolio analysis. The hier… ▽ More

    Submitted 17 September, 2020; v1 submitted 3 February, 2019; originally announced February 2019.

  11. arXiv:1807.02164  [pdf, ps, other

    cs.HC cs.LG stat.ML

    V-CNN: When Convolutional Neural Network encounters Data Visualization

    Authors: Mao Yang, Bo Li, Guanxiong Feng, Zhongjiang Yan

    Abstract: In recent years, deep learning poses a deep technical revolution in almost every field and attracts great attentions from industry and academia. Especially, the convolutional neural network (CNN), one representative model of deep learning, achieves great successes in computer vision and natural language processing. However, simply or blindly applying CNN to the other fields results in lower traini… ▽ More

    Submitted 12 June, 2018; originally announced July 2018.

    Comments: 2 pages, 2 figures, submitted to ACM Sigcomm 2018

  12. arXiv:1805.01104  [pdf, other

    stat.ME

    Deep Learning in Characteristics-Sorted Factor Models

    Authors: Guanhao Feng, **gyu He, Nicholas G. Polson, Jianeng Xu

    Abstract: This paper presents an augmented deep factor model that generates latent factors for cross-sectional asset pricing. The conventional security sorting on firm characteristics for constructing long-short factor portfolio weights is nonlinear modeling, while factors are treated as inputs in linear models. We provide a structural deep learning framework to generalize the complete mechanism for fitting… ▽ More

    Submitted 19 July, 2023; v1 submitted 2 May, 2018; originally announced May 2018.

  13. arXiv:1804.09314  [pdf, other

    stat.ML cs.LG econ.EM

    Deep Learning for Predicting Asset Returns

    Authors: Guanhao Feng, **gyu He, Nicholas G. Polson

    Abstract: Deep learning searches for nonlinear factors for predicting asset returns. Predictability is achieved via multiple layers of composite factors as opposed to additive ones. Viewed in this way, asset pricing studies can be revisited using multi-layer deep learners, such as rectified linear units (ReLU) or long-short-term-memory (LSTM) for time-series effects. State-of-the-art algorithms including st… ▽ More

    Submitted 26 April, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

  14. arXiv:1709.00379  [pdf, ps, other

    stat.ML

    Sparse Regularization in Marketing and Economics

    Authors: Guanhao Feng, Nicholas Polson, Yuexi Wang, Jianeng Xu

    Abstract: Sparse alpha-norm regularization has many data-rich applications in Marketing and Economics. Alpha-norm, in contrast to lasso and ridge regularization, jumps to a sparse solution. This feature is attractive for ultra high-dimensional problems that occur in demand estimation and forecasting. The alpha-norm objective is nonconvex and requires coordinate descent and proximal operators to find the spa… ▽ More

    Submitted 5 February, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

  15. arXiv:1606.01701  [pdf, ps, other

    stat.ME

    Regularizing Bayesian Predictive Regressions

    Authors: Guanhao Feng, Nicholas G. Polson

    Abstract: We show that regularizing Bayesian predictive regressions provides a framework for prior sensitivity analysis. We develop a procedure that jointly regularizes expectations and variance-covariance matrices using a pair of shrinkage priors. Our methodology applies directly to vector autoregressions (VAR) and seemingly unrelated regressions (SUR). The regularization path provides a prior sensitivity… ▽ More

    Submitted 13 September, 2017; v1 submitted 6 June, 2016; originally announced June 2016.

  16. The Market for English Premier League (EPL) Odds

    Authors: Guanhao Feng, Nicholas G. Polson, Jianeng Xu

    Abstract: This paper employs a Skellam process to represent real-time betting odds for English Premier League (EPL) soccer games. Given a matrix of market odds on all possible score outcomes, we estimate the expected scoring rates for each team. The expected scoring rates then define the implied volatility of an EPL game. As events in the game evolve, we re-estimate the expected scoring rates and our implie… ▽ More

    Submitted 5 January, 2017; v1 submitted 12 April, 2016; originally announced April 2016.

    Journal ref: Journal of Quantitative Analysis in Sports, 12.4 (2017): 167-178