Skip to main content

Showing 1–14 of 14 results for author: Fu, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2311.08812  [pdf, other

    stat.ME

    Optimal subsampling algorithm for the marginal model with large longitudinal data

    Authors: Haohui Han, Liya Fu

    Abstract: Big data is ubiquitous in practices, and it has also led to heavy computation burden. To reduce the calculation cost and ensure the effectiveness of parameter estimators, an optimal subset sampling method is proposed to estimate the parameters in marginal models with massive longitudinal data. The optimal subsampling probabilities are derived, and the corresponding asymptotic properties are establ… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  2. arXiv:2306.08979  [pdf, other

    stat.ME

    Ranking and Selection in Large-Scale Inference of Heteroscedastic Units

    Authors: Bowen Gang, Luella Fu, Gareth James, Wenguang Sun

    Abstract: The allocation of limited resources to a large number of potential candidates presents a pervasive challenge. In the context of ranking and selecting top candidates from heteroscedastic units, conventional methods often result in over-representations of subpopulations, and this issue is further exacerbated in large-scale settings where thousands of candidates are considered simultaneously. To addr… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 54 pages, 11 figures

  3. arXiv:2111.03885  [pdf, other

    stat.ME stat.AP

    An Empirical Bayes Approach to Controlling the False Discovery Exceedance

    Authors: Pallavi Basu, Luella Fu, Alessio Saretto, Wenguang Sun

    Abstract: In large-scale multiple hypothesis testing problems, the false discovery exceedance (FDX) provides a desirable alternative to the widely used false discovery rate (FDR) when the false discovery proportion (FDP) is highly variable. We develop an empirical Bayes approach to control the FDX. We show that, for independent hypotheses from a two-group model and dependent hypotheses from a Gaussian model… ▽ More

    Submitted 20 April, 2023; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: Updated

  4. arXiv:2105.13600  [pdf, ps, other

    cs.IT cs.NI stat.AP

    Placement Optimization and Power Control in Intelligent Reflecting Surface Aided Multiuser System

    Authors: Bifeng Ling, Jiangbin Lyu, Liqun Fu

    Abstract: Intelligent reflecting surface (IRS) is a new and revolutionary technology capable of reconfiguring the wireless propagation environment by controlling its massive low-cost passive reflecting elements. Different from prior works that focus on optimizing IRS reflection coefficients or single-IRS placement, we aim to maximize the minimum throughput of a single-cell multiuser system aided by multiple… ▽ More

    Submitted 4 November, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: To appear in GLOBECOM 2021. This paper focuses on the multi-IRS placement optimization and downlink AP power control for achieving max-min throughput in a single-cell multi-user system. A ring-based IRS placement scheme is proposed which utilizes the near-AP/near-user deployment modes. Closed-form power control policy is devised to equalize the users' non-outage probability

  5. arXiv:2103.10613  [pdf, ps, other

    stat.ME

    Robust penalized empirical likelihood in high dimensional longitudinal data analysis

    Authors: Jiaqi Li, Liya Fu

    Abstract: As an effective nonparametric method, empirical likelihood (EL) is appealing in combining estimating equations flexibly and adaptively for incorporating data information. To select important variables and estimating equations in the sparse high-dimensional model, we consider a penalized EL method based on robust estimating functions by applying two penalty functions for regularizing the regression… ▽ More

    Submitted 30 June, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: 25 pages, 4 Tables

  6. Robust approach for variable selection with high dimensional Logitudinal data analysis

    Authors: Liya Fu, Jiaqi Li, You-Gan Wang

    Abstract: This paper proposes a new robust smooth-threshold estimating equation to select important variables and automatically estimate parameters for high dimensional longitudinal data. A novel working correlation matrix is proposed to capture correlations within the same subject. The proposed procedure works well when the number of covariates p increases as the number of subjects n increases. The propose… ▽ More

    Submitted 18 May, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 32 pages, 7 tables, 5 figures

    Journal ref: Statistics in Medicine.(2021) 1-20

  7. Boosting Share Routing for Multi-task Learning

    Authors: Xiaokai Chen, Xiaoguang Gu, Libo Fu

    Abstract: Multi-task learning (MTL) aims to make full use of the knowledge contained in multi-task supervision signals to improve the overall performance. How to make the knowledge of multiple tasks shared appropriately is an open problem for MTL. Most existing deep MTL models are based on parameter sharing. However, suitable sharing mechanism is hard to design as the relationship among tasks is complicated… ▽ More

    Submitted 1 March, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

  8. arXiv:2008.07438  [pdf, ps, other

    cs.IT cs.NI eess.SY math.PR stat.AP

    Analysis and Optimization for Large-Scale LoRa Networks: Throughput Fairness and Scalability

    Authors: Jiangbin Lyu, Dan Yu, Liqun Fu

    Abstract: LoRa networks are pivotally enabling Long Range connectivity to low-cost and power-constrained user equipments (UEs) in a wide area, whereas a critical issue is to effectively allocate wireless resources to support potentially massive UEs while resolving the prominent near-far fairness issue, which is challenging due to the lack of tractable analytical model and the practical requirement for low-c… ▽ More

    Submitted 5 November, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: To appear in IEEE IOT Journal. Stochastic geometry-based framework to model/analyze large-scale LoRa networks with channel fading/aggregate interference/packet overlap**/multi-GW reception. Jointly optimize SF/Tx-power/duty-cycle based on channel statistics and UE distribution. Achieve both fairness/power savings and improve cell-edge throughput and spatial (sum) throughput for majority of UEs. arXiv admin note: text overlap with arXiv:1904.12300

  9. arXiv:2005.04288  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Incremental Learning for End-to-End Automatic Speech Recognition

    Authors: Li Fu, Xiaoxiao Li, Libo Zi, Zhengchen Zhang, Youzheng Wu, Xiaodong He, Bowen Zhou

    Abstract: In this paper, we propose an incremental learning method for end-to-end Automatic Speech Recognition (ASR) which enables an ASR system to perform well on new tasks while maintaining the performance on its originally learned ones. To mitigate catastrophic forgetting during incremental learning, we design a novel explainability-based knowledge distillation for ASR models, which is combined with a re… ▽ More

    Submitted 15 September, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: ASRU 2021

  10. arXiv:2003.03948  [pdf, ps, other

    stat.ME

    An efficient Gehan-type estimation for the accelerated failure time model with clustered and censored data

    Authors: Liya Fu, Zhuoran Yang, Yan Zhou, You-Gan Wang

    Abstract: In medical studies, the collected covariates usually contain underlying outliers. For clustered /longitudinal data with censored observations, the traditional Gehan-type estimator is robust to outliers existing in response but sensitive to outliers in the covariate domain, and it also ignores the within-cluster correlations. To take account of within-cluster correlations, varying cluster sizes, an… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: ready for submission

    MSC Class: 62F35 ACM Class: G.3

  11. arXiv:2002.12586  [pdf, other

    stat.ME

    Nonparametric Empirical Bayes Estimation on Heterogeneous Data

    Authors: Trambak Banerjee, Luella J. Fu, Gareth M. James, Gourab Mukherjee, Wenguang Sun

    Abstract: The simultaneous estimation of many parameters based on data collected from corresponding studies is a key research problem that has received renewed attention in the high-dimensional setting. Many practical situations involve heterogeneous data where heterogeneity is captured by a nuisance parameter. Effectively pooling information across samples while correctly accounting for heterogeneity prese… ▽ More

    Submitted 14 August, 2023; v1 submitted 28 February, 2020; originally announced February 2020.

    Comments: Citations corrected and a new author added. No change in content!

    MSC Class: 62G08; 62G05; 62G20 ACM Class: G.3

  12. arXiv:1911.08784  [pdf

    cs.LG physics.data-an stat.ML

    Deep-seismic-prior-based reconstruction of seismic data using convolutional neural networks

    Authors: Qun Liu, Lihua Fu, Meng Zhang

    Abstract: Reconstruction of seismic data with missing traces is a long-standing issue in seismic data processing. In recent years, rank reduction operations are being commonly utilized to overcome this problem, which require the rank of seismic data to be a prior. However, the rank of field data is unknown; usually it requires much time to manually adjust the rank and just obtain an approximated rank. Metho… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: 5 pages,12 figures

  13. arXiv:1910.08107  [pdf, other

    stat.ME math.ST

    Heterocedasticity-Adjusted Ranking and Thresholding for Large-Scale Multiple Testing

    Authors: Luella Fu, Bowen Gang, Gareth M. James, Wenguang Sun

    Abstract: Standardization has been a widely adopted practice in multiple testing, for it takes into account the variability in sampling and makes the test statistics comparable across different study units. However, despite conventional wisdom to the contrary, we show that there can be a significant loss in information from basing hypothesis tests on standardized statistics rather than the full data. We dev… ▽ More

    Submitted 5 March, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: 55 pages, 13 figures

  14. arXiv:1812.07410  [pdf

    cs.LG eess.SY stat.ML

    An Improved Deep Belief Network Model for Road Safety Analyses

    Authors: Guangyuan Pan, Li** Fu, Lalita Thakali, Matthew Muresan, Ming Yu

    Abstract: Crash prediction is a critical component of road safety analyses. A widely adopted approach to crash prediction is application of regression based techniques. The underlying calibration process is often time-consuming, requiring significant domain knowledge and expertise and cannot be easily automated. This paper introduces a new machine learning (ML) based approach as an alternative to the tradit… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Journal ref: Transportation Research Board 97th Annual Meeting, 2018