Showing 1–50 of 52 results for author: Long, P

  1. arXiv:2404.12241  [pdf, other]

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller, et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2404.05870  [pdf, other]

    cs.RO

    CoBT: Collaborative Programming of Behaviour Trees from One Demonstration for Robot Manipulation

    Authors: Aayush Jain, Philip Long, Valeria Villani, John D. Kelleher, Maria Chiara Leva

    Abstract: Mass customization and shorter manufacturing cycles are becoming more important among small and medium-sized companies. However, classical industrial robots struggle to cope with product variation and dynamic environments. In this paper, we present CoBT, a collaborative programming by demonstration framework for generating reactive and modular behavior trees. CoBT relies on a single demonstration… ▽ More

    Submitted 10 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at IEEE ICRA 2024

  3. arXiv:2401.15687  [pdf, other]

    cs.CV cs.GR

    Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

    Authors: Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xu

    Abstract: The synthesis of 3D facial animations from speech has garnered considerable attention. Due to the scarcity of high-quality 4D facial data and well-annotated abundant multi-modality labels, previous methods often suffer from limited realism and a lack of flexible conditioning. We address this challenge through a trilogy. We first introduce Generalized Neural Parametric Facial Asset (GNPFA), an effic… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Project Page: https://sites.google.com/view/media2face

  4. arXiv:2312.01661  [pdf, other]

    cs.CL cs.AI

    ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

    Authors: Phuoc Pham Van Long, Duc Anh Vu, Nhat M. Hoang, Xuan Long Do, Anh Tuan Luu

    Abstract: Mathematical questioning is crucial for assessing students' problem-solving skills. Since manually creating such questions requires substantial effort, automatic methods have been explored. Existing state-of-the-art models rely on fine-tuning strategies and struggle to generate questions that heavily involve multiple steps of logical and arithmetic reasoning. Meanwhile, large language models (LLMs)… ▽ More

    Submitted 27 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted at the 39th ACM/SIGAPP Symposium On Applied Computing (SAC 2024), Main Conference

  5. arXiv:2309.12488  [pdf, other]

    cs.LG cs.NE stat.ML

    Sharpness-Aware Minimization and the Edge of Stability

    Authors: Philip M. Long, Peter L. Bartlett

    Abstract: Recent experiments have shown that, often, when training a neural network with gradient descent (GD) with a step size $η$, the operator norm of the Hessian of the loss grows until it approximately reaches $2/η$, after which it fluctuates around this value. The quantity $2/η$ has been called the "edge of stability" based on consideration of a local quadratic approximation of the loss. We perform a… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  6. arXiv:2307.01481  [pdf, other]

    cs.SE quant-ph

    Equivalence, Identity, and Unitarity Checking in Black-Box Testing of Quantum Programs

    Authors: Peixun Long, Jianjun Zhao

    Abstract: Quantum programs exhibit inherent non-deterministic behavior, which poses more significant challenges for error discovery compared to classical programs. While several testing methods have been proposed for quantum programs, they often overlook fundamental questions in black-box testing. In this paper, we bridge this gap by presenting three novel algorithms specifically designed to address the cha… ▽ More

    Submitted 24 May, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 23 pages

  7. arXiv:2306.17407  [pdf, other]

    cs.SE quant-ph

    Testing Multi-Subroutine Quantum Programs: From Unit Testing to Integration Testing

    Authors: Peixun Long, Jianjun Zhao

    Abstract: Quantum computing has emerged as a promising field with the potential to revolutionize various domains by harnessing the principles of quantum mechanics. As quantum hardware and algorithms continue to advance, developing high-quality quantum software has become crucial. However, testing quantum programs poses unique challenges due to the distinctive characteristics of quantum systems and the compl… ▽ More

    Submitted 24 May, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 62 pages

  8. Prediction, Learning, Uniform Convergence, and Scale-sensitive Dimensions

    Authors: Peter L. Bartlett, Philip M. Long

    Abstract: We present a new general-purpose algorithm for learning classes of $[0,1]$-valued functions in a generalization of the prediction model, and prove a general upper bound on the expected absolute error of this algorithm in terms of a scale-sensitive generalization of the Vapnik dimension proposed by Alon, Ben-David, Cesa-Bianchi and Haussler. We give lower bounds implying that our upper bounds canno… ▽ More

    Submitted 24 April, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: One header page, a three page correction, then a 28 page original paper

  9. arXiv:2210.09540  [pdf, other]

    cs.RO

    Contact-Implicit Planning and Control for Non-Prehensile Manipulation Using State-Triggered Constraints

    Authors: Maozhen Wang, Aykut Ozgun Onol, Philip Long, Taskin Padir

    Abstract: We present a contact-implicit planning approach that can generate contact-interaction trajectories for non-prehensile manipulation problems without tuning or a tailored initial guess and with high success rates. This is achieved by leveraging the concept of state-triggered constraints (STCs) to capture the hybrid dynamics induced by discrete contact modes without explicitly reasoning about the com… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 16 pages, The International Symposium on Robotics Research 2022

  10. arXiv:2210.01513  [pdf, other]

    cs.LG math.OC stat.ML

    The Dynamics of Sharpness-Aware Minimization: Bouncing Across Ravines and Drifting Towards Wide Minima

    Authors: Peter L. Bartlett, Philip M. Long, Olivier Bousquet

    Abstract: We consider Sharpness-Aware Minimization (SAM), a gradient-based optimization method for deep networks that has exhibited performance improvements on image and language prediction problems. We show that when SAM is applied with a convex quadratic objective, for most random initializations it converges to a cycle that oscillates between either side of the minimum in the direction with the largest c… ▽ More

    Submitted 11 April, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

  11. arXiv:2209.09315  [pdf, other]

    cs.LG cs.AI math.ST stat.ML

    Deep Linear Networks can Benignly Overfit when Shallow Ones Do

    Authors: Niladri S. Chatterji, Philip M. Long

    Abstract: We bound the excess risk of interpolating deep linear networks trained using gradient flow. In a setting previously used to establish risk bounds for the minimum $\ell_2$-norm interpolant, we show that randomly initialized deep linear networks can closely approximate or even match known bounds for the minimum $\ell_2$-norm interpolant. Our analysis also reveals that interpolating deep linear model… ▽ More

    Submitted 6 February, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

  12. arXiv:2208.09206  [pdf, other]

    cs.SE

    Testing Quantum Programs with Multiple Subroutines

    Authors: Peixun Long, Jianjun Zhao

    Abstract: Errors in quantum programs are challenging to track down due to the uncertainty of quantum programs. Testing is, therefore, an indispensable method for assuring the quality of quantum software. Existing testing methods focus only on testing quantum programs with quantum circuits or single subroutines and, therefore, cannot effectively test quantum programs with multi-subroutines. In this paper, we… ▽ More

    Submitted 26 February, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: 14 pages, 9 figures

  13. arXiv:2203.03365  [pdf]

    stat.ML cs.CY cs.LG

    Machine learning using longitudinal prescription and medical claims for the detection of nonalcoholic steatohepatitis (NASH)

    Authors: Ozge Yasar, Patrick Long, Brett Harder, Hanna Marshall, Sanjay Bhasin, Suyin Lee, Mark Delegge, Stephanie Roy, Orla Doyle, Nadea Leavitt, John Rigg

    Abstract: Objectives To develop and evaluate machine learning models to detect suspected undiagnosed nonalcoholic steatohepatitis (NASH) patients for diagnostic screening and clinical management. Methods In this retrospective observational noninterventional study using administrative medical claims data from 1,463,089 patients, gradient-boosted decision trees were trained to detect likely NASH patients fr… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 22 pages, 4 figures

  14. arXiv:2112.04590  [pdf, ps, other]

    cs.LG stat.ML

    The perils of being unhinged: On the accuracy of classifiers minimizing a noise-robust convex loss

    Authors: Philip M. Long, Rocco A. Servedio

    Abstract: Van Rooyen et al. introduced a notion of convex loss functions being robust to random classification noise, and established that the "unhinged" loss function is robust in this sense. In this note we study the accuracy of binary classifiers obtained by minimizing the unhinged loss, and observe that even for simple linearly separable data distributions, minimizing the unhinged loss may only yield a… ▽ More

    Submitted 4 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  15. arXiv:2110.02914  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Foolish Crowds Support Benign Overfitting

    Authors: Niladri S. Chatterji, Philip M. Long

    Abstract: We prove a lower bound on the excess risk of sparse interpolating procedures for linear regression with Gaussian data in the overparameterized regime. We apply this result to obtain a lower bound for basis pursuit (the minimum $\ell_1$-norm interpolant) that implies that its excess risk can converge at an exponentially slower rate than OLS (the minimum $\ell_2$-norm interpolant), even when the gro… ▽ More

    Submitted 17 March, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

  16. arXiv:2108.11489  [pdf, ps, other]

    stat.ML cs.LG math.ST

    The Interplay Between Implicit Bias and Benign Overfitting in Two-Layer Linear Networks

    Authors: Niladri S. Chatterji, Philip M. Long, Peter L. Bartlett

    Abstract: The recent success of neural network models has shone light on a rather surprising statistical phenomenon: statistical models that perfectly fit noisy data can generalize well to unseen test data. Understanding this phenomenon of $\textit{benign overfitting}$ has attracted intense theoretical and empirical study. In this paper, we consider interpolating two-layer linear neural networks trained wit… ▽ More

    Submitted 9 September, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at JMLR

  17. arXiv:2106.01921  [pdf, ps, other]

    stat.ML cs.LG stat.AP

    Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

    Authors: James P. Long, Min Jin Ha

    Abstract: Causal models are notoriously difficult to validate because they make untestable assumptions regarding confounding. New scientific experiments offer the possibility of evaluating causal models using prediction performance. Prediction performance measures are typically robust to violations in causal assumptions. However, prediction performance does depend on the selection of training and test sets.… ▽ More

    Submitted 26 October, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: 12 pages, 4 figures, 2 tables

  18. arXiv:2105.10585  [pdf, ps, other]

    cs.LG cs.AI cs.NE

    Properties of the After Kernel

    Authors: Philip M. Long

    Abstract: The Neural Tangent Kernel (NTK) is the wide-network limit of a kernel defined using neural networks at initialization, whose embedding is the gradient of the output of the network with respect to its parameters. We study the "after kernel", which is defined using the same embedding, except after training, for neural networks with standard architectures, on binary classification problems extracted… ▽ More

    Submitted 13 December, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  19. arXiv:2102.04998  [pdf, ps, other]

    stat.ML cs.AI cs.LG math.OC

    When does gradient descent with logistic loss interpolate using deep networks with smoothed ReLU activations?

    Authors: Niladri S. Chatterji, Philip M. Long, Peter L. Bartlett

    Abstract: We establish conditions under which gradient descent applied to fixed-width deep networks drives the logistic loss to zero, and prove bounds on the rate of convergence. Our analysis applies for smoothed approximations to the ReLU, such as Swish and the Huberized ReLU, proposed in previous applied work. We provide two sufficient conditions for convergence. The first is simply a bound on the loss at… ▽ More

    Submitted 1 July, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

  20. arXiv:2012.02409  [pdf, other]

    stat.ML cs.LG math.OC

    When does gradient descent with logistic loss find interpolating two-layer networks?

    Authors: Niladri S. Chatterji, Philip M. Long, Peter L. Bartlett

    Abstract: We study the training of finite-width two-layer smoothed ReLU networks for binary classification using the logistic loss. We show that gradient descent drives the training loss to zero if the initial loss is small enough. When the data satisfies certain cluster and separation conditions and the network is wide enough, we show that one step of gradient descent reduces the loss sufficiently that the… ▽ More

    Submitted 1 July, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

  21. arXiv:2011.04848  [pdf, other]

    cs.RO

    AES: Autonomous Excavator System for Real-World and Hazardous Environments

    Authors: Jinxin Zhao, Pinxin Long, Liyang Wang, Lingfeng Qian, Feixiang Lu, Xibin Song, Dinesh Manocha, Liangjun Zhang

    Abstract: Excavators are widely used for material-handling applications in unstructured environments, including mining and construction. The size of the global market of excavators is 44.12 Billion USD in 2018 and is predicted to grow to 63.14 Billion USD by 2026. Operating excavators in a real-world environment can be challenging due to extreme conditions and rock sliding, ground collapse, or exceeding dus… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

  22. arXiv:2010.14038  [pdf, other]

    cs.RO

    Optimization-Based Framework for Excavation Trajectory Generation

    Authors: Yajue Yang, Pinxin Long, Jia Pan, Xinbin Song, Liangjun Zhang

    Abstract: In this paper, we present a novel optimization-based framework for autonomous excavator trajectory generation under various objectives, including minimum joint displacement and minimum time. Traditional methods on excavation trajectory generation usually separate the excavation motion into a sequence of fixed phases, resulting in limited trajectory searching space. Our framework explores the space… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

  23. arXiv:2010.08479  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Failures of model-dependent generalization bounds for least-norm interpolation

    Authors: Peter L. Bartlett, Philip M. Long

    Abstract: We consider bounds on the generalization performance of the least-norm linear regressor, in the over-parameterized regime where it can interpolate the data. We describe a sense in which any generalization bound of a type that is commonly proved in statistical learning theory must sometimes be very loose when applied to analyze the least-norm interpolant. In particular, for a variety of natural joi… ▽ More

    Submitted 20 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Journal ref: JMLR, 22(204):1-15, 2021

  24. arXiv:2006.06176  [pdf, other]

    cs.RO

    Tuning-Free Contact-Implicit Trajectory Optimization

    Authors: Aykut Ozgun Onol, Radu Corcodel, Philip Long, Taskin Padir

    Abstract: We present a contact-implicit trajectory optimization framework that can plan contact-interaction trajectories for different robot architectures and tasks using a trivial initial guess and without requiring any parameter tuning. This is achieved by using a relaxed contact model along with an automatic penalty adjustment loop for suppressing the relaxation. Moreover, the structure of the problem en… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2020

  25. arXiv:2006.00811  [pdf, other]

    cs.RO

    Time Variable Minimum Torque Trajectory Optimization for Autonomous Excavator

    Authors: Yajue Yang, Jia Pan, Pinxin Long, Xibin Song, Liangjun Zhang

    Abstract: In this paper, we present a minimal torque and time variable trajectory optimization method for autonomous excavator considering the soil-tool interaction. The method formulates the excavation motion generation as a trajectory optimization problem and takes into account geometric, kinematic and dynamics constraints. To generate time-efficient trajectory and improve the overall optimization efficie… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

  26. arXiv:2004.12019  [pdf, other]

    stat.ML cs.LG math.ST

    Finite-sample Analysis of Interpolating Linear Classifiers in the Overparameterized Regime

    Authors: Niladri S. Chatterji, Philip M. Long

    Abstract: We prove bounds on the population risk of the maximum margin algorithm for two-class linear classification. For linearly separable training data, the maximum margin algorithm has been shown in previous work to be equivalent to a limit of training with logistic loss using gradient descent, as the training error is driven to zero. We analyze this algorithm applied to random data including misclassif… ▽ More

    Submitted 1 June, 2021; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Corrected typographical errors from the previous version of this paper

  27. arXiv:2003.01094  [pdf, other]

    cs.LG math.OC stat.ML

    On the Global Convergence of Training Deep Linear ResNets

    Authors: Difan Zou, Philip M. Long, Quanquan Gu

    Abstract: We study the convergence of gradient descent (GD) and stochastic gradient descent (SGD) for training $L$-hidden-layer linear residual networks (ResNets). We prove that for training deep residual networks with certain linear transformations at input and output layers, which are fixed throughout training, both GD and SGD with zero initialization on all hidden weights can converge to the global minim… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 26 pages, 1 figure. In ICLR 2020

  28. arXiv:2002.00291  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Oracle Lower Bounds for Stochastic Gradient Sampling Algorithms

    Authors: Niladri S. Chatterji, Peter L. Bartlett, Philip M. Long

    Abstract: We consider the problem of sampling from a strongly log-concave density in $\mathbb{R}^d$, and prove an information theoretic lower bound on the number of stochastic gradient queries of the log density needed. Several popular sampling algorithms (including many Markov chain Monte Carlo methods) operate by using stochastic gradients of the log density to generate a sample; our results establish an… ▽ More

    Submitted 3 July, 2021; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: 21 pages; accepted for publication at Bernoulli

  29. arXiv:1910.09998  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning Resilient Behaviors for Navigation Under Uncertainty

    Authors: Tingxiang Fan, Pinxin Long, Wenxi Liu, Jia Pan, Ruigang Yang, Dinesh Manocha

    Abstract: Deep reinforcement learning has great potential to acquire complex, adaptive behaviors for autonomous agents automatically. However, the underlying neural network policies have not been widely deployed in real-world applications, especially in safety-critical tasks (e.g., autonomous driving). One of the reasons is that the learned policy cannot perform flexible and resilient behaviors as trad… ▽ More

    Submitted 3 June, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: accepted to ICRA 2020

  30. arXiv:1906.11300  [pdf, other]

    stat.ML cs.LG math.ST

    Benign Overfitting in Linear Regression

    Authors: Peter L. Bartlett, Philip M. Long, Gábor Lugosi, Alexander Tsigler

    Abstract: The phenomenon of benign overfitting is one of the key mysteries uncovered by deep learning methodology: deep neural networks seem to predict well, even with a perfect fit to noisy training data. Motivated by this phenomenon, we consider when a perfect fit to training data in linear regression is compatible with accurate prediction. We give a characterization of linear regression problems for whic… ▽ More

    Submitted 29 January, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

  31. arXiv:1905.12600  [pdf, other]

    cs.LG cs.AI cs.NE math.ST stat.ML

    Generalization bounds for deep convolutional neural networks

    Authors: Philip M. Long, Hanie Sedghi

    Abstract: We prove bounds on the generalization error of convolutional networks. The bounds are in terms of the training loss, the number of parameters, the Lipschitz constant of the loss and the distance from the weights to the initial weights. They are independent of the number of pixels in the input, and the height and width of hidden feature maps. We present experiments using CIFAR-10 with varying hyper… ▽ More

    Submitted 8 April, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: Published as a conference paper at ICLR 2020

  32. arXiv:1901.02104  [pdf, other]

    cs.LG cs.AI cs.NE math.ST stat.ML

    On the effect of the activation function on the distribution of hidden nodes in a deep network

    Authors: Philip M. Long, Hanie Sedghi

    Abstract: We analyze the joint probability distribution on the lengths of the vectors of hidden variables in different layers of a fully connected deep network, when the weights and biases are chosen randomly according to Gaussian distributions, and the input is in $\{ -1, 1\}^N$. We show that, if the activation function $φ$ satisfies a minimal set of assumptions, satisfied by all activation functions that… ▽ More

    Submitted 7 January, 2019; originally announced January 2019.

  33. arXiv:1811.03744  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Density estimation for shift-invariant multidimensional distributions

    Authors: Anindya De, Philip M. Long, Rocco A. Servedio

    Abstract: We study density estimation for classes of shift-invariant distributions over $\mathbb{R}^d$. A multidimensional distribution is "shift-invariant" if, roughly speaking, it is close in total variation distance to a small shift of it in any direction. Shift-invariance relaxes smoothness assumptions commonly used in non-parametric density estimation to allow jump discontinuities. The different classe… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: Appears in the Proceedings of ITCS 2019

  34. Contact-Implicit Trajectory Optimization Based on a Variable Smooth Contact Model and Successive Convexification

    Authors: Aykut Ozgun Onol, Philip Long, Taskin Padir

    Abstract: In this paper, we propose a contact-implicit trajectory optimization (CITO) method based on a variable smooth contact model (VSCM) and successive convexification (SCvx). The VSCM facilitates the convergence of gradient-based optimization without compromising physical fidelity. On the other hand, the proposed SCvx-based approach combines the advantages of direct and shooting methods for CITO. For e… ▽ More

    Submitted 4 March, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: Accepted for publication in ICRA 2019

  35. arXiv:1810.00352  [pdf, other]

    cs.RO

    Getting Robots Unfrozen and Unlost in Dense Pedestrian Crowds

    Authors: Tingxiang Fan, Xinjing Cheng, Jia Pan, Pinxin Long, Wenxi Liu, Ruigang Yang, Dinesh Manocha

    Abstract: We aim to enable a mobile robot to navigate through environments with dense crowds, e.g., shopping malls, canteens, train stations, or airport terminals. In these challenging environments, existing approaches suffer from two common problems: the robot may get frozen and cannot make any progress toward its goal, or it may get lost due to severe occlusions inside a crowd. Here we propose a navigatio… ▽ More

    Submitted 30 September, 2018; originally announced October 2018.

  36. arXiv:1808.03841  [pdf, other]

    cs.RO

    Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios

    Authors: Tingxiang Fan, Pinxin Long, Wenxi Liu, Jia Pan

    Abstract: In this paper, we present a decentralized sensor-level collision avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent's steering commands in terms of the movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we pre… ▽ More

    Submitted 11 August, 2018; originally announced August 2018.

  37. arXiv:1807.07013  [pdf, ps, other]

    cs.DS cs.LG math.ST

    Learning Sums of Independent Random Variables with Sparse Collective Support

    Authors: Anindya De, Philip M. Long, Rocco A. Servedio

    Abstract: We study the learnability of sums of independent integer random variables given a bound on the size of the union of their supports. For $\mathcal{A} \subset \mathbf{Z}_{+}$, a sum of independent random variables with collective support $\mathcal{A}$ (called an $\mathcal{A}$-sum in this paper) is a distribution $\mathbf{S} = \mathbf{X}_1 + \cdots + \mathbf{X}_N$ where the $\mathbf{X}_i$'s are mutu… ▽ More

    Submitted 12 November, 2020; v1 submitted 18 July, 2018; originally announced July 2018.

    Comments: Conference version in FOCS'18; Journal version to appear in JMLR

  38. arXiv:1807.04814  [pdf]

    cs.RO

    Integrating Risk in Humanoid Robot Control for Applications in the Nuclear Industry

    Authors: Xianchao Long, Philip Long, Aykut Onol, Taskin Padir

    Abstract: This paper discusses the integration of risk into a robot control framework for decommissioning applications in the nuclear industry. Our overall objective is to allow the robot to evaluate a risk associated with several methods of completing the same task by combining a set of action sequences. If the environment is known and in the absence of sensing errors each set of actions would successfully c… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 9 pages, 6 figures

    Journal ref: WM2018 Conference, March 18-22, 2018, Phoenix, Arizona, USA

  39. arXiv:1807.04198  [pdf]

    cs.RO

    Using Contact to Increase Robot Performance for Glovebox D&D Tasks

    Authors: Aykut Onol, Philip Long, Taskin Padir

    Abstract: Glovebox decommissioning tasks usually require manipulating relatively heavy objects in a highly constrained environment. Thus, contact with the surroundings becomes inevitable. In order to allow the robot to interact with the environment in a natural way, we present a contact-implicit motion planning framework. This framework enables the system, without the specification in advance of a contact p… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: 11 pages, 5 figures; Accepted for publication in Waste Management Symposia 2018

  40. A Comparative Analysis of Contact Models in Trajectory Optimization for Manipulation

    Authors: Aykut Ozgun Onol, Philip Long, Taskin Padir

    Abstract: In this paper, we analyze the effects of contact models on contact-implicit trajectory optimization for manipulation. We consider three different approaches: (1) a contact model that is based on complementarity constraints, (2) a smooth contact model, and our proposed method (3) a variable smooth contact model. We compare these models in simulation in terms of physical accuracy, quality of motions… ▽ More

    Submitted 30 July, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: 6 pages, 7 figures, 4 tables, IROS 2018 camera-ready version

  41. arXiv:1805.10408  [pdf, other]

    cs.LG cs.AI stat.ML

    The Singular Values of Convolutional Layers

    Authors: Hanie Sedghi, Vineet Gupta, Philip M. Long

    Abstract: We characterize the singular values of the linear transformation associated with a standard 2D multi-channel convolutional layer, enabling their efficient computation. This characterization also leads to an algorithm for projecting a convolutional layer onto an operator-norm ball. We show that this is an effective regularizer; for example, it improves the test error of a deep residual network usin… ▽ More

    Submitted 5 March, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: Published as a conference paper at ICLR 2019

  42. arXiv:1804.05012  [pdf, ps, other]

    cs.LG cs.AI cs.NE math.ST stat.ML

    Representing smooth functions as compositions of near-identity functions with implications for deep network optimization

    Authors: Peter L. Bartlett, Steven N. Evans, Philip M. Long

    Abstract: We show that any smooth bi-Lipschitz $h$ can be represented exactly as a composition $h_m \circ ... \circ h_1$ of functions $h_1,...,h_m$ that are close to the identity in the sense that each $\left(h_i-\mathrm{Id}\right)$ is Lipschitz, and the Lipschitz constant decreases inversely with the number $m$ of functions composed. This implies that $h$ can be represented to any accuracy by a deep residu… ▽ More

    Submitted 16 April, 2018; v1 submitted 13 April, 2018; originally announced April 2018.

  43. arXiv:1802.06093  [pdf, ps, other]

    cs.LG cs.NE math.OC math.ST stat.ML

    Gradient descent with identity initialization efficiently learns positive definite linear transformations by deep residual networks

    Authors: Peter L. Bartlett, David P. Helmbold, Philip M. Long

    Abstract: We analyze algorithms for approximating a function $f(x) = Φx$ mapping $\Re^d$ to $\Re^d$ using deep linear neural networks, i.e. that learn a function $h$ parameterized by matrices $Θ_1,...,Θ_L$ and defined by $h(x) = Θ_L Θ_{L-1} ... Θ_1 x$. We focus on algorithms that learn through gradient descent on the population quadratic loss in the case that the distribution over the inputs is isotropic.…

    Submitted 18 June, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

  44. arXiv:1709.10082  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

    Authors: Pinxin Long, Tingxiang Fan, Xinyi Liao, Wenxi Liu, Hao Zhang, Jia Pan

    Abstract: Developing a safe and efficient collision avoidance policy for multiple robots is challenging in decentralized scenarios where each robot generates its paths without observing other robots' states and intents. While other distributed multi-robot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computational…

    Submitted 20 May, 2018; v1 submitted 28 September, 2017; originally announced September 2017.

  45. arXiv:1709.09574  [pdf, ps, other]

    cs.DS

    Fillable arrays with constant time operations and a single bit of redundancy

    Authors: Jacob Teo Por Loong, Jelani Nelson, Huacheng Yu

    Abstract: In the fillable array problem one must maintain an array A[1..n] of $w$-bit entries subject to random access reads and writes, and also a $\texttt{fill}(Δ)$ operation which sets every entry of A to some $Δ\in\{0,\ldots,2^w-1\}$. We show that with just one bit of redundancy, i.e. a data structure using $nw+1$ bits of memory, $\texttt{read}/\texttt{fill}$ can be implemented in worst case constant time…

    Submitted 27 September, 2017; originally announced September 2017.

  46. arXiv:1609.06838  [pdf, other]

    cs.AI cs.CV cs.RO

    Deep-Learned Collision Avoidance Policy for Distributed Multi-Agent Navigation

    Authors: Pinxin Long, Wenxi Liu, Jia Pan

    Abstract: High-speed, low-latency obstacle avoidance that is insensitive to sensor noise is essential for enabling multiple decentralized robots to function reliably in cluttered and dynamic environments. While other distributed multi-agent collision avoidance systems exist, these systems require online geometric optimization where tedious parameter tuning and perfect sensing are necessary. We present a n…

    Submitted 6 July, 2017; v1 submitted 22 September, 2016; originally announced September 2016.

    Journal ref: IEEE Robotics and Automation Letters 2(2): 656-663 (2017)

  47. arXiv:1603.06317  [pdf, other]

    cs.RO

    DoraPicker: An Autonomous Picking System for General Objects

    Authors: Hao Zhang, Pinxin Long, Dandan Zhou, Zhongfeng Qian, Zheng Wang, Weiwei Wan, Dinesh Manocha, Chonhyon Park, Tommy Hu, Chao Cao, Yibo Chen, Marco Chow, Jia Pan

    Abstract: Robots that autonomously manipulate objects within warehouses have the potential to shorten the package delivery time and improve the efficiency of the e-commerce industry. In this paper, we present a robotic system that is capable of both picking and placing general objects in warehouse scenarios. Given a target object, the robot autonomously detects it from a shelf or a table and estimates its f…

    Submitted 20 March, 2016; originally announced March 2016.

    Comments: 10 pages, 10 figures

  48. arXiv:1602.04484  [pdf, ps, other]

    cs.LG cs.AI cs.NE math.ST stat.ML

    Surprising properties of dropout in deep networks

    Authors: David P. Helmbold, Philip M. Long

    Abstract: We analyze dropout in deep networks with rectified linear units and the quadratic loss. Our results expose surprising differences between the behavior of dropout and more traditional regularizers like weight decay. For example, on some simple data sets dropout training produces negative weights even though the output is the sum of the inputs. This provides a counterpoint to the suggestion that dro…

    Submitted 19 April, 2017; v1 submitted 14 February, 2016; originally announced February 2016.

  49. arXiv:1412.4736  [pdf, other]

    cs.LG cs.AI cs.NE math.ST stat.ML

    On the Inductive Bias of Dropout

    Authors: David P. Helmbold, Philip M. Long

    Abstract: Dropout is a simple but effective technique for learning in neural networks and other settings. A sound theoretical understanding of dropout is needed to determine when dropout should be applied and how to use it most effectively. In this paper we continue the exploration of dropout as a regularizer pioneered by Wager et al. We focus on linear classification where a convex proxy to the misclassif…

    Submitted 17 February, 2015; v1 submitted 15 December, 2014; originally announced December 2014.

    Journal ref: Journal of Machine Learning Research, 16, 3403-3454 (2015). (See http://jmlr.org/papers/volume16/helmbold15a/helmbold15a.pdf.)

  50. arXiv:1307.8371  [pdf, ps, other]

    cs.LG cs.CC cs.DS stat.ML

    The Power of Localization for Efficiently Learning Linear Separators with Noise

    Authors: Pranjal Awasthi, Maria Florina Balcan, Philip M. Long

    Abstract: We introduce a new approach for designing computationally efficient learning algorithms that are tolerant to noise, and demonstrate its effectiveness by designing algorithms with improved noise tolerance guarantees for learning linear separators. We consider both the malicious noise model and the adversarial label noise model. For malicious noise, where the adversary can corrupt both the label a…

    Submitted 3 June, 2018; v1 submitted 31 July, 2013; originally announced July 2013.

    Comments: Contains improved label complexity analysis communicated to us by Steve Hanneke

    ACM Class: F.2