Skip to main content

Showing 1–10 of 10 results for author: Patwary, M M A

.
  1. arXiv:1712.00409  [pdf, other

    cs.LG stat.ML

    Deep Learning Scaling is Predictable, Empirically

    Authors: Joel Hestness, Sharan Narang, Newsha Ardalani, Gregory Diamos, Heewoo Jun, Hassan Kianinejad, Md. Mostofa Ali Patwary, Yang Yang, Yanqi Zhou

    Abstract: Deep learning (DL) creates impactful advances following a virtuous recipe: model architecture search, creating large training data sets, and scaling computation. It is widely believed that growing training sets and models should improve accuracy and result in better products. As DL application domains grow, we would like a deeper understanding of the relationships between training set size, comput… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 19 pages, 11 figures

  2. arXiv:1709.00086  [pdf, other

    astro-ph.CO cs.CE cs.PF

    Galactos: Computing the Anisotropic 3-Point Correlation Function for 2 Billion Galaxies

    Authors: Brian Friesen, Md. Mostofa Ali Patwary, Brian Austin, Nadathur Satish, Zachary Slepian, Narayanan Sundaram, Deborah Bard, Daniel J Eisenstein, Jack Deslippe, Pradeep Dubey, Prabhat

    Abstract: The nature of dark energy and the complete theory of gravity are two central questions currently facing cosmology. A vital tool for addressing them is the 3-point correlation function (3PCF), which probes deviations from a spatially random distribution of galaxies. However, the 3PCF's formidable computational expense has prevented its application to astronomical surveys comprising millions to bill… ▽ More

    Submitted 31 August, 2017; originally announced September 2017.

    Comments: 11 pages, 7 figures, accepted to SuperComputing 2017

  3. PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures

    Authors: Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter Sadowski, Evan Racah, Suren Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey

    Abstract: Computing $k$-Nearest Neighbors (KNN) is one of the core kernels used in many machine learning, data mining and scientific computing applications. Although kd-tree based $O(\log n)$ algorithms have been proposed for computing KNN, due to its inherent sequentiality, linear algorithms are being used in practice. This limits the applicability of such methods to millions of data points, with limited s… ▽ More

    Submitted 27 July, 2016; originally announced July 2016.

    Comments: 11 pages in PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures, Md. Mostofa Ali Patwary et.al., IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2016

  4. arXiv:1606.05973  [pdf, other

    cs.DS cs.CV cs.PF

    A New Parallel Algorithm for Two-Pass Connected Component Labeling

    Authors: Siddharth Gupta, Diana Palsetia, Md. Mostofa Ali Patwary, Ankit Agrawal, Alok Choudhary

    Abstract: Connected Component Labeling (CCL) is an important step in pattern recognition and image processing. It assigns labels to the pixels such that adjacent pixels sharing the same features are assigned the same label. Typically, CCL requires several passes over the data. We focus on two-pass technique where each pixel is given a provisional label in the first pass whereas an actual label is assigned i… ▽ More

    Submitted 20 June, 2016; originally announced June 2016.

    Comments: Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014

  5. arXiv:1503.07241  [pdf, other

    cs.PF cs.DB cs.DC

    GraphMat: High performance graph analytics made productive

    Authors: Narayanan Sundaram, Nadathur Rajagopalan Satish, Md Mostofa Ali Patwary, Subramanya R Dulloor, Satya Gautam Vadlamudi, Dipankar Das, Pradeep Dubey

    Abstract: Given the growing importance of large-scale graph analytics, there is a need to improve the performance of graph analysis frameworks without compromising on productivity. GraphMat is our solution to bridge this gap between a user-friendly graph analytics framework and native, hand-optimized code. GraphMat functions by taking vertex programs and map** them to high performance sparse matrix operat… ▽ More

    Submitted 24 March, 2015; originally announced March 2015.

  6. arXiv:1502.05700  [pdf, other

    stat.ML

    Scalable Bayesian Optimization Using Deep Neural Networks

    Authors: Jasper Snoek, Oren Rippel, Kevin Swersky, Ryan Kiros, Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Prabhat, Ryan P. Adams

    Abstract: Bayesian optimization is an effective methodology for the global optimization of functions with expensive evaluations. It relies on querying a distribution over functions defined by a relatively cheap surrogate model. An accurate model for this distribution over functions is critical to the effectiveness of the approach, and is typically fit using Gaussian processes (GPs). However, since GPs scale… ▽ More

    Submitted 13 July, 2015; v1 submitted 19 February, 2015; originally announced February 2015.

  7. Fast Algorithms for the Maximum Clique Problem on Massive Graphs with Applications to Overlap** Community Detection

    Authors: Bharath Pattabiraman, Md. Mostofa Ali Patwary, Assefaw H. Gebremedhin, Wei-keng Liao, Alok Choudhary

    Abstract: The maximum clique problem is a well known NP-Hard problem with applications in data mining, network analysis, information retrieval and many other areas related to the World Wide Web. There exist several algorithms for the problem with acceptable runtimes for certain classes of graphs, but many of them are infeasible for massive graphs. We present a new exact algorithm that employs novel pruning… ▽ More

    Submitted 26 November, 2014; originally announced November 2014.

    Comments: 28 pages, 7 figures, 10 tables, 2 algorithms. arXiv admin note: substantial text overlap with arXiv:1209.5818

    Journal ref: Internet Mathematics 2014, Special Issue (WAW'13)

  8. arXiv:1302.6256  [pdf, ps, other

    cs.SI cs.DC cs.DM cs.DS physics.soc-ph

    Parallel Maximum Clique Algorithms with Applications to Network Analysis and Storage

    Authors: Ryan A. Rossi, David F. Gleich, Assefaw H. Gebremedhin, Md. Mostofa Ali Patwary

    Abstract: We propose a fast, parallel maximum clique algorithm for large sparse graphs that is designed to exploit characteristics of social and information networks. The method exhibits a roughly linear runtime scaling over real-world networks ranging from 1000 to 100 million nodes. In a test on a social network with 1.8 billion edges, the algorithm finds the largest clique in about 20 minutes. Our method… ▽ More

    Submitted 25 December, 2013; v1 submitted 25 February, 2013; originally announced February 2013.

    Comments: 11 pages

    MSC Class: 05C69 ACM Class: G.2.2

  9. arXiv:1210.5802  [pdf, other

    cs.SI cs.DC cs.DM physics.soc-ph

    What if CLIQUE were fast? Maximum Cliques in Information Networks and Strong Components in Temporal Networks

    Authors: Ryan A. Rossi, David F. Gleich, Assefaw H. Gebremedhin, Md. Mostofa Ali Patwary

    Abstract: Exact maximum clique finders have progressed to the point where we can investigate cliques in million-node social and information networks, as well as find strongly connected components in temporal networks. We use one such finder to study a large collection of modern networks emanating from biological, social, and technological domains. We show inter-relationships between maximum cliques and seve… ▽ More

    Submitted 30 October, 2012; v1 submitted 22 October, 2012; originally announced October 2012.

    MSC Class: 05C69; 05C85; 91D30 ACM Class: G.2.2; H.2.8

  10. arXiv:1209.5818  [pdf, other

    cs.DS cs.IR

    Fast Algorithms for the Maximum Clique Problem on Massive Sparse Graphs

    Authors: Bharath Pattabiraman, Md. Mostofa Ali Patwary, Assefaw H. Gebremedhin, Wei-keng Liao, Alok Choudhary

    Abstract: The maximum clique problem is a well known NP-Hard problem with applications in data mining, network analysis, informatics, and many other areas. Although there exist several algorithms with acceptable runtimes for certain classes of graphs, many of them are infeasible for massive graphs. We present a new exact algorithm that employs novel pruning techniques to very quickly find maximum cliques in… ▽ More

    Submitted 14 November, 2012; v1 submitted 25 September, 2012; originally announced September 2012.

    Comments: 15 pages (including 2-page appendix), 5 tables, 4 figures

    ACM Class: G.2.2