Skip to main content

Showing 1–11 of 11 results for author: Khadka, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04577  [pdf, other

    cs.IR

    Optimizing Nepali PDF Extraction: A Comparative Study of Parser and OCR Technologies

    Authors: Prabin Paudel, Supriya Khadka, Ranju G. C., Rahul Shah

    Abstract: This research compares PDF parsing and Optical Character Recognition (OCR) methods for extracting Nepali content from PDFs. PDF parsing offers fast and accurate extraction but faces challenges with non-Unicode Nepali fonts. OCR, specifically PyTesseract, overcomes these challenges, providing versatility for both digital and scanned PDFs. The study reveals that while PDF parsers are faster, their a… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2407.03469  [pdf

    cs.SE cs.AI

    Scaling Data-Driven Building Energy Modelling using Large Language Models

    Authors: Sunil Khadka, Liang Zhang

    Abstract: Building Management System (BMS) through a data-driven method always faces data and model scalability issues. We propose a methodology to tackle the scalability challenges associated with the development of data-driven models for BMS by using Large Language Models (LLMs). LLMs' code generation adaptability can enable broader adoption of BMS by "automating the automation," particularly the data han… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2010.03694  [pdf, other

    cs.LG cs.AI

    Learning Intrinsic Symbolic Rewards in Reinforcement Learning

    Authors: Hassam Sheikh, Shauharda Khadka, Santiago Miret, Somdeb Majumdar

    Abstract: Learning effective policies for sparse objectives is a key challenge in Deep Reinforcement Learning (RL). A common approach is to design task-related dense rewards to improve task learnability. While such rewards are easily interpreted, they rely on heuristics and domain expertise. Alternate approaches that train neural networks to discover dense surrogate rewards avoid heuristics, but are high-di… ▽ More

    Submitted 9 October, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

  4. arXiv:2007.07298  [pdf, other

    cs.LG cs.AI

    Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning

    Authors: Shauharda Khadka, Estelle Aflalo, Mattias Marder, Avrech Ben-David, Santiago Miret, Shie Mannor, Tamir Hazan, Hanlin Tang, Somdeb Majumdar

    Abstract: For deep neural network accelerators, memory movement is both energetically expensive and can bound computation. Therefore, optimal map** of tensors to memory hierarchies is critical to performance. The growing complexity of neural networks calls for automated memory map** instead of manual heuristic approaches; yet the search space of neural network computational graphs have previously been p… ▽ More

    Submitted 15 October, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Updated manuscript

  5. arXiv:1906.07315  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination

    Authors: Shauharda Khadka, Somdeb Majumdar, Santiago Miret, Stephen McAleer, Kagan Tumer

    Abstract: Many cooperative multiagent reinforcement learning environments provide agents with a sparse team-based reward, as well as a dense agent-specific reward that incentivizes learning basic skills. Training policies solely on the team-based reward is often difficult due to its sparsity. Furthermore, relying solely on the agent-specific reward is sub-optimal because it usually does not capture the team… ▽ More

    Submitted 11 June, 2020; v1 submitted 17 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 108, 2020

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 119, 2020

  6. arXiv:1905.00976  [pdf, other

    cs.LG cs.AI stat.ML

    Collaborative Evolutionary Reinforcement Learning

    Authors: Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer

    Abstract: Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically struggle with achieving effective exploration and are extremely sensitive to the choice of hyperparameters. One reason is that most approaches use a noisy version of their operating policy to explore - thereby limiting the range of exploration. In this pap… ▽ More

    Submitted 6 May, 2019; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: Added link to public Github repo. Minor editorial changes. Order of authors modified to reflect ICML submission

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019

  7. arXiv:1902.02441  [pdf, other

    cs.LG cs.RO stat.ML

    Artificial Intelligence for Prosthetics - challenge solutions

    Authors: Łukasz Kidziński, Carmichael Ong, Sharada Prasanna Mohanty, Jennifer Hicks, Sean F. Carroll, Bo Zhou, Hongsheng Zeng, Fan Wang, Rongzhong Lian, Hao Tian, Wojciech Jaśkowski, Garrett Andersen, Odd Rune Lykkebø, Nihat Engin Toklu, Pranav Shyam, Rupesh Kumar Srivastava, Sergey Kolesnikov, Oleksii Hrinchuk, Anton Pechenko, Mattias Ljungström, Zhen Wang, Xu Hu, Zehong Hu, Minghui Qiu, Jun Huang , et al. (25 additional authors not shown)

    Abstract: In the NeurIPS 2018 Artificial Intelligence for Prosthetics challenge, participants were tasked with building a controller for a musculoskeletal model with a goal of matching a given time-varying velocity vector. Top participants were invited to describe their algorithms. In this work, we describe the challenge and present thirteen solutions that used deep reinforcement learning approaches. Many s… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

  8. arXiv:1805.07917  [pdf, other

    cs.LG cs.NE stat.ML

    Evolution-Guided Policy Gradient in Reinforcement Learning

    Authors: Shauharda Khadka, Kagan Tumer

    Abstract: Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically suffer from three core difficulties: temporal credit assignment with sparse rewards, lack of effective exploration, and brittle convergence properties that are extremely sensitive to hyperparameters. Collectively, these challenges severely limit the… ▽ More

    Submitted 27 October, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montréal, Canada

  9. arXiv:1702.02313  [pdf, other

    cs.AR

    FASHION: Fault-Aware Self-Healing Intelligent On-chip Network

    Authors: Pengju Ren, Michel A. Kinsy, Mengjiao Zhu, Shreeya Khadka, Mihailo Isakov, Aniruddh Ramrakhyani, Tushar Krishna, Nanning Zheng

    Abstract: To avoid packet loss and deadlock scenarios that arise due to faults or power gating in multicore and many-core systems, the network-on-chip needs to possess resilient communication and load-balancing properties. In this work, we introduce the Fashion router, a self-monitoring and self-reconfiguring design that allows for the on-chip network to dynamically adapt to component failures. First, we in… ▽ More

    Submitted 8 February, 2017; originally announced February 2017.

    Comments: 14 pages, 12 figures

  10. arXiv:1608.04680  [pdf

    q-bio.NC cond-mat.dis-nn cs.ET physics.bio-ph

    Synchronization dynamics on the picosecond timescale in coupled Josephson junction neurons

    Authors: Ken Segall, Matthew LeGro, Steven Kaplan, Oleksiy Svitelskiy, Shreeya Khadka, Patrick Crotty, Daniel Schult

    Abstract: Conventional digital computation is rapidly approaching physical limits for speed and energy dissipation. Here we fabricate and test a simple neuromorphic circuit that models neuronal somas, axons and synapses with superconducting Josephson junctions. The circuit models two mutually coupled excitatory neurons. In some regions of parameter space the neurons are desynchronized. In others, the Joseph… ▽ More

    Submitted 8 February, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

    Comments: 13 pages, 8 figures

    Journal ref: Phys. Rev. E 95, 032220 (2017)

  11. arXiv:1209.4608  [pdf

    q-fin.ST cs.CE

    Performance Analysis of Hybrid Forecasting Model In Stock Market Forecasting

    Authors: Mahesh S. Khadka, K. M. George, N. Park, J. B. Kim

    Abstract: This paper presents performance analysis of hybrid model comprise of concordance and Genetic Programming (GP) to forecast financial market with some existing models. This scheme can be used for in depth analysis of stock market. Different measures of concordances such as Kendalls Tau, Ginis Mean Difference, Spearmans Rho, and weak interpretation of concordance are used to search for the pattern in… ▽ More

    Submitted 15 May, 2013; v1 submitted 20 September, 2012; originally announced September 2012.

    Journal ref: International Journal of Managing Information Technology (IJMIT), Vol. 4, No. 3, August 2012