Skip to main content

Showing 1–14 of 14 results for author: Perera, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17227  [pdf, other

    cs.RO

    Learning Generic and Dynamic Locomotion of Humanoids Across Discrete Terrains

    Authors: Shangqun Yu, Nisal Perera, Daniel Marew, Donghyun Kim

    Abstract: This paper addresses the challenge of terrain-adaptive dynamic locomotion in humanoid robots, a problem traditionally tackled by optimization-based methods or reinforcement learning (RL). Optimization-based methods, such as model-predictive control, excel in finding optimal reaction forces and achieving agile locomotion, especially in quadruped, but struggle with the nonlinear hybrid dynamics of l… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.05039  [pdf, other

    cs.RO

    StaccaToe: A Single-Leg Robot that Mimics the Human Leg and Toe

    Authors: Nisal Perera, Shangqun Yu, Daniel Marew, Mack Tang, Ken Suzuki, Aidan McCormack, Shifan Zhu, Yong-Jae Kim, Donghyun Kim

    Abstract: We introduce StaccaToe, a human-scale, electric motor-powered single-leg robot designed to rival the agility of human locomotion through two distinctive attributes: an actuated toe and a co-actuation configuration inspired by the human leg. Leveraging the foundational design of HyperLeg's lower leg mechanism, we develop a stand-alone robot by incorporating new link designs, custom-designed power e… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Submitted to 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  3. arXiv:2403.15721  [pdf, other

    cs.DC

    Design and Implementation of an Analysis Pipeline for Heterogeneous Data

    Authors: Arup Kumar Sarker, Aymen Alsaadi, Niranda Perera, Mills Staylor, Gregor von Laszewski, Matteo Turilli, Ozgur Ozan Kilic, Mikhail Titov, Andre Merzky, Shantenu Jha, Geoffrey Fox

    Abstract: Managing and preparing complex data for deep learning, a prevalent approach in large-scale data science can be challenging. Data transfer for model training also presents difficulties, impacting scientific fields like genomics, climate modeling, and astronomy. A large-scale solution like Google Pathways with a distributed execution environment for deep learning models exists but is proprietary. In… ▽ More

    Submitted 7 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: 14 pages, 16 figures, 2 tables

    ACM Class: H.2.4; D.2.7; D.2.2

  4. arXiv:2307.01394  [pdf, ps, other

    cs.DC cs.AI cs.IR cs.LG

    In-depth Analysis On Parallel Processing Patterns for High-Performance Dataframes

    Authors: Niranda Perera, Arup Kumar Sarker, Mills Staylor, Gregor von Laszewski, Kaiying Shan, Supun Kamburugamuve, Chathura Widanage, Vibhatha Abeykoon, Thejaka Amila Kanewela, Geoffrey Fox

    Abstract: The Data Science domain has expanded monumentally in both research and industry communities during the past decade, predominantly owing to the Big Data revolution. Artificial Intelligence (AI) and Machine Learning (ML) are bringing more complexities to data engineering applications, which are now integrated into data processing pipelines to process terabytes of data. Typically, a significant amoun… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Report number: FGCS-D-23-00577R1

  5. arXiv:2301.07896  [pdf, other

    cs.DC cs.DB

    Supercharging Distributed Computing Environments For High Performance Data Engineering

    Authors: Niranda Perera, Kaiying Shan, Supun Kamburugamuwe, Thejaka Amila Kanewela, Chathura Widanage, Arup Sarker, Mills Staylor, Tianle Zhong, Vibhatha Abeykoon, Geoffrey Fox

    Abstract: The data engineering and data science community has embraced the idea of using Python & R dataframes for regular applications. Driven by the big data revolution and artificial intelligence, these applications are now essential in order to process terabytes of data. They can easily exceed the capabilities of a single machine, but also demand significant developer time & effort. Therefore it is esse… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  6. arXiv:2212.13732  [pdf, ps, other

    cs.DC

    Hybrid Cloud and HPC Approach to High-Performance Dataframes

    Authors: Kaiying Shan, Niranda Perera, Damitha Lenadora, Tianle Zhong, Arup Sarker, Supun Kamburugamuve, Thejaka Amila Kanewela, Chathura Widanage, Geoffrey Fox

    Abstract: Data pre-processing is a fundamental component in any data-driven application. With the increasing complexity of data processing operations and volume of data, Cylon, a distributed dataframe system, is developed to facilitate data processing both as a standalone application and as a library, especially for Python applications. While Cylon shows promising performance results, we experienced difficu… ▽ More

    Submitted 29 December, 2022; v1 submitted 28 December, 2022; originally announced December 2022.

  7. arXiv:2209.12617  [pdf, other

    cs.CL cs.AI

    Evaluation of Question Answering Systems: Complexity of judging a natural language

    Authors: Amer Farea, Zhen Yang, Kien Duong, Nadeesha Perera, Frank Emmert-Streib

    Abstract: Question answering (QA) systems are among the most important and rapidly develo** research topics in natural language processing (NLP). A reason, therefore, is that a QA system allows humans to interact more naturally with a machine, e.g., via a virtual assistant or search engine. In the last decades, many QA systems have been proposed to address the requirements of different question-answering… ▽ More

    Submitted 10 September, 2022; originally announced September 2022.

  8. High Performance Dataframes from Parallel Processing Patterns

    Authors: Niranda Perera, Supun Kamburugamuve, Chathura Widanage, Vibhatha Abeykoon, Ahmet Uyar, Kaiying Shan, Hasara Maithree, Damitha Lenadora, Thejaka Amila Kanewala, Geoffrey Fox

    Abstract: The data science community today has embraced the concept of Dataframes as the de facto standard for data representation and manipulation. Ease of use, massive operator coverage, and popularization of R and Python languages have heavily influenced this transformation. However, most widely used serial Dataframes today (R, pandas) experience performance limitations even while working on even moderat… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Will be presented in PPAM 2022

  9. arXiv:2108.06001  [pdf, other

    cs.DC cs.AI

    HPTMT Parallel Operators for High Performance Data Science & Data Engineering

    Authors: Vibhatha Abeykoon, Supun Kamburugamuve, Chathura Widanage, Niranda Perera, Ahmet Uyar, Thejaka Amila Kanewala, Gregor von Laszewski, Geoffrey Fox

    Abstract: Data-intensive applications are becoming commonplace in all science disciplines. They are comprised of a rich set of sub-domains such as data engineering, deep learning, and machine learning. These applications are built around efficient data abstractions and operators that suit the applications of different domains. Often lack of a clear definition of data structures and operators in the field ha… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  10. arXiv:2107.12807  [pdf, other

    cs.DC cs.AI

    HPTMT: Operator-Based Architecture for Scalable High-Performance Data-Intensive Frameworks

    Authors: Supun Kamburugamuve, Chathura Widanage, Niranda Perera, Vibhatha Abeykoon, Ahmet Uyar, Thejaka Amila Kanewala, Gregor von Laszewski, Geoffrey Fox

    Abstract: Data-intensive applications impact many domains, and their steadily increasing size and complexity demands high-performance, highly usable environments. We integrate a set of ideas developed in various data science and data engineering frameworks. They employ a set of operators on specific data abstractions that include vectors, matrices, tensors, graphs, and tables. Our key concepts are inspired… ▽ More

    Submitted 29 July, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

  11. arXiv:2010.14596  [pdf, other

    cs.DC cs.IR

    A Fast, Scalable, Universal Approach For Distributed Data Aggregations

    Authors: Niranda Perera, Vibhatha Abeykoon, Chathura Widanage, Supun Kamburugamuve, Thejaka Amila Kanewala, Pulasthi Wickramasinghe, Ahmet Uyar, Hasara Maithree, Damitha Lenadora, Geoffrey Fox

    Abstract: In the current era of Big Data, data engineering has transformed into an essential field of study across many branches of science. Advancements in Artificial Intelligence (AI) have broadened the scope of data engineering and opened up new applications in both enterprise and research communities. Aggregations (also termed reduce in functional programming) are an integral functionality in these appl… ▽ More

    Submitted 14 December, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

  12. arXiv:2010.06312  [pdf, other

    cs.DC cs.CY cs.PF cs.SE

    Data Engineering for HPC with Python

    Authors: Vibhatha Abeykoon, Niranda Perera, Chathura Widanage, Supun Kamburugamuve, Thejaka Amila Kanewala, Hasara Maithree, Pulasthi Wickramasinghe, Ahmet Uyar, Geoffrey Fox

    Abstract: Data engineering is becoming an increasingly important part of scientific discoveries with the adoption of deep learning and machine learning. Data engineering deals with a variety of data formats, storage, data extraction, transformation, and data movements. One goal of data engineering is to transform data from original data to vector/matrix/tensor formats accepted by deep learning and machine l… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 9 pages, 11 images, Accepted in 9th Workshop on Python for High-Performance and Scientific Computing (In conjunction with Supercomputing 20)

  13. arXiv:2007.09589  [pdf, other

    cs.DC cs.DB

    High Performance Data Engineering Everywhere

    Authors: Chathura Widanage, Niranda Perera, Vibhatha Abeykoon, Supun Kamburugamuve, Thejaka Amila Kanewala, Hasara Maithree, Pulasthi Wickramasinghe, Ahmet Uyar, Gurhan Gunduz, Geoffrey Fox

    Abstract: The amazing advances being made in the fields of machine and deep learning are a highlight of the Big Data era for both enterprise and research communities. Modern applications require resources beyond a single node's ability to provide. However this is just a small part of the issues facing the overall data processing environment, which must also support a raft of data engineering for pre- and po… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

  14. arXiv:1708.02912  [pdf

    cs.CL cs.IR

    KeyXtract Twitter Model - An Essential Keywords Extraction Model for Twitter Designed using NLP Tools

    Authors: Tharindu Weerasooriya, Nandula Perera, S. R. Liyanage

    Abstract: Since a tweet is limited to 140 characters, it is ambiguous and difficult for traditional Natural Language Processing (NLP) tools to analyse. This research presents KeyXtract which enhances the machine learning based Stanford CoreNLP Part-of-Speech (POS) tagger with the Twitter model to extract essential keywords from a tweet. The system was developed using rule-based parsers and two corpora. The… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

    Comments: 7 Pages, 5 Figures, Proceedings of the 10th KDU International Research Conference