Skip to main content

Showing 1–7 of 7 results for author: Bakhshalipour, M

.
  1. arXiv:2309.09212  [pdf, other

    cs.RO

    RobotPerf: An Open-Source, Vendor-Agnostic, Benchmarking Suite for Evaluating Robotics Computing System Performance

    Authors: Víctor Mayoral-Vilches, Jason Jabbour, Yu-Shun Hsiao, Zishen Wan, Martiño Crespo-Álvarez, Matthew Stewart, Juan Manuel Reina-Muñoz, Prateek Nagras, Gaurav Vikhe, Mohammad Bakhshalipour, Martin Pinzger, Stefan Rass, Smruti Panigrahi, Giulio Corradi, Niladri Roy, Phillip B. Gibbons, Sabrina M. Neuman, Brian Plancher, Vijay Janapa Reddi

    Abstract: We introduce RobotPerf, a vendor-agnostic benchmarking suite designed to evaluate robotics computing performance across a diverse range of hardware platforms using ROS 2 as its common baseline. The suite encompasses ROS 2 packages covering the full robotics pipeline and integrates two distinct benchmarking approaches: black-box testing, which measures performance by eliminating upper layers and re… ▽ More

    Submitted 29 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  2. arXiv:2102.06261  [pdf, other

    cs.RO

    Speculative Path Planning

    Authors: Mohammad Bakhshalipour, Mohamad Qadri, Dominic Guri

    Abstract: Parallelization of A* path planning is mostly limited by the number of possible motions, which is far less than the level of parallelism that modern processors support. In this paper, we go beyond the limitations of traditional parallelism of A* and propose Speculative Path Planning to accelerate the search when there are abundant idle resources. The key idea of our approach is predicting future s… ▽ More

    Submitted 14 February, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: 6 pages, 14 figures. Minor changes to better explain the testing scenarios

  3. arXiv:2009.00715  [pdf, other

    cs.AR

    A Survey on Recent Hardware Data Prefetching Approaches with An Emphasis on Servers

    Authors: Mohammad Bakhshalipour, Mehran Shakerinava, Fatemeh Golshan, Ali Ansari, Pejman Lotfi-Karman, Hamid Sarbazi-Azad

    Abstract: Data prefetching, i.e., the act of predicting application's future memory accesses and fetching those that are not in the on-chip caches, is a well-known and widely-used approach to hide the long latency of memory accesses. The fruitfulness of data prefetching is evident to both industry and academy: nowadays, almost every high-performance processor incorporates a few data prefetchers for capturin… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

  4. arXiv:1809.08828  [pdf, other

    cs.AR

    Die-Stacked DRAM: Memory, Cache, or MemCache?

    Authors: Mohammad Bakhshalipour, HamidReza Zare, Pejman Lotfi-Kamran, Hamid Sarbazi-Azad

    Abstract: Die-stacked DRAM is a promising solution for satisfying the ever-increasing memory bandwidth requirements of multi-core processors. Manufacturing technology has enabled stacking several gigabytes of DRAM modules on the active die, thereby providing orders of magnitude higher bandwidth as compared to the conventional DIMM-based DDR memories. Nevertheless, die-stacked DRAM, due to its limited capaci… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

  5. arXiv:1808.05024  [pdf, other

    cs.AR

    Making Belady-Inspired Replacement Policies More Effective Using Expected Hit Count

    Authors: Seyed Armin Vakil Ghahani, Sara Mahdizadeh Shahri, Mohammad Bakhshalipour, Pejman Lotfi-Kamran, Hamid Sarbazi-Azad

    Abstract: Memory-intensive workloads operate on massive amounts of data that cannot be captured by last-level caches (LLCs) of modern processors. Consequently, processors encounter frequent off-chip misses, and hence, lose a significant performance potential. One way to reduce the number of off-chip misses is through using a well-behaved replacement policy in the LLC. Existing processors employ a variation… ▽ More

    Submitted 15 August, 2018; originally announced August 2018.

  6. arXiv:1808.04864  [pdf, other

    cs.AR

    Scale-Out Processors & Energy Efficiency

    Authors: Pouya Esmaili-Dokht, Mohammad Bakhshalipour, Behnam Khodabandeloo, Pejman Lotfi-Kamran, Hamid Sarbazi-Azad

    Abstract: Scale-out workloads like media streaming or Web search serve millions of users and operate on a massive amount of data, and hence, require enormous computational power. As the number of users is increasing and the size of data is expanding, even more computational power is necessary for powering up such workloads. Data centers with thousands of servers are providing the computational power necessa… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

  7. arXiv:1805.07269  [pdf, other

    cs.DC

    Parallelizing Bisection Root-Finding: A Case for Accelerating Serial Algorithms in Multicore Substrates

    Authors: Mohammad Bakhshalipour, Hamid Sarbazi-Azad

    Abstract: Multicore architectures dominate today's processor market. Even though the number of cores and threads are pretty high and continues to grow, inherently serial algorithms do not benefit from the abundance of cores and threads. In this paper, we propose Runahead Computing, a technique which uses idle threads in a multi-threaded architecture for accelerating the execution time of serial algorithms.… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

    Comments: 5 pages, 7 figures