Skip to main content

Showing 1–2 of 2 results for author: Prakriya, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.06067  [pdf, other

    cs.CL cs.LG

    HMT: Hierarchical Memory Transformer for Long Context Language Processing

    Authors: Zifan He, Zongyue Qin, Neha Prakriya, Yizhou Sun, Jason Cong

    Abstract: Transformer-based large language models (LLM) have been widely used in language processing applications. However, most of them restrict the context window that permits the model to attend to every token in the inputs. Previous works in recurrent models can memorize past tokens to enable unlimited context and maintain effectiveness. However, they have "flat" memory architectures, which have limitat… ▽ More

    Submitted 14 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  2. arXiv:2311.10189  [pdf, other

    cs.DC cs.AR

    TAPA-CS: Enabling Scalable Accelerator Design on Distributed HBM-FPGAs

    Authors: Neha Prakriya, Yuze Chi, Suhail Basalama, Linghao Song, Jason Cong

    Abstract: Despite the increasing adoption of Field-Programmable Gate Arrays (FPGAs) in compute clouds, there remains a significant gap in programming tools and abstractions which can leverage network-connected, cloud-scale, multi-die FPGAs to generate accelerators with high frequency and throughput. To this end, we propose TAPA-CS, a task-parallel dataflow programming framework which automatically partition… ▽ More

    Submitted 1 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.