Skip to main content

Showing 1–19 of 19 results for author: Chidambaram, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09649  [pdf, other

    cs.OS

    SquirrelFS: using the Rust compiler to check file-system crash consistency

    Authors: Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram

    Abstract: This work introduces a new approach to building crash-safe file systems for persistent memory. We exploit the fact that Rust's typestate pattern allows compile-time enforcement of a specific order of operations. We introduce a novel crash-consistency mechanism, Synchronous Soft Updates, that boils down crash safety to enforcing ordering among updates to file-system metadata. We employ this approac… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2209.08743  [pdf, other

    cs.DC cs.DB

    DINOMO: An Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory (Extended Version)

    Authors: Sekwon Lee, Soujanya Ponnapalli, Sharad Singhal, Marcos K. Aguilera, Kimberly Keeton, Vijay Chidambaram

    Abstract: We present Dinomo, a novel key-value store for disaggregated persistent memory (DPM). Dinomo is the first key-value store for DPM that simultaneously achieves high common-case performance, scalability, and lightweight online reconfiguration. We observe that previously proposed key-value stores for DPM had architectural limitations that prevent them from achieving all three goals simultaneously. Di… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: This is an extended version of the full paper to appear in PVLDB 15.13 (VLDB 2023)

  3. arXiv:2204.06066  [pdf, other

    cs.OS cs.SE

    Finding and Analyzing Crash-Consistency Bugs in Persistent-Memory File Systems

    Authors: Hayley LeBlanc, Shankara Pailoor, Isil Dillig, James Bornholt, Vijay Chidambaram

    Abstract: We present a study of crash-consistency bugs in persistent-memory (PM) file systems and analyze their implications for file-system design and testing crash consistency. We develop FlyTrap, a framework to test PM file systems for crash-consistency bugs. FlyTrap discovered 18 new bugs across four PM file systems; the bugs have been confirmed by developers and many have been already fixed. The discov… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 16 pages

  4. arXiv:2111.04867  [pdf, other

    cs.DC cs.LG

    TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches

    Authors: Aashaka Shah, Vijay Chidambaram, Meghan Cowan, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Jacob Nelson, Olli Saarikivi, Rachee Singh

    Abstract: Machine learning models are increasingly being trained across multiple GPUs and servers. In this setting, data is transferred between GPUs using communication collectives such as AlltoAll and AllReduce, which can become a significant bottleneck in training large models. Thus, it is important to use efficient algorithms for collective communication. We develop TACCL, a tool that enables algorithm d… ▽ More

    Submitted 5 October, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Accepted at NSDI'23. Contains 20 pages, 11 figures, including Appendix

  5. arXiv:2110.06073  [pdf, other

    cs.DC cs.LG

    Synergy: Resource Sensitive DNN Scheduling in Multi-Tenant Clusters

    Authors: Jayashree Mohan, Amar Phanishayee, Janardhan Kulkarni, Vijay Chidambaram

    Abstract: Training Deep Neural Networks (DNNs) is a widely popular workload in both enterprises and cloud data centers. Existing schedulers for DNN training consider GPU as the dominant resource, and allocate other resources such as CPU and memory proportional to the number of GPUs requested by the job. Unfortunately, these schedulers do not consider the impact of a job's sensitivity to allocation of CPU, m… ▽ More

    Submitted 24 August, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  6. arXiv:2106.03617  [pdf, other

    cs.DC cs.OS

    PAIO: A Software-Defined Storage Data Plane Framework

    Authors: Ricardo Macedo, Yusuke Tanimura, Jason Haga, Vijay Chidambaram, José Pereira, João Paulo

    Abstract: We propose PAIO, the first general-purpose framework that enables system designers to build custom-made Software-Defined Storage (SDS) data plane stages. It provides the means to implement storage optimizations adaptable to different workflows and user-defined policies, and allows straightforward integration with existing applications and I/O layers. PAIO allows stages to be integrated with modern… ▽ More

    Submitted 12 August, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: 15 pages, 8 figures

  7. arXiv:2010.14501  [pdf, other

    cs.LG cs.CV

    Memory Optimization for Deep Networks

    Authors: Aashaka Shah, Chao-Yuan Wu, Jayashree Mohan, Vijay Chidambaram, Philipp Krähenbühl

    Abstract: Deep learning is slowly, but steadily, hitting a memory bottleneck. While the tensor computation in top-of-the-line GPUs increased by 32x over the last five years, the total available memory only grew by 2.5x. This prevents researchers from exploring larger architectures, as training large networks requires more memory for storing intermediate outputs. In this paper, we present MONeT, an automatic… ▽ More

    Submitted 2 April, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: 18 pages, ICLR'21

  8. arXiv:2008.04936  [pdf, other

    cs.CR cs.CY eess.SY

    Towards Software-Defined Data Protection: GDPR Compliance at the Storage Layer is Within Reach

    Authors: Zsolt Istvan, Soujanya Ponnapalli, Vijay Chidambaram

    Abstract: Enforcing data protection and privacy rules within large data processing applications is becoming increasingly important, especially in the light of GDPR and similar regulatory frameworks. Most modern data processing happens on top of a distributed storage layer, and securing this layer against accidental or malicious misuse is crucial to ensuring global privacy guarantees. However, the performanc… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

  9. arXiv:2007.06775  [pdf, other

    cs.DC cs.LG cs.OS

    Analyzing and Mitigating Data Stalls in DNN Training

    Authors: Jayashree Mohan, Amar Phanishayee, Ashish Raniwala, Vijay Chidambaram

    Abstract: Training Deep Neural Networks (DNNs) is resource-intensive and time-consuming. While prior research has explored many different ways of reducing DNN training time, the impact of input data pipeline, i.e., fetching raw data items from storage and performing data pre-processing in memory, has been relatively unexplored. This paper makes the following contributions: (1) We present the first comprehen… ▽ More

    Submitted 19 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

  10. arXiv:1911.00498  [pdf, other

    cs.CY

    GDPR Anti-Patterns: How Design and Operation of Modern Cloud-scale Systems Conflict with GDPR

    Authors: Supreeth Shastri, Melissa Wasserman, Vijay Chidambaram

    Abstract: In recent years, our society is being plagued by unprecedented levels of privacy and security breaches. To rein in this trend, the European Union, in 2018, introduced a comprehensive legislation called the General Data Protection Regulation (GDPR). In this article, we review GDPR from a systems perspective, and identify how the design and operation of modern cloud-scale systems conflict with this… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1903.09305

  11. Understanding and Benchmarking the Impact of GDPR on Database Systems

    Authors: Supreeth Shastri, Vinay Banakar, Melissa Wasserman, Arun Kumar, Vijay Chidambaram

    Abstract: The General Data Protection Regulation (GDPR) provides new rights and protections to European people concerning their personal data. We analyze GDPR from a systems perspective, translating its legal articles into a set of capabilities and characteristics that compliant systems must support. Our analysis reveals the phenomenon of metadata explosion, wherein large quantities of metadata needs to be… ▽ More

    Submitted 16 March, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Journal ref: PVLDB, 13(7): 1064-1077, 2020

  12. arXiv:1909.13670  [pdf, other

    cs.DC cs.DB cs.DS

    RECIPE : Converting Concurrent DRAM Indexes to Persistent-Memory Indexes

    Authors: Se Kwon Lee, Jayashree Mohan, Sanidhya Kashyap, Taesoo Kim, Vijay Chidambaram

    Abstract: We present Recipe, a principled approach for converting concurrent DRAM indexes into crash-consistent indexes for persistent memory (PM). The main insight behind Recipe is that isolation provided by a certain class of concurrent in-memory indexes can be translated with small changes to crash-consistency when the same index is used in PM. We present a set of conditions that enable the identificatio… ▽ More

    Submitted 8 November, 2019; v1 submitted 22 September, 2019; originally announced September 2019.

    Comments: 3pages: Added one more reference

  13. arXiv:1909.11590  [pdf, other

    cs.DC

    Rainblock: Faster Transaction Processing in Public Blockchains

    Authors: Soujanya Ponnapalli, Aashaka Shah, Amy Tai, Souvik Banerjee, Vijay Chidambaram, Dahlia Malkhi, Michael Wei

    Abstract: Public blockchains like Ethereum use Merkle trees to verify transactions received from untrusted servers before applying them to the blockchain. We empirically show that the low throughput of such blockchains is due to the I/O bottleneck associated with using Merkle trees for processing transactions. We present RAINBLOCK, a new architecture for public blockchains that increases throughput without… ▽ More

    Submitted 15 October, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

  14. SplitFS: Reducing Software Overhead in File Systems for Persistent Memory

    Authors: Rohan Kadekodi, Se Kwon Lee, Sanidhya Kashyap, Taesoo Kim, Aasheesh Kolli, Vijay Chidambaram

    Abstract: We present SplitFS, a file system for persistent memory (PM) that reduces software overhead significantly compared to state-of-the-art PM file systems. SplitFS presents a novel split of responsibilities between a user-space library file system and an existing kernel PM file system. The user-space library file system handles data operations by intercepting POSIX calls, memory-map** the underlying… ▽ More

    Submitted 22 September, 2019; originally announced September 2019.

  15. arXiv:1906.12038  [pdf, other

    cs.CY

    Analyzing GDPR Compliance Through the Lens of Privacy Policy

    Authors: Jayashree Mohan, Melissa Wasserman, Vijay Chidambaram

    Abstract: With the arrival of the European Union's General Data Protection Regulation (GDPR), several companies are making significant changes to their systems to achieve compliance. The changes range from modifying privacy policies to redesigning systems which process personal data. This work analyzes the privacy policies of large-scaled cloud services which seek to be GDPR compliant. The privacy policy is… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

  16. arXiv:1903.09305  [pdf, other

    cs.CY cs.CR

    The Seven Sins of Personal-Data Processing Systems under GDPR

    Authors: Supreeth Shastri, Melissa Wasserman, Vijay Chidambaram

    Abstract: In recent years, our society is being plagued by unprecedented levels of privacy and security breaches. To rein in this trend, the European Union, in 2018, introduced a comprehensive legislation called the General Data Protection Regulation (GDPR). In this paper, we review GDPR from a system design perspective, and identify how its regulations conflict with the design, architecture, and operation… ▽ More

    Submitted 15 May, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: Accepted for publication at USENIX HotCloud 2019

  17. arXiv:1903.04880  [pdf, other

    cs.DC cs.DB

    Analyzing the Impact of GDPR on Storage Systems

    Authors: Aashaka Shah, Vinay Banakar, Supreeth Shastri, Melissa Wasserman, Vijay Chidambaram

    Abstract: The recently introduced General Data Protection Regulation (GDPR) is forcing several companies to make significant changes to their systems to achieve compliance. Motivated by the finding that more than 30% of GDPR articles are related to storage, we investigate the impact of GDPR compliance on storage systems. We illustrate the challenges of retrofitting existing systems into compliance by modify… ▽ More

    Submitted 16 May, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

  18. arXiv:1810.02904  [pdf, other

    cs.OS

    Finding Crash-Consistency Bugs with Bounded Black-Box Crash Testing

    Authors: Jayashree Mohan, Ashlie Martinez, Soujanya Ponnapalli, Pandian Raju, Vijay Chidambaram

    Abstract: We present a new approach to testing file-system crash consistency: bounded black-box crash testing (B3). B3 tests the file system in a black-box manner using workloads of file-system operations. Since the space of possible workloads is infinite, B3 bounds this space based on parameters such as the number of file-system operations or which operations to include, and exhaustively generates workload… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

  19. arXiv:1707.08514  [pdf, other

    cs.OS

    Analyzing IO Amplification in Linux File Systems

    Authors: Jayashree Mohan, Rohan Kadekodi, Vijay Chidambaram

    Abstract: We present the first systematic analysis of read, write, and space amplification in Linux file systems. While many researchers are tackling write amplification in key-value stores, IO amplification in file systems has been largely unexplored. We analyze data and metadata operations on five widely-used Linux file systems: ext2, ext4, XFS, btrfs, and F2FS. We find that data operations result in sign… ▽ More

    Submitted 26 July, 2017; originally announced July 2017.