Skip to main content

Showing 1–9 of 9 results for author: Bilaniuk, O

Searching in archive cs. Search in all archives.
.
  1. BARVINN: Arbitrary Precision DNN Accelerator Controlled by a RISC-V CPU

    Authors: Mohammadhossein Askarihemmat, Sean Wagner, Olexa Bilaniuk, Yassine Hariri, Yvon Savaria, Jean-Pierre David

    Abstract: We present a DNN accelerator that allows inference at arbitrary precision with dedicated processing elements that are configurable at the bit level. Our DNN accelerator has 8 Processing Elements controlled by a RISC-V controller with a combined 8.2 TMACs of computational power when implemented with the recent Alveo U250 FPGA platform. We develop a code generator tool that ingests CNN models in ONN… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: 7 pages. Accepted for publication in the 2023, 28th Asia and South Pacific Design Automation Conference (ASP-DAC 2023)

    ACM Class: C.1.3; C.3

  2. arXiv:2109.02429  [pdf, other

    stat.ML cs.LG

    Learning Neural Causal Models with Active Interventions

    Authors: Nino Scherrer, Olexa Bilaniuk, Yashas Annadani, Anirudh Goyal, Patrick Schwab, Bernhard Schölkopf, Michael C. Mozer, Yoshua Bengio, Stefan Bauer, Nan Rosemary Ke

    Abstract: Discovering causal structures from data is a challenging inference problem of fundamental importance in all areas of science. The appealing properties of neural networks have recently led to a surge of interest in differentiable neural network-based methods for learning causal structures from data. So far, differentiable causal discovery has focused on static datasets of observational or fixed int… ▽ More

    Submitted 5 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  3. arXiv:2010.16004  [pdf, other

    cs.CY cs.LG cs.MA cs.SI

    COVI-AgentSim: an Agent-based Model for Evaluating Methods of Digital Contact Tracing

    Authors: Prateek Gupta, Tegan Maharaj, Martin Weiss, Nasim Rahaman, Hannah Alsdurf, Abhinav Sharma, Nanor Minoyan, Soren Harnois-Leblanc, Victor Schmidt, Pierre-Luc St. Charles, Tristan Deleu, Andrew Williams, Akshay Patel, Meng Qu, Olexa Bilaniuk, Gaétan Marceau Caron, Pierre Luc Carrier, Satya Ortiz-Gagné, Marc-Andre Rousseau, David Buckeridge, Joumana Ghosn, Yang Zhang, Bernhard Schölkopf, Jian Tang, Irina Rish , et al. (4 additional authors not shown)

    Abstract: The rapid global spread of COVID-19 has led to an unprecedented demand for effective methods to mitigate the spread of the disease, and various digital contact tracing (DCT) methods have emerged as a component of the solution. In order to make informed public health choices, there is a need for tools which allow evaluation and comparison of DCT methods. We introduce an agent-based compartmental si… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

  4. arXiv:1910.01075  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Neural Causal Models from Unknown Interventions

    Authors: Nan Rosemary Ke, Olexa Bilaniuk, Anirudh Goyal, Stefan Bauer, Hugo Larochelle, Bernhard Schölkopf, Michael C. Mozer, Chris Pal, Yoshua Bengio

    Abstract: Promising results have driven a recent surge of interest in continuous optimization methods for Bayesian network structure learning from observational data. However, there are theoretical limitations on the identifiability of underlying structures obtained from observational data alone. Interventional data provides much richer information about the underlying data-generating process. However, the… ▽ More

    Submitted 23 August, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

  5. arXiv:1901.10912  [pdf, other

    cs.LG stat.ML

    A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms

    Authors: Yoshua Bengio, Tristan Deleu, Nasim Rahaman, Rosemary Ke, Sébastien Lachapelle, Olexa Bilaniuk, Anirudh Goyal, Christopher Pal

    Abstract: We propose to meta-learn causal structures based on how fast a learner adapts to new distributions arising from sparse distributional changes, e.g. due to interventions, actions of agents and other sources of non-stationarities. We show that under this assumption, the correct causal structural choices lead to faster adaptation to modified distributions because the changes are concentrated in one o… ▽ More

    Submitted 4 February, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

  6. arXiv:1809.03702  [pdf, other

    cs.LG stat.ML

    Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding

    Authors: Nan Rosemary Ke, Anirudh Goyal, Olexa Bilaniuk, Jonathan Binas, Michael C. Mozer, Chris Pal, Yoshua Bengio

    Abstract: Learning long-term dependencies in extended temporal sequences requires credit assignment to events far back in the past. The most common method for training recurrent neural networks, back-propagation through time (BPTT), requires credit information to be propagated backwards through every single step of the forward computation, potentially over thousands or millions of time steps. This becomes c… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: To appear as a Spotlight presentation at NIPS 2018

  7. arXiv:1711.02326  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks

    Authors: Nan Rosemary Ke, Anirudh Goyal, Olexa Bilaniuk, Jonathan Binas, Laurent Charlin, Chris Pal, Yoshua Bengio

    Abstract: A major drawback of backpropagation through time (BPTT) is the difficulty of learning long-term dependencies, coming from having to propagate credit information backwards through every single step of the forward computation. This makes BPTT both computationally impractical and biologically implausible. For this reason, full backpropagation through time is rarely used on long sequences, and truncat… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

  8. arXiv:1705.09792  [pdf, other

    cs.NE cs.LG

    Deep Complex Networks

    Authors: Chiheb Trabelsi, Olexa Bilaniuk, Ying Zhang, Dmitriy Serdyuk, Sandeep Subramanian, João Felipe Santos, Soroush Mehri, Negar Rostamzadeh, Yoshua Bengio, Christopher J Pal

    Abstract: At present, the vast majority of building blocks, techniques, and architectures for deep learning are based on real-valued operations and representations. However, recent work on recurrent neural networks and older fundamental theoretical analysis suggests that complex numbers could have a richer representational capacity and could also facilitate noise-robust memory retrieval mechanisms. Despite… ▽ More

    Submitted 25 February, 2018; v1 submitted 27 May, 2017; originally announced May 2017.

  9. arXiv:1606.01651  [pdf, other

    cs.LG cs.NE q-bio.NC

    Feedforward Initialization for Fast Inference of Deep Generative Networks is biologically plausible

    Authors: Yoshua Bengio, Benjamin Scellier, Olexa Bilaniuk, Joao Sacramento, Walter Senn

    Abstract: We consider deep multi-layered generative models such as Boltzmann machines or Hopfield nets in which computation (which implements inference) is both recurrent and stochastic, but where the recurrence is not to model sequential structure, only to perform computation. We find conditions under which a simple feedforward computation is a very good initialization for inference, after the input units… ▽ More

    Submitted 27 June, 2016; v1 submitted 6 June, 2016; originally announced June 2016.