-
Quick-Sort Style Approximation Algorithms for Generalizations of Feedback Vertex Set in Tournaments
Authors:
Sushmita Gupta,
Sounak Modak,
Saket Saurabh,
Sanjay Seetharaman
Abstract:
A feedback vertex set (FVS) in a digraph is a subset of vertices whose removal makes the digraph acyclic. In other words, it hits all cycles in the digraph. Lokshtanov et al. [TALG '21] gave a factor 2 randomized approximation algorithm for finding a minimum weight FVS in tournaments. We generalize the result by presenting a factor $2α$ randomized approximation algorithm for finding a minimum weig…
▽ More
A feedback vertex set (FVS) in a digraph is a subset of vertices whose removal makes the digraph acyclic. In other words, it hits all cycles in the digraph. Lokshtanov et al. [TALG '21] gave a factor 2 randomized approximation algorithm for finding a minimum weight FVS in tournaments. We generalize the result by presenting a factor $2α$ randomized approximation algorithm for finding a minimum weight FVS in digraphs of independence number $α$; a generalization of tournaments which are digraphs with independence number $1$. Using the same framework, we present a factor $2$ randomized approximation algorithm for finding a minimum weight Subset FVS in tournaments: given a vertex subset $S$ in addition to the graph, find a subset of vertices that hits all cycles containing at least one vertex in $S$. Note that FVS in tournaments is a special case of Subset FVS in tournaments in which $S = V(T)$.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Zyxin is all you need: machine learning adherent cell mechanics
Authors:
Matthew S. Schmitt,
Jonathan Colen,
Stefano Sala,
John Devany,
Shailaja Seetharaman,
Margaret L. Gardel,
Patrick W. Oakes,
Vincenzo Vitelli
Abstract:
Cellular form and function emerge from complex mechanochemical systems within the cytoplasm. No systematic strategy currently exists to infer large-scale physical properties of a cell from its many molecular components. This is a significant obstacle to understanding biophysical processes such as cell adhesion and migration. Here, we develop a data-driven biophysical modeling approach to learn the…
▽ More
Cellular form and function emerge from complex mechanochemical systems within the cytoplasm. No systematic strategy currently exists to infer large-scale physical properties of a cell from its many molecular components. This is a significant obstacle to understanding biophysical processes such as cell adhesion and migration. Here, we develop a data-driven biophysical modeling approach to learn the mechanical behavior of adherent cells. We first train neural networks to predict forces generated by adherent cells from images of cytoskeletal proteins. Strikingly, experimental images of a single focal adhesion protein, such as zyxin, are sufficient to predict forces and generalize to unseen biological regimes. This protein field alone contains enough information to yield accurate predictions even if forces themselves are generated by many interacting proteins. We next develop two approaches - one explicitly constrained by physics, the other more agnostic - that help construct data-driven continuum models of cellular forces using this single focal adhesion field. Both strategies consistently reveal that cellular forces are encoded by two different length scales in adhesion protein distributions. Beyond adherent cell mechanics, our work serves as a case study for how to integrate neural networks in the construction of predictive phenomenological models in cell biology, even when little knowledge of the underlying microscopic mechanisms exist.
△ Less
Submitted 28 February, 2023;
originally announced March 2023.
-
More Effort Towards Multiagent Knapsack
Authors:
Sushmita Gupta,
Pallavi Jain,
Sanjay Seetharaman
Abstract:
In this paper, we study some multiagent variants of the knapsack problem. Fluschnik et al. [AAAI 2019] considered the model in which every agent assigns some utility to every item. They studied three preference aggregation rules for finding a subset (knapsack) of items: individually best, diverse, and Nash-welfare-based. Informally, diversity is achieved by satisfying as many voters as possible. M…
▽ More
In this paper, we study some multiagent variants of the knapsack problem. Fluschnik et al. [AAAI 2019] considered the model in which every agent assigns some utility to every item. They studied three preference aggregation rules for finding a subset (knapsack) of items: individually best, diverse, and Nash-welfare-based. Informally, diversity is achieved by satisfying as many voters as possible. Motivated by the application of aggregation operators in multiwinner elections, we extend the study from diverse aggregation rule to Median and Best scoring functions. We study the computational and parameterized complexity of the problem with respect to some natural parameters, namely, the number of voters, the number of items, and the distance from an easy instance. We also study the complexity of the problem under domain restrictions. Furthermore, we present significantly faster parameterized algorithms with respect to the number of voters for the diverse aggregation rule.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
Gradient-based Data Subversion Attack Against Binary Classifiers
Authors:
Rosni K Vasu,
Sanjay Seetharaman,
Shubham Malaviya,
Manish Shukla,
Sachin Lodha
Abstract:
Machine learning based data-driven technologies have shown impressive performances in a variety of application domains. Most enterprises use data from multiple sources to provide quality applications. The reliability of the external data sources raises concerns for the security of the machine learning techniques adopted. An attacker can tamper the training or test datasets to subvert the predictio…
▽ More
Machine learning based data-driven technologies have shown impressive performances in a variety of application domains. Most enterprises use data from multiple sources to provide quality applications. The reliability of the external data sources raises concerns for the security of the machine learning techniques adopted. An attacker can tamper the training or test datasets to subvert the predictions of models generated by these techniques. Data poisoning is one such attack wherein the attacker tries to degrade the performance of a classifier by manipulating the training data.
In this work, we focus on label contamination attack in which an attacker poisons the labels of data to compromise the functionality of the system. We develop Gradient-based Data Subversion strategies to achieve model degradation under the assumption that the attacker has limited-knowledge of the victim model. We exploit the gradients of a differentiable convex loss function (residual errors) with respect to the predicted label as a warm-start and formulate different strategies to find a set of data instances to contaminate. Further, we analyze the transferability of attacks and the susceptibility of binary classifiers. Our experiments show that the proposed approach outperforms the baselines and is computationally efficient.
△ Less
Submitted 31 May, 2021;
originally announced May 2021.
-
Influence Based Defense Against Data Poisoning Attacks in Online Learning
Authors:
Sanjay Seetharaman,
Shubham Malaviya,
Rosni KV,
Manish Shukla,
Sachin Lodha
Abstract:
Data poisoning is a type of adversarial attack on training data where an attacker manipulates a fraction of data to degrade the performance of machine learning model. Therefore, applications that rely on external data-sources for training data are at a significantly higher risk. There are several known defensive mechanisms that can help in mitigating the threat from such attacks. For example, data…
▽ More
Data poisoning is a type of adversarial attack on training data where an attacker manipulates a fraction of data to degrade the performance of machine learning model. Therefore, applications that rely on external data-sources for training data are at a significantly higher risk. There are several known defensive mechanisms that can help in mitigating the threat from such attacks. For example, data sanitization is a popular defensive mechanism wherein the learner rejects those data points that are sufficiently far from the set of training instances. Prior work on data poisoning defense primarily focused on offline setting, wherein all the data is assumed to be available for analysis. Defensive measures for online learning, where data points arrive sequentially, have not garnered similar interest.
In this work, we propose a defense mechanism to minimize the degradation caused by the poisoned training data on a learner's model in an online setup. Our proposed method utilizes an influence function which is a classic technique in robust statistics. Further, we supplement it with the existing data sanitization methods for filtering out some of the poisoned data points. We study the effectiveness of our defense mechanism on multiple datasets and across multiple attack strategies against an online learner.
△ Less
Submitted 24 April, 2021;
originally announced April 2021.
-
Absorption and emission modulation in MoS2-GaN (0001) heterostructure by interface phonon-exciton coupling
Authors:
Yuba Poudel,
Jagoda Slawinska,
Priya Gopal,
Sairaman Seetharaman,
Zachariah Hennighausen,
Swastik Kar,
Francis Dsouza,
Marco Buongiorno Nardelli,
Arup Neogi
Abstract:
Semiconductor heterostructures based on layered two-dimensional transition metal dichalcogenides (TMD) interfaced to gallium nitride (GaN) are excellent material systems to realize broadband light emitters and absorbers. The surface properties of the polar semiconductor, such as GaN are dominated by interface phonons, thus the optical properties of the vertical heterostructure depend strongly on t…
▽ More
Semiconductor heterostructures based on layered two-dimensional transition metal dichalcogenides (TMD) interfaced to gallium nitride (GaN) are excellent material systems to realize broadband light emitters and absorbers. The surface properties of the polar semiconductor, such as GaN are dominated by interface phonons, thus the optical properties of the vertical heterostructure depend strongly on the interface exciton-phonon coupling. The origin and activation of different Raman modes in the heterostructure due to coupling between interfacial phonons and optically generated carriers in a monolayer MoS2-GaN (0001) heterostructure was observed. This coupling strongly influences the non-equilibrium absorption properties of MoS2 and the emission properties of both semiconductors. Density functional theory (DFT) calculations were performed to study the band alignment of the interface, which revealed a type-I heterostructure. The optical excitation with interband transition in MoS2 at K-point strongly modulates the C excitonic band in MoS2. The overlap of absorption and emission bands of GaN with the absorption bands of MoS2 induces the energy and charge transfer across the interface with an optical excitation at Γ-point. A strong modulation of the excitonic absorption states is observed in MoS2 on GaN substrate with transient optical pump-probe spectroscopy. The interaction of carriers with phonons and defect states leads to the enhanced and blue shifted emission in MoS2 on GaN substrate. Our results demonstrate the relevance of interface coupling between phonons and carriers for the development of optical and electronic applications.
△ Less
Submitted 10 June, 2019;
originally announced June 2019.
-
Survivability in IP over WDM networks
Authors:
Kulathumani Vinodkrishnan,
Nikhil Chandhok,
Arjan Durresi,
Raj Jain,
Ramesh Jagannathan,
Srinivasan Seetharaman
Abstract:
The Internet is emerging as the new universal telecommunication medium. IP over WDM has been envisioned as one of the most attractive architectures for the new Internet. Consequently survivability is a crucial concern in designing IP over WDM networks. This paper presents a survey of the survivability mechanisms for IP over WDM networks and thus is intended to provide a summary of what has been do…
▽ More
The Internet is emerging as the new universal telecommunication medium. IP over WDM has been envisioned as one of the most attractive architectures for the new Internet. Consequently survivability is a crucial concern in designing IP over WDM networks. This paper presents a survey of the survivability mechanisms for IP over WDM networks and thus is intended to provide a summary of what has been done in this area and help further research. A number of optical layer protection techniques have been discussed. They are examined from the point of view of cost, complexity, and application. Survivability techniques are being made available at multiple layers of the network. This paper also studies the recovery features of each network layer and explains the impact of interaction between layers on survivability. The advantages and issues of multi-layer survivability have been identified. The main idea is that the optical layer can provide fast protection while the higher layers can provide intelligent restoration. With this idea in mind, a new scheme of carrying IP over WDM using MPLS or Multi Protocol Lambda-Switching has been discussed. Finally, an architecture is suggested by means of which the optical layer can perform an automatic protection switch, with priority considerations with the help of signaling from the higher layers.
△ Less
Submitted 25 March, 2016;
originally announced March 2016.
-
Exploiting the adaptation dynamics to predict the distribution of beneficial fitness effects
Authors:
Sona John,
Sarada Seetharaman
Abstract:
Adaptation of asexual populations is driven by beneficial mutations and therefore the dynamics of this process, besides other factors, depend on the distribution of beneficial fitness effects. It is known that on uncorrelated fitness landscapes, this distribution can only be of three types: truncated, exponential and power law. We performed extensive stochastic simulations to study the adaptation…
▽ More
Adaptation of asexual populations is driven by beneficial mutations and therefore the dynamics of this process, besides other factors, depend on the distribution of beneficial fitness effects. It is known that on uncorrelated fitness landscapes, this distribution can only be of three types: truncated, exponential and power law. We performed extensive stochastic simulations to study the adaptation dynamics on rugged fitness landscapes, and identified two quantities that can be used to distinguish the underlying distribution of beneficial fitness effects. The first quantity studied here is the fitness difference between successive mutations that spread in the population, which is found to decrease in the case of truncated distributions, remain nearly a constant for exponentially decaying distributions and increase when the fitness distribution decays as a power law. The second quantity of interest, namely, the rate of change of fitness with time also shows quantitatively different behaviour for different beneficial fitness distributions. The patterns displayed by the two aforementioned quantities are found to hold for both low and high mutation rates. We discuss how these patterns can be exploited to determine the distribution of beneficial fitness effects in microbial experiments.
△ Less
Submitted 25 December, 2015; v1 submitted 26 February, 2015;
originally announced March 2015.
-
Length of adaptive walk on uncorrelated and correlated fitness landscapes
Authors:
Sarada Seetharaman,
Kavita Jain
Abstract:
We consider the adaptation dynamics of an asexual population that walks uphill on a rugged fitness landscape which is endowed with large number of local fitness peaks. We work in a parameter regime where only those mutants that are single mutation away are accessible, as a result of which the population eventually gets trapped at a local fitness maximum and the adaptive walk terminates. We study h…
▽ More
We consider the adaptation dynamics of an asexual population that walks uphill on a rugged fitness landscape which is endowed with large number of local fitness peaks. We work in a parameter regime where only those mutants that are single mutation away are accessible, as a result of which the population eventually gets trapped at a local fitness maximum and the adaptive walk terminates. We study how the number of adaptive steps taken by the population before reaching a local fitness peak depends on the initial fitness of the population, the extreme value distribution of the beneficial mutations and correlations amongst the fitnesses. Assuming that the relative fitness difference between successive steps is small, we analytically calculate the average walk length for both uncorrelated and correlated fitnesses in all extreme value domains for a given initial fitness. We present numerical results for the model where the fitness differences can be large, and find that the walk length behavior differs from that in the former model in the Fréchet domain of extreme value theory. We also discuss the relevance of our results to microbial experiments.
△ Less
Submitted 19 August, 2014; v1 submitted 26 June, 2014;
originally announced June 2014.
-
Adaptive walks and distribution of beneficial fitness effects
Authors:
Sarada Seetharaman,
Kavita Jain
Abstract:
We study the adaptation dynamics of a maladapted asexual population on rugged fitness landscapes with many local fitness peaks. The distribution of beneficial fitness effects is assumed to belong to one of the three extreme value domains, viz. Weibull, Gumbel and Fr{é}chet. We work in the strong selection-weak mutation regime in which beneficial mutations fix sequentially, and the population perfo…
▽ More
We study the adaptation dynamics of a maladapted asexual population on rugged fitness landscapes with many local fitness peaks. The distribution of beneficial fitness effects is assumed to belong to one of the three extreme value domains, viz. Weibull, Gumbel and Fr{é}chet. We work in the strong selection-weak mutation regime in which beneficial mutations fix sequentially, and the population performs an uphill walk on the fitness landscape until a local fitness peak is reached. A striking prediction of our analysis is that the fitness difference between successive steps follows a pattern of diminishing returns in the Weibull domain and accelerating returns in the Fr{é}chet domain, as the initial fitness of the population is increased. These trends are found to be robust with respect to fitness correlations. We believe that this result can be exploited in experiments to determine the extreme value domain of the distribution of beneficial fitness effects. Our work here differs significantly from the previous ones that assume the selection coefficient to be small. On taking large effect mutations into account, we find that the length of the walk shows different qualitative trends from those derived using small selection coefficient approximation.
△ Less
Submitted 29 October, 2013; v1 submitted 8 January, 2013;
originally announced January 2013.
-
Multiple adaptive substitutions during evolution in novel environments
Authors:
Kavita Jain,
Sarada Seetharaman
Abstract:
We consider an asexual population under strong selection-weak mutation conditions evolving on rugged fitness landscapes with many local fitness peaks. Unlike the previous studies in which the initial fitness of the population is assumed to be high, here we start the adaptation process with a low fitness corresponding to a population in a stressful novel environment. For generic fitness distributio…
▽ More
We consider an asexual population under strong selection-weak mutation conditions evolving on rugged fitness landscapes with many local fitness peaks. Unlike the previous studies in which the initial fitness of the population is assumed to be high, here we start the adaptation process with a low fitness corresponding to a population in a stressful novel environment. For generic fitness distributions, using an analytic argument we find that the average number of steps to a local optimum varies logarithmically with the genotype sequence length and increases as the correlations amongst genotypic fitnesses increase. When the fitnesses are exponentially or uniformly distributed, using an evolution equation for the distribution of population fitness, we analytically calculate the fitness distribution of fixed beneficial mutations and the walk length distribution.
△ Less
Submitted 2 September, 2011; v1 submitted 29 April, 2011;
originally announced April 2011.
-
Nonlinear deterministic equations in biological evolution
Authors:
Kavita Jain,
Sarada Seetharaman
Abstract:
We review models of biological evolution in which the population frequency changes deterministically with time. If the population is self-replicating, although the equations for simple prototypes can be linearised, nonlinear equations arise in many complex situations. For sexual populations, even in the simplest setting, the equations are necessarily nonlinear due to the mixing of the parental gen…
▽ More
We review models of biological evolution in which the population frequency changes deterministically with time. If the population is self-replicating, although the equations for simple prototypes can be linearised, nonlinear equations arise in many complex situations. For sexual populations, even in the simplest setting, the equations are necessarily nonlinear due to the mixing of the parental genetic material. The solutions of such nonlinear equations display interesting features such as multiple equilibria and phase transitions. We mainly discuss those models for which an analytical understanding of such nonlinear equations is available.
△ Less
Submitted 12 April, 2011; v1 submitted 1 March, 2011;
originally announced March 2011.
-
Evolutionary dynamics on strongly correlated fitness landscapes
Authors:
Sarada Seetharaman,
Kavita Jain
Abstract:
We study the evolutionary dynamics of a maladapted population of self-replicating sequences on strongly correlated fitness landscapes. Each sequence is assumed to be composed of blocks of equal length and its fitness is given by a linear combination of four independent block fitnesses. A mutation affects the fitness contribution of a single block leaving the other blocks unchanged and hence induci…
▽ More
We study the evolutionary dynamics of a maladapted population of self-replicating sequences on strongly correlated fitness landscapes. Each sequence is assumed to be composed of blocks of equal length and its fitness is given by a linear combination of four independent block fitnesses. A mutation affects the fitness contribution of a single block leaving the other blocks unchanged and hence inducing correlations between the parent and mutant fitness. On such strongly correlated fitness landscapes, we calculate the dynamical properties like the number of jumps in the most populated sequence and the temporal distribution of the last jump which is shown to exhibit a inverse square dependence as in evolution on uncorrelated fitness landscapes. We also obtain exact results for the distribution of records and extremes for correlated random variables.
△ Less
Submitted 8 September, 2010; v1 submitted 30 April, 2010;
originally announced April 2010.