-
Quality of Non-Convergent Best Response Processes in Multi-Agent Systems through Sink Equilibrium
Authors:
Rohit Konda,
Rahul Chandan,
Jason Marden
Abstract:
Examining the behavior of multi-agent systems is vitally important to many emerging distributed applications - game theory has emerged as a powerful tool set in which to do so. The main approach of game-theoretic techniques is to model agents as players in a game, and predict the emergent behavior through the relevant Nash equilibrium. The virtue from this viewpoint is that by assuming that self-i…
▽ More
Examining the behavior of multi-agent systems is vitally important to many emerging distributed applications - game theory has emerged as a powerful tool set in which to do so. The main approach of game-theoretic techniques is to model agents as players in a game, and predict the emergent behavior through the relevant Nash equilibrium. The virtue from this viewpoint is that by assuming that self-interested decision-making processes lead to Nash equilibrium, system behavior can then be captured by Nash equilibrium without studying the decision-making processes explicitly. This approach has seen success in a wide variety of domains, such as sensor coverage, traffic networks, auctions, and network coordination. However, in many other problem settings, Nash equilibrium are not necessarily guaranteed to exist or emerge from self-interested processes. Thus the main focus of the paper is on the study of sink equilibrium, which are defined as the attractors of these decision-making processes. By classifying system outcomes through a global objective function, we can analyze the resulting approximation guarantees that sink equilibrium have for a given game. Our main result is an approximation guarantee on the sink equilibrium through defining an introduced metric of misalignment, which captures how uniform agents are in their self-interested decision making. Overall, sink equilibrium are naturally occurring in many multi-agent contexts, and we display our results on their quality with respect to two practical problem settings.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions
Authors:
Sanket Kalwar,
Mihir Ungarala,
Shruti Jain,
Aaron Monis,
Krishna Reddy Konda,
Sourav Garg,
K Madhava Krishna
Abstract:
Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in founda…
▽ More
Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in foundation models. Our proposed $\nabla$HFC image processing block excels particularly in adverse weather conditions, where conventional methods often fall short. Furthermore, we investigate the advantages of jointly training visual and latent prompts, demonstrating that this combined approach significantly enhances performance in out-of-distribution scenarios. Our differentiable visual prompts leverage parallel and series architectures to generate prompts, effectively improving object segmentation tasks in adverse conditions. Through a comprehensive series of experiments and evaluations, we provide empirical evidence to support the efficacy of our approach. Project page at https://diffprompter.github.io.
△ Less
Submitted 26 March, 2024; v1 submitted 6 October, 2023;
originally announced October 2023.
-
Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity
Authors:
Runyu Zhang,
Yuyang Zhang,
Rohit Konda,
Bryce Ferguson,
Jason Marden,
Na Li
Abstract:
This paper studies the finite-time horizon Markov games where the agents' dynamics are decoupled but the rewards can possibly be coupled across agents. The policy class is restricted to local policies where agents make decisions using their local state. We first introduce the notion of smooth Markov games which extends the smoothness argument for normal form games to our setting, and leverage the…
▽ More
This paper studies the finite-time horizon Markov games where the agents' dynamics are decoupled but the rewards can possibly be coupled across agents. The policy class is restricted to local policies where agents make decisions using their local state. We first introduce the notion of smooth Markov games which extends the smoothness argument for normal form games to our setting, and leverage the smoothness property to bound the price of anarchy of the Markov game. For a specific type of Markov game called the Markov potential game, we also develop a distributed learning algorithm, multi-agent soft policy iteration (MA-SPI), which provably converges to a Nash equilibrium. Sample complexity of the algorithm is also provided. Lastly, our results are validated using a dynamic covering game.
△ Less
Submitted 7 April, 2023;
originally announced April 2023.
-
GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions
Authors:
Sanket Kalwar,
Dhruv Patel,
Aakash Aanegola,
Krishna Reddy Konda,
Sourav Garg,
K Madhava Krishna
Abstract:
Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition ima…
▽ More
Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition images such as those captured under fog and low lighting. Our proposed GDIP block learns to enhance images directly through the downstream object detection loss. This is achieved by learning parameters of multiple image pre-processing (IP) techniques that operate concurrently, with their outputs combined using weights learned through a novel gating mechanism. We further improve GDIP through a multi-stage guidance procedure for progressive image enhancement. Finally, trading off accuracy for speed, we propose a variant of GDIP that can be used as a regularizer for training Yolo, which eliminates the need for GDIP-based image enhancement during inference, resulting in higher throughput and plausible real-world deployment. We demonstrate significant improvement in detection performance over several state-of-the-art methods through quantitative and qualitative studies on synthetic datasets such as PascalVOC, and real-world foggy (RTTS) and low-lighting (ExDark) datasets.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Overtwisting and Coiling Highly Enhances Strain Generation of Twisted String Actuators
Authors:
Revanth Konda,
David Bombara,
Jun Zhang
Abstract:
Twisted string actuators (TSAs) have exhibited great promise in robotic applications by generating high translational force with low input torque. To further facilitate their robotic applications, it is strongly desirable but challenging to enhance their consistent strain generation while maintaining compliance. Existing studies predominantly considered overtwisting and coiling after the regular t…
▽ More
Twisted string actuators (TSAs) have exhibited great promise in robotic applications by generating high translational force with low input torque. To further facilitate their robotic applications, it is strongly desirable but challenging to enhance their consistent strain generation while maintaining compliance. Existing studies predominantly considered overtwisting and coiling after the regular twisting stage to be undesirable non-uniform and unpredictable knots, entanglements, and coils formed to create an unstable and failure-prone structure. Overtwisting would work well for TSAs when uniform coils can be consistently formed. In this study, we realize uniform and consistent coil formation in overtwisted TSAs, which greatly increases their strain. Furthermore, we investigate methods for enabling uniform coil formation upon overtwisting the strings in a TSA and present a procedure to systematically "train" the strings. To the authors' best knowledge, this is the first study to experimentally investigate overtwisting for TSAs with different stiffnesses and realize consistent uniform coil formation. Ultra-high molecular-weight polyethylene (UHMWPE) strings form the stiff TSAs whereas compliant TSAs are realized with stretchable and conductive supercoiled polymer (SCP) strings. The strain, force, velocity, and torque of each overtwisted TSA was studied. Overtwisting and coiling resulted in approximately 70% strain in stiff TSAs and approximately 60% strain in compliant TSAs. This is more than twice the strain achieved through regular twisting. Lastly, the overtwisted TSA was successfully demonstrated in a robotic bicep.
△ Less
Submitted 23 September, 2022;
originally announced September 2022.
-
Anthropomorphic Twisted String-Actuated Soft Robotic Gripper with Tendon-Based Stiffening
Authors:
David Bombara,
Revanth Konda,
Steven Swanbeck,
Jun Zhang
Abstract:
Realizing high-performance soft robotic grippers is challenging because of the inherent limitations of the soft actuators and artificial muscles that drive them, including low force output, small actuation range, and poor compactness. Despite advances in this area, realizing compact soft grippers with high dexterity and force output is still challenging. This paper explores twisted string actuator…
▽ More
Realizing high-performance soft robotic grippers is challenging because of the inherent limitations of the soft actuators and artificial muscles that drive them, including low force output, small actuation range, and poor compactness. Despite advances in this area, realizing compact soft grippers with high dexterity and force output is still challenging. This paper explores twisted string actuators (TSAs) to drive a soft robotic gripper. TSAs have been used in numerous robotic applications, but their inclusion in soft robots has been limited. The proposed design of the gripper was inspired by the human hand. Tunable stiffness was implemented in the fingers with antagonistic TSAs. The fingers' bending angles, actuation speed, blocked force output, and stiffness tuning were experimentally characterized. The gripper achieved a score of 6 on the Kapandji test and recreated 31 of the 33 grasps of the Feix GRASP taxonomy. It exhibited a maximum gras** force of 72 N, which was almost 13 times its own weight. A comparison study revealed that the proposed gripper exhibited equivalent or superior performance compared to other similar soft grippers.
△ Less
Submitted 22 September, 2022; v1 submitted 7 July, 2022;
originally announced July 2022.
-
Contextual road lane and symbol generation for autonomous driving
Authors:
Ajay Soni,
Pratik Padamwar,
Krishna Reddy Konda
Abstract:
In this paper we present a novel approach for lane detection and segmentation using generative models. Traditionally discriminative models have been employed to classify pixels semantically on a road. We model the probability distribution of lanes and road symbols by training a generative adversarial network. Based on the learned probability distribution, context-aware lanes and road signs are gen…
▽ More
In this paper we present a novel approach for lane detection and segmentation using generative models. Traditionally discriminative models have been employed to classify pixels semantically on a road. We model the probability distribution of lanes and road symbols by training a generative adversarial network. Based on the learned probability distribution, context-aware lanes and road signs are generated for a given image which are further quantized for nearest class label. Proposed method has been tested on BDD100K and Baidu ApolloScape datasets and performs better than state of the art and exhibits robustness to adverse conditions by generating lanes in faded out and occluded scenarios.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
Execution Order Matters in Greedy Algorithms with Limited Information
Authors:
Rohit Konda,
David Grimsman,
Jason Marden
Abstract:
In this work, we study the multi-agent decision problem where agents try to coordinate to optimize a given system-level objective. While solving for the global optimal is intractable in many cases, the greedy algorithm is a well-studied and efficient way to provide good approximate solutions - notably for submodular optimization problems. Executing the greedy algorithm requires the agents to be or…
▽ More
In this work, we study the multi-agent decision problem where agents try to coordinate to optimize a given system-level objective. While solving for the global optimal is intractable in many cases, the greedy algorithm is a well-studied and efficient way to provide good approximate solutions - notably for submodular optimization problems. Executing the greedy algorithm requires the agents to be ordered and execute a local optimization based on the solutions of the previous agents. However, in limited information settings, passing the solution from the previous agents may be nontrivial, as some agents may not be able to directly communicate with each other. Thus the communication time required to execute the greedy algorithm is closely tied to the order that the agents are given. In this work, we characterize interplay between the communication complexity and agent orderings by showing that the complexity using the best ordering is O(n) and increases considerably to O(n^2) when using the worst ordering. Motivated by this, we also propose an algorithm that can find an ordering and execute the greedy algorithm quickly, in a distributed fashion. We also show that such an execution of the greedy algorithm is advantageous over current methods for distributed submodular maximization.
△ Less
Submitted 30 November, 2022; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Model Free Barrier Functions via Implicit Evading Maneuvers
Authors:
Eric Squires,
Rohit Konda,
Samuel Coogan,
Magnus Egerstedt
Abstract:
This paper demonstrates that the safety override arising from the use of a barrier function can in some cases be needlessly restrictive. In particular, we examine the case of fixed-wing collision avoidance and show that when using a barrier function, there are cases where two fixed-wing aircraft can come closer to colliding than if there were no barrier function at all. In addition, we construct c…
▽ More
This paper demonstrates that the safety override arising from the use of a barrier function can in some cases be needlessly restrictive. In particular, we examine the case of fixed-wing collision avoidance and show that when using a barrier function, there are cases where two fixed-wing aircraft can come closer to colliding than if there were no barrier function at all. In addition, we construct cases where the barrier function labels the system as unsafe even when the vehicles start arbitrarily far apart. In other words, the barrier function ensures safety but with unnecessary costs to performance. We therefore introduce model-free barrier functions which take a data driven approach to creating a barrier function. We demonstrate the effectiveness of model-free barrier functions in a collision avoidance simulation of two fixed-wing aircraft.
△ Less
Submitted 23 September, 2022; v1 submitted 27 July, 2021;
originally announced July 2021.
-
Balancing Asymptotic and Transient Efficiency Guarantees in Set Covering Games
Authors:
Rohit Konda,
Rahul Chandan,
David Grimsman,
Jason R. Marden
Abstract:
Game theoretic approaches have gained traction as robust methodologies for designing distributed local algorithms that induce a desired overall system configuration in multi-agent settings. However, much of the emphasis in these approaches is on providing asymptotic guarantees on the performance of a network of agents, and there is a gap in the study of efficiency guarantees along transients of th…
▽ More
Game theoretic approaches have gained traction as robust methodologies for designing distributed local algorithms that induce a desired overall system configuration in multi-agent settings. However, much of the emphasis in these approaches is on providing asymptotic guarantees on the performance of a network of agents, and there is a gap in the study of efficiency guarantees along transients of these distributed algorithms. Therefore, in this paper, we study the transient efficiency guarantees of a natural game-theoretic algorithm in the class of set covering games, which have been used to model a variety of applications. Our main results characterize the optimal utility design that maximizes the guaranteed efficiency along the transient of the natural dynamics. Furthermore, we characterize the Pareto-optimal frontier with regards to guaranteed efficiency in the transient and the asymptote under a class of game-theoretic designs. Surprisingly, we show that there exists an extreme trade-off between the long-term and short-term guarantees in that an asymptotically optimal game-theoretic design can perform arbitrarily bad in the transient.
△ Less
Submitted 2 November, 2021; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Mission Level Uncertainty in Multi-Agent Resource Allocation
Authors:
Rohit Konda,
Rahul Chandan,
Jason R. Marden
Abstract:
In recent years, a significant research effort has been devoted to the design of distributed protocols for the control of multi-agent systems, as the scale and limited communication bandwidth characteristic of such systems render centralized control impossible. Given the strict operating conditions, it is unlikely that every agent in a multi-agent system will have local information that is consist…
▽ More
In recent years, a significant research effort has been devoted to the design of distributed protocols for the control of multi-agent systems, as the scale and limited communication bandwidth characteristic of such systems render centralized control impossible. Given the strict operating conditions, it is unlikely that every agent in a multi-agent system will have local information that is consistent with the true system state. Yet, the majority of works in the literature assume that agents share perfect knowledge of their environment. This paper focuses on understanding the impact that inconsistencies in agents' local information can have on the performance of multi-agent systems. More specifically, we consider the design of multi-agent operations under a game theoretic lens where individual agents are assigned utilities that guide their local decision making. We provide a tractable procedure for designing utilities that optimize the efficiency of the resulting collective behavior (i.e., price of anarchy) for classes of set covering games where the extent of the information inconsistencies is known. In the setting where the extent of the informational inconsistencies is not known, we show -- perhaps surprisingly -- that underestimating the level of uncertainty leads to better price of anarchy than overestimating it.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
Composition of Safety Constraints For Fixed-Wing Collision Avoidance Amidst Limited Communications
Authors:
Eric Squires,
Pietro Pierpaoli,
Rohit Konda,
Samuel Coogan,
Magnus Egerstedt
Abstract:
This paper considers how to ensure that a system of fixed wing Unmanned Aerial Vehicles (UAVs) can avoid collisions. To do so we develop a novel method for creating a barrier function, which is similar to a Lyapunov function and can be used to ensure that a system can stay safe for all future times. After introducing the general approach, it is shown how to ensure that collision avoidance for two…
▽ More
This paper considers how to ensure that a system of fixed wing Unmanned Aerial Vehicles (UAVs) can avoid collisions. To do so we develop a novel method for creating a barrier function, which is similar to a Lyapunov function and can be used to ensure that a system can stay safe for all future times. After introducing the general approach, it is shown how to ensure that collision avoidance for two vehicles can be guaranteed for all future times. The construction is then extended to the case of arbitrarily many vehicles by addressing how to satisfy multiple safety objectives simultaneously. We do this while ensuring output actuator commands are within specified limits. Because this formulation requires communication of control values and may therefore reduce throughput of other important messages, we then show how to reformulate the solution without this significant communication overhead while still ensuring safety is maintained and actuator limits are respected. We validate the theoretical developments of this paper in the simulator SCRIMMAGE with a simulation of 20 UAVs that maintain safe distances from each other even though their nominal paths would otherwise cause a collision.
△ Less
Submitted 21 July, 2021; v1 submitted 9 June, 2019;
originally announced June 2019.
-
Only sparsity based loss function for learning representations
Authors:
Vivek Bakaraju,
Kishore Reddy Konda
Abstract:
We study the emergence of sparse representations in neural networks. We show that in unsupervised models with regularization, the emergence of sparsity is the result of the input data samples being distributed along highly non-linear or discontinuous manifold. We also derive a similar argument for discriminatively trained networks and present experiments to support this hypothesis. Based on our st…
▽ More
We study the emergence of sparse representations in neural networks. We show that in unsupervised models with regularization, the emergence of sparsity is the result of the input data samples being distributed along highly non-linear or discontinuous manifold. We also derive a similar argument for discriminatively trained networks and present experiments to support this hypothesis. Based on our study of sparsity, we introduce a new loss function which can be used as regularization term for models like autoencoders and MLPs. Further, the same loss function can also be used as a cost function for an unsupervised single-layered neural network model for learning efficient representations.
△ Less
Submitted 7 March, 2019;
originally announced March 2019.
-
Learning to encode motion using spatio-temporal synchrony
Authors:
Kishore Reddy Konda,
Roland Memisevic,
Vincent Michalski
Abstract:
We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing the motion we wish to detect. We show that learning about synchrony is possible using very fast, local learning rules, by introducing multiplicative "gating" in…
▽ More
We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing the motion we wish to detect. We show that learning about synchrony is possible using very fast, local learning rules, by introducing multiplicative "gating" interactions between hidden units across frames. This makes it possible to achieve competitive performance in a wide variety of motion estimation tasks, using a small fraction of the time required to learn features, and to outperform hand-crafted spatio-temporal features by a large margin. We also show how learning about synchrony can be viewed as performing greedy parameter estimation in the well-known motion energy model.
△ Less
Submitted 10 February, 2014; v1 submitted 13 June, 2013;
originally announced June 2013.