Search | arXiv e-print repository

arXiv:2405.20426 [pdf, ps, other]

Quality of Non-Convergent Best Response Processes in Multi-Agent Systems through Sink Equilibrium

Authors: Rohit Konda, Rahul Chandan, Jason Marden

Abstract: Examining the behavior of multi-agent systems is vitally important to many emerging distributed applications - game theory has emerged as a powerful tool set in which to do so. The main approach of game-theoretic techniques is to model agents as players in a game, and predict the emergent behavior through the relevant Nash equilibrium. The virtue from this viewpoint is that by assuming that self-i… ▽ More Examining the behavior of multi-agent systems is vitally important to many emerging distributed applications - game theory has emerged as a powerful tool set in which to do so. The main approach of game-theoretic techniques is to model agents as players in a game, and predict the emergent behavior through the relevant Nash equilibrium. The virtue from this viewpoint is that by assuming that self-interested decision-making processes lead to Nash equilibrium, system behavior can then be captured by Nash equilibrium without studying the decision-making processes explicitly. This approach has seen success in a wide variety of domains, such as sensor coverage, traffic networks, auctions, and network coordination. However, in many other problem settings, Nash equilibrium are not necessarily guaranteed to exist or emerge from self-interested processes. Thus the main focus of the paper is on the study of sink equilibrium, which are defined as the attractors of these decision-making processes. By classifying system outcomes through a global objective function, we can analyze the resulting approximation guarantees that sink equilibrium have for a given game. Our main result is an approximation guarantee on the sink equilibrium through defining an introduced metric of misalignment, which captures how uniform agents are in their self-interested decision making. Overall, sink equilibrium are naturally occurring in many multi-agent contexts, and we display our results on their quality with respect to two practical problem settings. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2310.04181 [pdf, other]

DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions

Authors: Sanket Kalwar, Mihir Ungarala, Shruti Jain, Aaron Monis, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

Abstract: Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in founda… ▽ More Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in foundation models. Our proposed $\nabla$HFC image processing block excels particularly in adverse weather conditions, where conventional methods often fall short. Furthermore, we investigate the advantages of jointly training visual and latent prompts, demonstrating that this combined approach significantly enhances performance in out-of-distribution scenarios. Our differentiable visual prompts leverage parallel and series architectures to generate prompts, effectively improving object segmentation tasks in adverse conditions. Through a comprehensive series of experiments and evaluations, we provide empirical evidence to support the efficacy of our approach. Project page at https://diffprompter.github.io. △ Less

Submitted 26 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

arXiv:2304.03840 [pdf, other]

Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity

Authors: Runyu Zhang, Yuyang Zhang, Rohit Konda, Bryce Ferguson, Jason Marden, Na Li

Abstract: This paper studies the finite-time horizon Markov games where the agents' dynamics are decoupled but the rewards can possibly be coupled across agents. The policy class is restricted to local policies where agents make decisions using their local state. We first introduce the notion of smooth Markov games which extends the smoothness argument for normal form games to our setting, and leverage the… ▽ More This paper studies the finite-time horizon Markov games where the agents' dynamics are decoupled but the rewards can possibly be coupled across agents. The policy class is restricted to local policies where agents make decisions using their local state. We first introduce the notion of smooth Markov games which extends the smoothness argument for normal form games to our setting, and leverage the smoothness property to bound the price of anarchy of the Markov game. For a specific type of Markov game called the Markov potential game, we also develop a distributed learning algorithm, multi-agent soft policy iteration (MA-SPI), which provably converges to a Nash equilibrium. Sample complexity of the algorithm is also provided. Lastly, our results are validated using a dynamic covering game. △ Less

Submitted 7 April, 2023; originally announced April 2023.

arXiv:2209.14922 [pdf, other]

GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions

Authors: Sanket Kalwar, Dhruv Patel, Aakash Aanegola, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

Abstract: Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition ima… ▽ More Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition images such as those captured under fog and low lighting. Our proposed GDIP block learns to enhance images directly through the downstream object detection loss. This is achieved by learning parameters of multiple image pre-processing (IP) techniques that operate concurrently, with their outputs combined using weights learned through a novel gating mechanism. We further improve GDIP through a multi-stage guidance procedure for progressive image enhancement. Finally, trading off accuracy for speed, we propose a variant of GDIP that can be used as a regularizer for training Yolo, which eliminates the need for GDIP-based image enhancement during inference, resulting in higher throughput and plausible real-world deployment. We demonstrate significant improvement in detection performance over several state-of-the-art methods through quantitative and qualitative studies on synthetic datasets such as PascalVOC, and real-world foggy (RTTS) and low-lighting (ExDark) datasets. △ Less

Submitted 29 September, 2022; originally announced September 2022.

Comments: Submitted to ICRA2023. More information at https://gatedip.github.io

arXiv:2209.11420 [pdf]

Overtwisting and Coiling Highly Enhances Strain Generation of Twisted String Actuators

Authors: Revanth Konda, David Bombara, Jun Zhang

Abstract: Twisted string actuators (TSAs) have exhibited great promise in robotic applications by generating high translational force with low input torque. To further facilitate their robotic applications, it is strongly desirable but challenging to enhance their consistent strain generation while maintaining compliance. Existing studies predominantly considered overtwisting and coiling after the regular t… ▽ More Twisted string actuators (TSAs) have exhibited great promise in robotic applications by generating high translational force with low input torque. To further facilitate their robotic applications, it is strongly desirable but challenging to enhance their consistent strain generation while maintaining compliance. Existing studies predominantly considered overtwisting and coiling after the regular twisting stage to be undesirable non-uniform and unpredictable knots, entanglements, and coils formed to create an unstable and failure-prone structure. Overtwisting would work well for TSAs when uniform coils can be consistently formed. In this study, we realize uniform and consistent coil formation in overtwisted TSAs, which greatly increases their strain. Furthermore, we investigate methods for enabling uniform coil formation upon overtwisting the strings in a TSA and present a procedure to systematically "train" the strings. To the authors' best knowledge, this is the first study to experimentally investigate overtwisting for TSAs with different stiffnesses and realize consistent uniform coil formation. Ultra-high molecular-weight polyethylene (UHMWPE) strings form the stiff TSAs whereas compliant TSAs are realized with stretchable and conductive supercoiled polymer (SCP) strings. The strain, force, velocity, and torque of each overtwisted TSA was studied. Overtwisting and coiling resulted in approximately 70% strain in stiff TSAs and approximately 60% strain in compliant TSAs. This is more than twice the strain achieved through regular twisting. Lastly, the overtwisted TSA was successfully demonstrated in a robotic bicep. △ Less

Submitted 23 September, 2022; originally announced September 2022.

arXiv:2207.03603 [pdf, other]

Anthropomorphic Twisted String-Actuated Soft Robotic Gripper with Tendon-Based Stiffening

Authors: David Bombara, Revanth Konda, Steven Swanbeck, Jun Zhang

Abstract: Realizing high-performance soft robotic grippers is challenging because of the inherent limitations of the soft actuators and artificial muscles that drive them, including low force output, small actuation range, and poor compactness. Despite advances in this area, realizing compact soft grippers with high dexterity and force output is still challenging. This paper explores twisted string actuator… ▽ More Realizing high-performance soft robotic grippers is challenging because of the inherent limitations of the soft actuators and artificial muscles that drive them, including low force output, small actuation range, and poor compactness. Despite advances in this area, realizing compact soft grippers with high dexterity and force output is still challenging. This paper explores twisted string actuators (TSAs) to drive a soft robotic gripper. TSAs have been used in numerous robotic applications, but their inclusion in soft robots has been limited. The proposed design of the gripper was inspired by the human hand. Tunable stiffness was implemented in the fingers with antagonistic TSAs. The fingers' bending angles, actuation speed, blocked force output, and stiffness tuning were experimentally characterized. The gripper achieved a score of 6 on the Kapandji test and recreated 31 of the 33 grasps of the Feix GRASP taxonomy. It exhibited a maximum gras** force of 72 N, which was almost 13 times its own weight. A comparison study revealed that the proposed gripper exhibited equivalent or superior performance compared to other similar soft grippers. △ Less

Submitted 22 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

Comments: 19 pages, 15 figures

arXiv:2201.07120 [pdf, other]

Contextual road lane and symbol generation for autonomous driving

Authors: Ajay Soni, Pratik Padamwar, Krishna Reddy Konda

Abstract: In this paper we present a novel approach for lane detection and segmentation using generative models. Traditionally discriminative models have been employed to classify pixels semantically on a road. We model the probability distribution of lanes and road symbols by training a generative adversarial network. Based on the learned probability distribution, context-aware lanes and road signs are gen… ▽ More In this paper we present a novel approach for lane detection and segmentation using generative models. Traditionally discriminative models have been employed to classify pixels semantically on a road. We model the probability distribution of lanes and road symbols by training a generative adversarial network. Based on the learned probability distribution, context-aware lanes and road signs are generated for a given image which are further quantized for nearest class label. Proposed method has been tested on BDD100K and Baidu ApolloScape datasets and performs better than state of the art and exhibits robustness to adverse conditions by generating lanes in faded out and occluded scenarios. △ Less

Submitted 18 January, 2022; originally announced January 2022.

arXiv:2111.09154 [pdf, other]

Execution Order Matters in Greedy Algorithms with Limited Information

Authors: Rohit Konda, David Grimsman, Jason Marden

Abstract: In this work, we study the multi-agent decision problem where agents try to coordinate to optimize a given system-level objective. While solving for the global optimal is intractable in many cases, the greedy algorithm is a well-studied and efficient way to provide good approximate solutions - notably for submodular optimization problems. Executing the greedy algorithm requires the agents to be or… ▽ More In this work, we study the multi-agent decision problem where agents try to coordinate to optimize a given system-level objective. While solving for the global optimal is intractable in many cases, the greedy algorithm is a well-studied and efficient way to provide good approximate solutions - notably for submodular optimization problems. Executing the greedy algorithm requires the agents to be ordered and execute a local optimization based on the solutions of the previous agents. However, in limited information settings, passing the solution from the previous agents may be nontrivial, as some agents may not be able to directly communicate with each other. Thus the communication time required to execute the greedy algorithm is closely tied to the order that the agents are given. In this work, we characterize interplay between the communication complexity and agent orderings by showing that the complexity using the best ordering is O(n) and increases considerably to O(n^2) when using the worst ordering. Motivated by this, we also propose an algorithm that can find an ordering and execute the greedy algorithm quickly, in a distributed fashion. We also show that such an execution of the greedy algorithm is advantageous over current methods for distributed submodular maximization. △ Less

Submitted 30 November, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

arXiv:2107.12871 [pdf, other]

Model Free Barrier Functions via Implicit Evading Maneuvers

Authors: Eric Squires, Rohit Konda, Samuel Coogan, Magnus Egerstedt

Abstract: This paper demonstrates that the safety override arising from the use of a barrier function can in some cases be needlessly restrictive. In particular, we examine the case of fixed-wing collision avoidance and show that when using a barrier function, there are cases where two fixed-wing aircraft can come closer to colliding than if there were no barrier function at all. In addition, we construct c… ▽ More This paper demonstrates that the safety override arising from the use of a barrier function can in some cases be needlessly restrictive. In particular, we examine the case of fixed-wing collision avoidance and show that when using a barrier function, there are cases where two fixed-wing aircraft can come closer to colliding than if there were no barrier function at all. In addition, we construct cases where the barrier function labels the system as unsafe even when the vehicles start arbitrarily far apart. In other words, the barrier function ensures safety but with unnecessary costs to performance. We therefore introduce model-free barrier functions which take a data driven approach to creating a barrier function. We demonstrate the effectiveness of model-free barrier functions in a collision avoidance simulation of two fixed-wing aircraft. △ Less

Submitted 23 September, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

Comments: This work has been submitted to the American Controls Conference

arXiv:2106.04031 [pdf, other]

Balancing Asymptotic and Transient Efficiency Guarantees in Set Covering Games

Authors: Rohit Konda, Rahul Chandan, David Grimsman, Jason R. Marden

Abstract: Game theoretic approaches have gained traction as robust methodologies for designing distributed local algorithms that induce a desired overall system configuration in multi-agent settings. However, much of the emphasis in these approaches is on providing asymptotic guarantees on the performance of a network of agents, and there is a gap in the study of efficiency guarantees along transients of th… ▽ More Game theoretic approaches have gained traction as robust methodologies for designing distributed local algorithms that induce a desired overall system configuration in multi-agent settings. However, much of the emphasis in these approaches is on providing asymptotic guarantees on the performance of a network of agents, and there is a gap in the study of efficiency guarantees along transients of these distributed algorithms. Therefore, in this paper, we study the transient efficiency guarantees of a natural game-theoretic algorithm in the class of set covering games, which have been used to model a variety of applications. Our main results characterize the optimal utility design that maximizes the guaranteed efficiency along the transient of the natural dynamics. Furthermore, we characterize the Pareto-optimal frontier with regards to guaranteed efficiency in the transient and the asymptote under a class of game-theoretic designs. Surprisingly, we show that there exists an extreme trade-off between the long-term and short-term guarantees in that an asymptotically optimal game-theoretic design can perform arbitrarily bad in the transient. △ Less

Submitted 2 November, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

arXiv:2106.04029 [pdf, other]

Mission Level Uncertainty in Multi-Agent Resource Allocation

Authors: Rohit Konda, Rahul Chandan, Jason R. Marden

Abstract: In recent years, a significant research effort has been devoted to the design of distributed protocols for the control of multi-agent systems, as the scale and limited communication bandwidth characteristic of such systems render centralized control impossible. Given the strict operating conditions, it is unlikely that every agent in a multi-agent system will have local information that is consist… ▽ More In recent years, a significant research effort has been devoted to the design of distributed protocols for the control of multi-agent systems, as the scale and limited communication bandwidth characteristic of such systems render centralized control impossible. Given the strict operating conditions, it is unlikely that every agent in a multi-agent system will have local information that is consistent with the true system state. Yet, the majority of works in the literature assume that agents share perfect knowledge of their environment. This paper focuses on understanding the impact that inconsistencies in agents' local information can have on the performance of multi-agent systems. More specifically, we consider the design of multi-agent operations under a game theoretic lens where individual agents are assigned utilities that guide their local decision making. We provide a tractable procedure for designing utilities that optimize the efficiency of the resulting collective behavior (i.e., price of anarchy) for classes of set covering games where the extent of the information inconsistencies is known. In the setting where the extent of the informational inconsistencies is not known, we show -- perhaps surprisingly -- that underestimating the level of uncertainty leads to better price of anarchy than overestimating it. △ Less

Submitted 7 June, 2021; originally announced June 2021.

arXiv:1906.03771 [pdf, other]

Composition of Safety Constraints For Fixed-Wing Collision Avoidance Amidst Limited Communications

Authors: Eric Squires, Pietro Pierpaoli, Rohit Konda, Samuel Coogan, Magnus Egerstedt

Abstract: This paper considers how to ensure that a system of fixed wing Unmanned Aerial Vehicles (UAVs) can avoid collisions. To do so we develop a novel method for creating a barrier function, which is similar to a Lyapunov function and can be used to ensure that a system can stay safe for all future times. After introducing the general approach, it is shown how to ensure that collision avoidance for two… ▽ More This paper considers how to ensure that a system of fixed wing Unmanned Aerial Vehicles (UAVs) can avoid collisions. To do so we develop a novel method for creating a barrier function, which is similar to a Lyapunov function and can be used to ensure that a system can stay safe for all future times. After introducing the general approach, it is shown how to ensure that collision avoidance for two vehicles can be guaranteed for all future times. The construction is then extended to the case of arbitrarily many vehicles by addressing how to satisfy multiple safety objectives simultaneously. We do this while ensuring output actuator commands are within specified limits. Because this formulation requires communication of control values and may therefore reduce throughput of other important messages, we then show how to reformulate the solution without this significant communication overhead while still ensuring safety is maintained and actuator limits are respected. We validate the theoretical developments of this paper in the simulator SCRIMMAGE with a simulation of 20 UAVs that maintain safe distances from each other even though their nominal paths would otherwise cause a collision. △ Less

Submitted 21 July, 2021; v1 submitted 9 June, 2019; originally announced June 2019.

arXiv:1903.02893 [pdf, other]

Only sparsity based loss function for learning representations

Authors: Vivek Bakaraju, Kishore Reddy Konda

Abstract: We study the emergence of sparse representations in neural networks. We show that in unsupervised models with regularization, the emergence of sparsity is the result of the input data samples being distributed along highly non-linear or discontinuous manifold. We also derive a similar argument for discriminatively trained networks and present experiments to support this hypothesis. Based on our st… ▽ More We study the emergence of sparse representations in neural networks. We show that in unsupervised models with regularization, the emergence of sparsity is the result of the input data samples being distributed along highly non-linear or discontinuous manifold. We also derive a similar argument for discriminatively trained networks and present experiments to support this hypothesis. Based on our study of sparsity, we introduce a new loss function which can be used as regularization term for models like autoencoders and MLPs. Further, the same loss function can also be used as a cost function for an unsupervised single-layered neural network model for learning efficient representations. △ Less

Submitted 7 March, 2019; originally announced March 2019.

arXiv:1306.3162 [pdf, other]

Learning to encode motion using spatio-temporal synchrony

Authors: Kishore Reddy Konda, Roland Memisevic, Vincent Michalski

Abstract: We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing the motion we wish to detect. We show that learning about synchrony is possible using very fast, local learning rules, by introducing multiplicative "gating" in… ▽ More We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing the motion we wish to detect. We show that learning about synchrony is possible using very fast, local learning rules, by introducing multiplicative "gating" interactions between hidden units across frames. This makes it possible to achieve competitive performance in a wide variety of motion estimation tasks, using a small fraction of the time required to learn features, and to outperform hand-crafted spatio-temporal features by a large margin. We also show how learning about synchrony can be viewed as performing greedy parameter estimation in the well-known motion energy model. △ Less

Submitted 10 February, 2014; v1 submitted 13 June, 2013; originally announced June 2013.

Showing 1–14 of 14 results for author: Konda, R