
Enhancing Educational Efficiency: Generative AI Chatbots and DevOps in Education 4.0

Authors: Edis Mekić, Mihailo Jovanović, Kristijan Kuk, Bojan Prlinčević, Ana Savić

Abstract: This research paper will bring forth the innovative pedagogical approach in computer science education, which uses a combination of methodologies borrowed from Artificial Intelligence (AI) and DevOps to enhance the learning experience in Content Management Systems (CMS) Development. It has been done over three academic years, comparing the traditional way of teaching with the lately introduced AI-supported techniques. This had three structured sprints, each one of them covering the major parts of the sprint: object-oriented PHP, theme development, and plugin development. In each sprint, the student deals with part of the theoretical content and part of the practical task, using ChatGPT as an auxiliary tool. In that sprint, the model will provide solutions in code debugging and extensions of complex problems. The course includes practical examples like code replication with PHP, functionality expansion of the CMS, even development of custom plugins, and themes. The course practice includes versions' control with Git repositories. Efficiency will touch the theme and plugin output rates during development and mobile/web application development. Comparative analysis indicates that there is a marked increase in efficiency and shows effectiveness with the proposed AI- and DevOps-supported methodology. The study is very informative since education in computer science and its landscape change embodies an emerging technology that could have transformation impacts on amplifying the potential for scalable and adaptive learning approaches.

Submitted 18 April, 2024; originally announced June 2024.

arXiv:2404.18311 [pdf]

Towards Incremental Learning in Large Language Models: A Critical Review

Authors: Mladjan Jovanovic, Peter Voss

Abstract: Incremental learning is the ability of systems to acquire knowledge over time, enabling their adaptation and generalization to novel tasks. It is a critical ability for intelligent, real-world systems, especially when data changes frequently or is limited. This review provides a comprehensive analysis of incremental learning in Large Language Models. It synthesizes the state-of-the-art incremental learning paradigms, including continual learning, meta-learning, parameter-efficient learning, and mixture-of-experts learning. We demonstrate their utility for incremental learning by describing specific achievements from these related topics and their critical factors. An important finding is that many of these approaches do not update the core model, and none of them update incrementally in real-time. The paper highlights current problems and challenges for future research in the field. By consolidating the latest relevant research developments, this review offers a comprehensive understanding of incremental learning and its implications for designing and developing LLM-based learning systems.

Submitted 5 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

arXiv:2312.04449 [pdf]

Climb Against Time -- Self-perspective through a psychological game

Authors: Stefan Cliff, Mlađan Jovanović

Abstract: With the rapid development of technology and its place in our lives, so too has the idea of needing to grow up faster, do more, be more and more as we are exposed to so many of our betters billboarding their successes and achievements that very often we can experience burnout, depression, feeling of inadequacy and worse. All because we cannot keep up with their tempos in life, and in this chaos, we often lose the very important fact and truth, our life should be lived at the tempo that fits our actual wants, our capabilities and opportunities. In recent years, since the mid 2010s, video games have entered the mainstream even more than before as a media platform that provides a more interactive experience than others like it. Where the players actions have consequences, outcomes both good and bad, and the experience of the player is highly linked to their capabilities. Based on the type of video game, be it single player or multiplayer, often the solution to the problem the player is facing will vary. With the increase popularity of both buying and creating games, more and more personal stories, talented teams and individuals, unique takes and ideas are being tried and often for the games that do succeed, there is financial gain but more impactful is communities built around the message and/or its execution. These communities usually reside on social media, such as Reddit, X, Tumblr, or on their own community pages made by the developers to directly interact with their players.
But this is true often even for games that do not gain much financial success, they gain a certain cult following, especially if the topics of the game are either obscure or whose workings revolve around mental health issues, trauma survival, loss, or just in general very human and emotional topics.

Submitted 7 December, 2023; originally announced December 2023.

arXiv:2312.02796 [pdf, other]

Materials Expert-Artificial Intelligence for Materials Discovery

Authors: Yanjun Liu, Milena Jovanovic, Krishnanand Mallayya, Wesley J. Maddox, Andrew Gordon Wilson, Sebastian Klemenz, Leslie M. Schoop, Eun-Ah Kim

Abstract: The advent of material databases provides an unprecedented opportunity to uncover predictive descriptors for emergent material properties from vast data space. However, common reliance on high-throughput ab initio data necessarily inherits limitations of such data: mismatch with experiments. On the other hand, experimental decisions are often guided by an expert's intuition honed from experiences that are rarely articulated. We propose using machine learning to "bottle" such operational intuition into quantifiable descriptors using expertly curated measurement-based data. We introduce "Materials Expert-Artificial Intelligence" (ME-AI) to encapsulate and articulate this human intuition. As a first step towards such a program, we focus on the topological semimetal (TSM) among square-net materials as the property inspired by the expert-identified descriptor based on structural information: the tolerance factor. We start by curating a dataset encompassing 12 primary features of 879 square-net materials, using experimental data whenever possible. We then use Dirichlet-based Gaussian process regression using a specialized kernel to reveal composite descriptors for square-net topological semimetals. The ME-AI learned descriptors independently reproduce expert intuition and expand upon it. Specifically, new descriptors point to hypervalency as a critical chemical feature predicting TSM within square-net compounds.
Our success with a carefully defined problem points to the "machine bottling human insight" approach as promising for machine learning-aided material discovery.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 8 pages main text, 4 figs, 8 pages Supplementary material

arXiv:2309.01622 [pdf]

Concepts is All You Need: A More Direct Path to AGI

Authors: Peter Voss, Mladjan Jovanovic

Abstract: Little demonstrable progress has been made toward AGI (Artificial General Intelligence) since the term was coined some 20 years ago. In spite of the fantastic breakthroughs in Statistical AI such as AlphaZero, ChatGPT, and Stable Diffusion none of these projects have, or claim to have, a clear path to AGI. In order to expedite the development of AGI it is crucial to understand and identify the core requirements of human-like intelligence as it pertains to AGI. From that one can distill which particular development steps are necessary to achieve AGI, and which are a distraction. Such analysis highlights the need for a Cognitive AI approach rather than the currently favored statistical and generative efforts. More specifically it identifies the central role of concepts in human-like cognition. Here we outline an architecture and development plan, together with some preliminary results, that offers a much more direct path to full Human-Level AI (HLAI)/AGI.

Submitted 4 September, 2023; originally announced September 2023.

arXiv:2308.03598 [pdf]

Why We Don't Have AGI Yet

Authors: Peter Voss, Mladjan Jovanovic

Abstract: The original vision of AI was re-articulated in 2002 via the term 'Artificial General Intelligence' or AGI. This vision is to build 'Thinking Machines' - computer systems that can learn, reason, and solve problems similar to the way humans do. This is in stark contrast to the 'Narrow AI' approach practiced by almost everyone in the field over the many decades. While several large-scale efforts have nominally been working on AGI (most notably DeepMind), the field of pure focused AGI development has not been well funded or promoted. This is surprising given the fantastic value that true AGI can bestow on humanity. In addition to the dearth of effort in this field, there are also several theoretical and methodical missteps that are hampering progress. We highlight why purely statistical approaches are unlikely to lead to AGI, and identify several crucial cognitive abilities required to achieve human-like adaptability and autonomous learning. We conclude with a survey of socio-technical factors that have undoubtedly slowed progress towards AGI.

Submitted 19 September, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

arXiv:2306.00212 [pdf, ps, other]

Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning

Authors: Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Abstract: We examine online safe multi-agent reinforcement learning using constrained Markov games in which agents compete by maximizing their expected total rewards under a constraint on expected total utilities. Our focus is confined to an episodic two-player zero-sum constrained Markov game with independent transition functions that are unknown to agents, adversarial reward functions, and stochastic utility functions. For such a Markov game, we employ an approach based on the occupancy measure to formulate it as an online constrained saddle-point problem with an explicit constraint. We extend the Lagrange multiplier method in constrained optimization to handle the constraint by creating a generalized Lagrangian with minimax decision primal variables and a dual variable. Next, we develop an upper confidence reinforcement learning algorithm to solve this Lagrangian problem while balancing exploration and exploitation. Our algorithm updates the minimax decision primal variables via online mirror descent and the dual variable via projected gradient step and we prove that it enjoys sublinear rate $O((|X|+|Y|) L \sqrt{T(|A|+|B|)})$ for both regret and constraint violation after playing $T$ episodes of the game. Here, $L$ is the horizon of each episode, $(|X|,|A|)$ and $(|Y|,|B|)$ are the state/action space sizes of the min-player and the max-player, respectively. To the best of our knowledge, we provide the first provably efficient online safe reinforcement learning algorithm in constrained Markov games.

Submitted 31 May, 2023; originally announced June 2023.

Comments: 59 pages, a full version of the main paper in the 5th Annual Conference on Learning for Dynamics and Control

arXiv:2209.11920 [pdf, other]

Tradeoffs between convergence rate and noise amplification for momentum-based accelerated optimization algorithms

Authors: Hesameddin Mohammadi, Meisam Razaviyayn, Mihailo R. Jovanović

Abstract: We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic problems, we use the steady-state variance of the error in the optimization variable to quantify noise amplification and identify fundamental stochastic performance tradeoffs. Our approach utilizes the Jury stability criterion to provide a novel geometric characterization of conditions for linear convergence, and it reveals the relation between the noise amplification and convergence rate as well as their dependence on the condition number and the constant algorithmic parameters. This geometric insight leads to simple alternative proofs of standard convergence results and allows us to establish "uncertainty principle" of strongly convex optimization: for the two-step momentum method with linear convergence rate, the lower bound on the product between the settling time and noise amplification scales quadratically with the condition number.
Our analysis also identifies a key difference between the gradient and iterate noise models: while the amplification of gradient noise can be made arbitrarily small by sufficiently decelerating the algorithm, the best achievable variance for the iterate noise model increases linearly with the settling time in the decelerating regime. Finally, we introduce two parameterized families of algorithms that strike a balance between noise amplification and settling time while preserving order-wise Pareto optimality for both noise models.

Submitted 19 June, 2024; v1 submitted 24 September, 2022; originally announced September 2022.

Comments: 23 pages; 7 figures

arXiv:2209.04206 [pdf]

doi 10.1016/j.techfore.2022.121981

Managing a blockchain-based platform ecosystem for industry-wide adoption: The case of TradeLens

Authors: Marin Jovanovic, Nikola Kostić, Ina M. Sebastian, Tomaz Sedej

Abstract: The proliferation of blockchain-based platform ecosystems in recent years has prompted scholars across various disciplines to explore the conditions leading to their successful deployment. However, developing a blockchain-based platform ecosystem creates various challenges for the platform sponsor that may influence industry-wide adoption and, ultimately, the platform's success. This study follows the development of TradeLens, a leading global shipping platform ecosystem underpinned by blockchain technology. We examine the factors affecting industry-wide adoption among global supply chain actors by unpacking platform value drivers and platform governance mechanisms identified at TradeLens. While the platform value hinges on the digitalization of workflows and the ecosystem leverage, the platform governance includes strategic (off-chain), technology (on-chain), and interoperability (on- and off-chain) governance - as mechanisms for effectively managing a blockchain-based platform ecosystem. This paper contributes to the literature on blockchain-based platform ecosystems and the platform literature.

Submitted 9 September, 2022; originally announced September 2022.

arXiv:2207.01487 [pdf]

State of the Art of Audio- and Video-Based Solutions for AAL

Authors: Slavisa Aleksic, Michael Atanasov, Jean Calleja Agius, Kenneth Camilleri, Anto Cartolovni, Pau Climent-Peerez, Sara Colantonio, Stefania Cristina, Vladimir Despotovic, Hazim Kemal Ekenel, Ekrem Erakin, Francisco Florez-Revuelta, Danila Germanese, Nicole Grech, Steinunn Gróa Sigurðardóttir, Murat Emirzeoglu, Ivo Iliev, Mladjan Jovanovic, Martin Kampel, William Kearns, Andrzej Klimczuk, Lambros Lambrinos, Jennifer Lumetzberger, Wiktor Mucha, Sophie Noiret, et al. (14 additional authors not shown)

Abstract: The report illustrates the state of the art of the most successful AAL applications and functions based on audio and video data, namely (i) lifelogging and self-monitoring, (ii) remote monitoring of vital signs, (iii) emotional state recognition, (iv) food intake monitoring, activity and behaviour recognition, (v) activity and personal assistance, (vi) gesture recognition, (vii) fall detection and prevention, (viii) mobility assessment and frailty recognition, and (ix) cognitive and motor rehabilitation. For these application scenarios, the report illustrates the state of play in terms of scientific advances, available products and research project. The open challenges are also highlighted.

Submitted 5 July, 2022; v1 submitted 26 June, 2022; originally announced July 2022.

ACM Class: I.2

arXiv:2206.02346 [pdf, other]

Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs

Authors: Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Başar, Mihailo R. Jovanović

Abstract: We study sequential decision making problems aimed at maximizing the expected total reward while satisfying a constraint on the expected total utility. We employ the natural policy gradient method to solve the discounted infinite-horizon optimal control problem for Constrained Markov Decision Processes (constrained MDPs). Specifically, we propose a new Natural Policy Gradient Primal-Dual (NPG-PD) method that updates the primal variable via natural policy gradient ascent and the dual variable via projected sub-gradient descent. Although the underlying maximization involves a nonconcave objective function and a nonconvex constraint set, under the softmax policy parametrization we prove that our method achieves global convergence with sublinear rates regarding both the optimality gap and the constraint violation. Such convergence is independent of the size of the state-action space, i.e., it is dimension-free. Furthermore, for log-linear and general smooth policy parametrizations, we establish sublinear convergence rates up to a function approximation error caused by restricted policy parametrization. We also provide convergence and finite-sample complexity guarantees for two sample-based NPG-PD algorithms. Finally, we use computational experiments to showcase the merits and the effectiveness of our approach.

Submitted 17 October, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

Comments: 72 pages, 4 figures, 2 tables; revised sample complexity and computational experiments, and added zero constraint violation

arXiv:2202.04129 [pdf, other]

Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence

Authors: Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Mihailo R. Jovanović

Abstract: We examine global non-asymptotic convergence properties of policy gradient methods for multi-agent reinforcement learning (RL) problems in Markov potential games (MPG). To learn a Nash equilibrium of an MPG in which the size of state space and/or the number of players can be very large, we propose new independent policy gradient algorithms that are run by all players in tandem. When there is no uncertainty in the gradient evaluation, we show that our algorithm finds an $ε$-Nash equilibrium with $O(1/ε^2)$ iteration complexity which does not explicitly depend on the state space size. When the exact gradient is not available, we establish $O(1/ε^5)$ sample complexity bound in a potentially infinitely large state space for a sample-based algorithm that utilizes function approximation. Moreover, we identify a class of independent policy gradient algorithms that enjoys convergence for both zero-sum Markov games and Markov cooperative games with the players that are oblivious to the types of games being played. Finally, we provide computational experiments to corroborate the merits and the effectiveness of our theoretical developments.

Submitted 4 August, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: 55 pages, 6 figures; Revised comments in Sections 3, 5 of published ICML 2022 version, results unchanged

arXiv:2110.15011 [pdf]

Simulating a questionnaire on the framing effects in decision-making processes in a serious game

Authors: Sara Knežević, Mlađan Jovanović

Abstract: The rapid development of technology has introduced new formats of human-computer interaction, which have in turn produced many new forms of media and a whole new field of interactive multimedia. One of the major mediums that has grown in popularity since its early development is video games. For a long time, video games have been developed and distributed for the purpose of entertainment, however, in the late 2010s, researchers have taken an interest in the characteristics of games and how they can be used for different purposes. Video games allow a tight loop of action-reaction which provides fertile ground for many types of experiments which would be impossible or prohibitively difficult to perform in the physical world, and as such serve as strong virtual alternatives. A video game that is able to produce an immersive experience for the player in which the player believes that they are "actually there" in the game, and that the game is an extension of reality provides an alternate way to explore human behaviors and decision-making processes. Prospect theory questionnaires explore decision-making in hypothetical situations. In most cases, these experiments are done in controlled environments and rely on the respondent's imagination to reproduce the situation which is presented to them. Creating a virtual world with which the players can directly interact with and face tangible consequences of their decisions brings the hypothetical situations of the prospect theory questions closer to the respondent.
If the players can interact with and manipulate the virtual world, then it is much easier for them to empathize with it and their character, and thus, the assumption is that the answers represent a more realistic image of the player's decision making.

Submitted 28 October, 2021; originally announced October 2021.

arXiv:2109.00359 [pdf, other]

doi 10.1109/TCNS.2023.3237483

Can Decentralized Control Outperform Centralized? The Role of Communication Latency

Authors: Luca Ballotta, Mihailo R. Jovanović, Luca Schenato

Abstract: In this paper, we examine the influence of communication latency on performance of networked control systems. Even though distributed control architectures offer advantages in terms of communication, maintenance costs, and scalability, it is an open question how communication latency that varies with network topology influences closed-loop performance. For networks in which delays increase with the number of links, we establish the existence of a fundamental performance trade-off that arises from control architecture. In particular, we utilize consensus dynamics with single- and double-integrator agents to show that, if delays increase fast enough, a sparse controller with nearest neighbor interactions can outperform the centralized one with all-to-all communication topology.

Submitted 26 January, 2023; v1 submitted 1 September, 2021; originally announced September 2021.

Comments: 12 pages, 13 figures; accepted for publication on IEEE Transactions on Control of Network Systems; final accepted version

MSC Class: 93B70 (Primary), 93C43 (Secondary); ACM Class: C.2.1

arXiv:2105.05729 [pdf]

User requirements for inclusive technology for older adults

Authors: Mladjan Jovanovic, Antonella De Angeli, Andrew McNeill, Lynne Coventry

Abstract: Active aging technologies are increasingly designed to support an active lifestyle. However, the way in which they are designed can raise different barriers to acceptance of and use by older adults. Their designers can adopt a negative stereotype of aging. Thorough understanding of user requirements is central to this problem. This paper investigates user requirements for technologies that encourage an active lifestyle and provide older people with the means to self-manage their physical, mental, and emotional health. This requires consideration of the person and the sociotechnical context of use. We describe our work in collecting and analyzing older adults' requirements for a technology which enables an active lifestyle. The main contribution of the paper is a model of user requirements for inclusive technology for older people.

Submitted 12 May, 2021; originally announced May 2021.

arXiv:2105.05306 [pdf, ps, other]

Intelligent interactive technologies for mental health and well-being

Authors: Mladjan Jovanovic, Aleksandar Jevremovic, Milica Pejovic-Milovancevic

Abstract: Mental healthcare has seen numerous benefits from interactive technologies and artificial intelligence. Various interventions have successfully used intelligent technologies to automate the assessment and evaluation of psychological treatments and mental well-being and functioning. These technologies include different types of robots, video games, and conversational agents. The paper critically analyzes existing solutions with the outlooks for their future. In particular, we: i) give an overview of the technology for mental health, ii) critically analyze the technology against the proposed criteria, and iii) provide the design outlooks for these technologies.

Submitted 11 May, 2021; originally announced May 2021.

arXiv:2103.08017 [pdf, other]

Transient growth of accelerated optimization algorithms

Authors: Hesameddin Mohammadi, Samantha Samuelson, Mihailo R. Jovanović

Abstract: Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quadratic problems, we employ tools from linear systems theory to show that transient growth arises from the presence of non-normal dynamics. We identify the existence of modes that yield an algebraic growth in early iterations and quantify the transient excursion from the optimal solution caused by these modes. For strongly convex smooth optimization problems, we utilize the theory of integral quadratic constraints (IQCs) to establish an upper bound on the magnitude of the transient response of Nesterov's accelerated algorithm. We show that both the Euclidean distance between the optimization variable and the global minimizer and the rise time to the transient peak are proportional to the square root of the condition number of the problem. Finally, for problems with large condition numbers, we demonstrate tightness of the bounds that we derive up to constant factors.

Submitted 23 December, 2021; v1 submitted 14 March, 2021; originally announced March 2021.

Comments: 12 pages, 2 figures
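
Illustrative note on the entry above: the transient growth described in the abstract can be seen numerically on a single scalar mode. The sketch below is not taken from the paper. It assumes the standard constant-parameter form of Nesterov's method for strongly convex problems (step size 1/L, momentum (sqrt(kappa) - 1)/(sqrt(kappa) + 1)), a hypothetical quadratic f(x) = 0.5 * m * x^2 with m = 1 and L = kappa, and an assumed initial state (x_0, x_1) = (0, 1). The function name `nesterov_scalar_mode` is made up for this illustration.

```python
import math


def nesterov_scalar_mode(kappa, steps):
    """Iterate Nesterov's method on f(x) = 0.5 * m * x**2 with m = 1, L = kappa."""
    m, L = 1.0, float(kappa)
    alpha = 1.0 / L  # step size 1/L
    beta = (math.sqrt(kappa) - 1.0) / (math.sqrt(kappa) + 1.0)  # momentum
    x_prev, x = 0.0, 1.0  # assumed initial state (x_0, x_1), chosen for illustration
    trajectory = [x_prev, x]
    for _ in range(steps):
        y = x + beta * (x - x_prev)       # extrapolation step
        x_prev, x = x, y - alpha * m * y  # gradient step evaluated at y
        trajectory.append(x)
    return trajectory


traj = nesterov_scalar_mode(kappa=10_000, steps=2_000)
peak = max(abs(v) for v in traj)
peak_index = max(range(len(traj)), key=lambda k: abs(traj[k]))
print(f"peak |x_k| = {peak:.2f} at iteration {peak_index}")
```

Under these assumptions the iterate first moves away from the minimizer at 0 before decaying, and the peak occurs at an iteration count close to sqrt(kappa), which agrees qualitatively with the scaling stated in the abstract.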

arXiv:2102.04862 [pdf]

doi 10.1016/j.technovation.2020.102218

Co-evolution of platform architecture, platform services, and platform governance: Expanding the platform value of industrial digital platforms

Authors: Marin Jovanovic, David Sjodin, Vinit Parida

Abstract: Industrial manufacturers increasingly develop digital platforms in the business-to-business (B2B) context. This emergent form of digital platforms requires a profound yet little understood holistic perspective that encompasses the co-evolution of platform architecture, platform services, and platform governance. To address this research gap, our study examines multiple platform sponsors from an industrial manufacturing context. The study demarcates three platform archetypes: product platform, supply chain platform, and platform ecosystem. We argue that each platform archetype involves a gradual development of platform architecture, platform services, and platform governance, which mirror each other. We also find that each platform archetype is characterized by a specific innovation mechanism that contributes to the platform service discovery and expands the platform value. Our study extends the co-evolution perspective of platform ecosystem literature and digital servitization literature.

Submitted 24 January, 2021; originally announced February 2021.

arXiv:2011.07901 [pdf]

doi 10.15308/Sinteza-2020-14-22

Conversational agents for learning foreign languages -- a survey

Authors: Jasna Petrovic, Mladjan Jovanovic

Abstract: Conversational practice, while crucial for all language learners, can be challenging to get enough of and very expensive. Chatbots are computer programs developed to engage in conversations with humans. They are designed as software avatars with limited, but growing conversational capability. The most natural and potentially powerful application of chatbots is in line with their fundamental nature - language practice. However, their role and outcomes within (in)formal language learning are currently tangential at best. Existing research in the area has generally focused on chatbots' comprehensibility and the motivation they inspire in their users. In this paper, we provide an overview of the chatbots for learning languages, critically analyze existing approaches, and discuss the major challenges for future work.

Submitted 16 November, 2020; originally announced November 2020.

Comments: Sinteza 2020 - International Scientific Conference on Information Technology and Data Related Research, Belgrade, Singidunum University, Serbia

arXiv:2011.07495 [pdf, other]

FAIR: Fair Adversarial Instance Re-weighting

Authors: Andrija Petrović, Mladen Nikolić, Sandro Radovanović, Boris Delibašić, Miloš Jovanović

Abstract: With growing awareness of societal impact of artificial intelligence, fairness has become an important aspect of machine learning algorithms. The issue is that human biases towards certain groups of population, defined by sensitive features like race and gender, are introduced to the training data through data collection and labeling. Two important directions of fairness-ensuring research have focused on (i) instance weighting in order to decrease the impact of more biased instances and (ii) adversarial training in order to construct data representations informative of the target variable, but uninformative of the sensitive attributes. In this paper we propose a Fair Adversarial Instance Re-weighting (FAIR) method, which uses adversarial training to learn an instance weighting function that ensures fair predictions. Merging the two paradigms, it inherits desirable properties from both -- interpretability of reweighting and end-to-end trainability of adversarial training. We propose four different variants of the method and, among other things, demonstrate how the method can be cast in a fully probabilistic framework. Additionally, the theoretical properties of FAIR models have been studied extensively. We compare FAIR models to 7 other related and state-of-the-art models and demonstrate that FAIR is able to achieve a better trade-off between accuracy and unfairness. To the best of our knowledge, this is the first model that merges reweighting and adversarial approaches by means of a weighting function that can provide interpretable information about fairness of individual instances.

Submitted 15 November, 2020; originally announced November 2020.

arXiv:2011.03969 [pdf]

doi 10.1109/MIC.2020.3037151

Chatbots as conversational healthcare services

Authors: Mlađan Jovanović, Marcos Baez, Fabio Casati

Abstract: Chatbots are emerging as a promising platform for accessing and delivering healthcare services. The evidence is in the growing number of publicly available chatbots aiming at taking an active role in the provision of prevention, diagnosis, and treatment services. This article takes a closer look at how these emerging chatbots address design aspects relevant to healthcare service provision, emphasizing the Human-AI interaction aspects and the transparency in AI automation and decision making.

Submitted 8 November, 2020; originally announced November 2020.

arXiv:2003.00534 [pdf, ps, other]

Provably Efficient Safe Exploration via Primal-Dual Policy Optimization

Authors: Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Abstract: We study the Safe Reinforcement Learning (SRL) problem using the Constrained Markov Decision Process (CMDP) formulation in which an agent aims to maximize the expected total reward subject to a safety constraint on the expected total value of a utility function. We focus on an episodic setting with the function approximation where the Markov transition kernels have a linear structure but do not impose any additional assumptions on the sampling model. Designing SRL algorithms with provable computational and statistical efficiency is particularly challenging under this setting because of the need to incorporate both the safety constraint and the function approximation into the fundamental exploitation/exploration tradeoff. To this end, we present an Optimistic Primal-Dual Proximal Policy Optimization (OPDOP) algorithm where the value function is estimated by combining the least-squares policy evaluation and an additional bonus term for safe exploration. We prove that the proposed algorithm achieves an $\tilde{O}(d H^{2.5}\sqrt{T})$ regret and an $\tilde{O}(d H^{2.5}\sqrt{T})$ constraint violation, where $d$ is the dimension of the feature mapping, $H$ is the horizon of each episode, and $T$ is the total number of steps. These bounds hold when the reward/utility functions are fixed but the feedback after each episode is bandit. Our bounds depend on the capacity of the state-action space only through the dimension of the feature mapping and thus our results hold even when the number of states goes to infinity. To the best of our knowledge, we provide the first provably efficient online policy optimization algorithm for CMDP with safe exploration in the function approximation setting.

Submitted 25 October, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

Comments: 44 pages. We have revised the linear MDP assumption and fixed a bug in our previous proofs

arXiv:1912.11899 [pdf, other]

Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem

Authors: Hesameddin Mohammadi, Armin Zare, Mahdi Soltanolkotabi, Mihailo R. Jovanović

Abstract: Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this paper, we take a step towards demystifying the performance and efficiency of such methods by focusing on the standard infinite-horizon linear quadratic regulator problem for continuous-time systems with unknown state-space parameters. We establish exponential stability for the ordinary differential equation (ODE) that governs the gradient-flow dynamics over the set of stabilizing feedback gains and show that a similar result holds for the gradient descent method that arises from the forward Euler discretization of the corresponding ODE. We also provide theoretical bounds on the convergence rate and sample complexity of the random search method with two-point gradient estimates. We prove that the required simulation time for achieving $ε$-accuracy in the model-free setup and the total number of function evaluations both scale as $\log \, (1/ε)$.

Submitted 15 March, 2021; v1 submitted 26 December, 2019; originally announced December 2019.

Comments: 39 pages, 4 figures

arXiv:1910.00783 [pdf, other]

Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian: A Lyapunov-based approach

Authors: Dongsheng Ding, Mihailo R. Jovanović

Abstract: For a class of nonsmooth composite optimization problems with linear equality constraints, we utilize a Lyapunov-based approach to establish the global exponential stability of the primal-dual gradient flow dynamics based on the proximal augmented Lagrangian. The result holds when the differentiable part of the objective function is strongly convex with a Lipschitz continuous gradient; the non-differentiable part is proper, lower semi-continuous, and convex; and the matrix in the linear constraint is full row rank. Our quadratic Lyapunov function generalizes recent results from strongly convex problems with either affine equality or inequality constraints to a broader class of composite optimization problems with nonsmooth regularizers and it provides a worst-case lower bound of the exponential decay rate. Finally, we use computational experiments to demonstrate that our convergence rate estimate is less conservative than the existing alternatives.

Submitted 2 October, 2019; originally announced October 2019.

Comments: 6 pages, 3 figures

arXiv:1908.09487 [pdf, other]

doi 10.1146/annurev-control-053018-023843

Stochastic dynamical modeling of turbulent flows

Authors: Armin Zare, Tryphon T. Georgiou, Mihailo R. Jovanović

Abstract: Advanced measurement techniques and high performance computing have made large data sets available for a wide range of turbulent flows that arise in engineering applications. Drawing on this abundance of data, dynamical models can be constructed to reproduce structural and statistical features of turbulent flows, opening the way to the design of effective model-based flow control strategies. This review describes a framework for completing second-order statistics of turbulent flows by models that are based on the Navier-Stokes equations linearized around the turbulent mean velocity. Systems theory and convex optimization are combined to address the inherent uncertainty in the dynamics and the statistics of the flow by seeking a suitable parsimonious correction to the prior linearized model. Specifically, dynamical couplings between states of the linearized model dictate structural constraints on the statistics of flow fluctuations. Thence, colored-in-time stochastic forcing that drives the linearized model is sought to account for and reconcile dynamics with available data (i.e., partially known second order statistics). The number of dynamical degrees of freedom that are directly affected by stochastic excitation is minimized as a measure of model parsimony. The spectral content of the resulting colored-in-time stochastic contribution can alternatively be seen to arise from a low-rank structural perturbation of the linearized dynamical generator, pointing to suitable dynamical corrections that may account for the absence of the nonlinear interactions in the linearized model.

Submitted 26 August, 2019; originally announced August 2019.

Comments: To appear in the Annual Review of Control, Robotics, and Autonomous Systems

Journal ref: Annu. Rev. Control Robot. Auton. Syst., vol. 3, pp. 195-219, May 2020

arXiv:1908.09043 [pdf, ps, other]

Proximal gradient flow and Douglas-Rachford splitting dynamics: global exponential stability via integral quadratic constraints

Authors: Sepideh Hassan-Moghaddam, Mihailo R. Jovanović

Abstract: Many large-scale and distributed optimization problems can be brought into a composite form in which the objective function is given by the sum of a smooth term and a nonsmooth regularizer. Such problems can be solved via a proximal gradient method and its variants, thereby generalizing gradient descent to a nonsmooth setup. In this paper, we view proximal algorithms as dynamical systems and leverage techniques from control theory to study their global properties. In particular, for problems with strongly convex objective functions, we utilize the theory of integral quadratic constraints to prove the global exponential stability of the equilibrium points of the differential equations that govern the evolution of proximal gradient and Douglas-Rachford splitting flows. In our analysis, we use the fact that these algorithms can be interpreted as variable-metric gradient methods on the suitable envelopes and exploit structural properties of the nonlinear terms that arise from the gradient of the smooth part of the objective function and the proximal operator associated with the nonsmooth regularizer. We also demonstrate that these envelopes can be obtained from the augmented Lagrangian associated with the original nonsmooth problem and establish conditions for global exponential convergence even in the absence of strong convexity.

Submitted 25 June, 2020; v1 submitted 23 August, 2019; originally announced August 2019.

Comments: 8 pages; 1 figure
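
Illustrative note on the entry above: the discrete-time proximal gradient method that the abstract mentions alternates a gradient step on the smooth term with the proximal operator of the nonsmooth regularizer. The sketch below is not the paper's continuous-time analysis. It assumes a hypothetical separable problem, minimize sum_i 0.5 * a[i] * (x[i] - b[i])^2 + lam * sum_i |x[i]|, for which the proximal operator of the l1 term is soft-thresholding and the minimizer is known in closed form. The names `soft_threshold` and `proximal_gradient` and all numeric values are made up for this illustration.

```python
def soft_threshold(v, t):
    """Proximal operator of t * |.| evaluated at the scalar v."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def proximal_gradient(a, b, lam, step, iters=500):
    """Minimize sum_i 0.5*a[i]*(x[i]-b[i])**2 + lam*sum_i |x[i]| from x = 0."""
    x = [0.0] * len(b)
    for _ in range(iters):
        # Gradient step on the smooth part, then prox of the nonsmooth part.
        x = [soft_threshold(xi - step * ai * (xi - bi), step * lam)
             for xi, ai, bi in zip(x, a, b)]
    return x


a = [1.0, 2.0, 4.0]   # curvatures of the smooth term (assumed values)
b = [3.0, 0.2, -1.0]  # unregularized minimizer (assumed values)
lam = 0.5             # l1 regularization weight (assumed value)

# Step size 1/L, where L = max(a) is the Lipschitz constant of the gradient.
x = proximal_gradient(a, b, lam, step=1.0 / max(a))

# For this separable problem the minimizer is soft_threshold(b[i], lam / a[i]).
closed_form = [soft_threshold(bi, lam / ai) for ai, bi in zip(a, b)]
print(x, closed_form)
```

Comparing the iterates with the closed-form minimizer gives a quick sanity check that the step size and the threshold scaling `step * lam` are consistent.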

arXiv:1908.02805 [pdf, other]

Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization

Authors: Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Abstract: We study the policy evaluation problem in multi-agent reinforcement learning where a group of agents, with jointly observed states and private local actions and rewards, collaborate to learn the value function of a given policy via local computation and communication over a connected undirected network. This problem arises in various large-scale multi-agent systems, including power grids, intelligent transportation systems, wireless sensor networks, and multi-agent robotics. When the dimension of state-action space is large, the temporal-difference learning with linear function approximation is widely used. In this paper, we develop a new distributed temporal-difference learning algorithm and quantify its finite-time performance. Our algorithm combines a distributed stochastic primal-dual method with a homotopy-based approach to adaptively adjust the learning rate in order to minimize the mean-square projected Bellman error by taking fresh online samples from a causal on-policy trajectory. We explicitly take into account the Markovian nature of sampling and improve the best-known finite-time error bound from $O(1/\sqrt{T})$ to $O(1/T)$, where $T$ is the total number of iterations.

Submitted 4 November, 2021; v1 submitted 7 August, 2019; originally announced August 2019.

Comments: 29 pages, 4 figures

arXiv:1905.11011 [pdf, other]

Robustness of accelerated first-order algorithms for strongly convex optimization problems

Authors: Hesameddin Mohammadi, Meisam Razaviyayn, Mihailo R. Jovanović

Abstract: We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradient is sought through measurements of a real system or in a distributed computation over a network. Even though the underlying dynamics of first-order algorithms for this class of problems are nonlinear, we establish upper bounds on the mean-squared deviation from the optimal solution that are tight up to constant factors. Our analysis quantifies fundamental trade-offs between noise amplification and convergence rates obtained via any acceleration scheme similar to Nesterov's or heavy-ball methods. To gain additional analytical insight, for strongly convex quadratic problems, we explicitly evaluate the steady-state variance of the optimization variable in terms of the eigenvalues of the Hessian of the objective function. We demonstrate that the entire spectrum of the Hessian, rather than just the extreme eigenvalues, influences the robustness of noisy algorithms. We specialize this result to the problem of distributed averaging over undirected networks and examine the role of network size and topology on the robustness of noisy accelerated algorithms.

Submitted 20 February, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

Comments: 45 pages, 6 figures

arXiv:1902.00045 [pdf, other]

Gaussian Conditional Random Fields for Classification

Authors: Andrija Petrović, Mladen Nikolić, Miloš Jovanović, Boris Delibašić

Abstract: Gaussian conditional random fields (GCRF) are a well-known structured model for continuous outputs that uses multiple unstructured predictors to form its features and at the same time exploits dependence structure among outputs, which is provided by a similarity measure. In this paper, a Gaussian conditional random fields model for structured binary classification (GCRFBC) is proposed. The model is applicable to classification problems with undirected graphs, intractable for standard classification CRFs. The model representation of GCRFBC is extended by latent variables which yield some appealing properties. Thanks to the GCRF latent structure, the model becomes tractable, efficient and open to improvements previously applied to GCRF regression models. In addition, the model allows for reduction of noise that might appear if structures were defined directly between discrete outputs. Additionally, two different forms of the algorithm are presented: GCRFBCb (GCRFBC - Bayesian) and GCRFBCnb (GCRFBC - non Bayesian). The extended method of local variational approximation of sigmoid function is used for solving empirical Bayes in Bayesian GCRFBCb variant, whereas MAP value of latent variables is the basis for learning and inference in the GCRFBCnb variant. The inference in GCRFBCb is solved by Newton-Cotes formulas for one-dimensional integration. Both models are evaluated on synthetic data and real-world data. It was shown that both models achieve better prediction performance than unstructured predictors. Furthermore, computational and memory complexity is evaluated. Advantages and disadvantages of the proposed GCRFBCb and GCRFBCnb are discussed in detail.

Submitted 31 January, 2019; originally announced February 2019.

Comments: Draft paper without experimental evaluation

arXiv:1807.01739 [pdf, other]

doi 10.1109/TAC.2019.2948268

Proximal algorithms for large-scale statistical modeling and sensor/actuator selection

Authors: Armin Zare, Hesameddin Mohammadi, Neil K. Dhingra, Tryphon T. Georgiou, Mihailo R. Jovanović

Abstract: Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally select a subset of available sensors and actuators for control purposes. To address modeling and control of large-scale systems we develop a unified algorithmic framework using proximal methods. Our customized algorithms exploit problem structure and allow handling statistical modeling, as well as sensor and actuator selection, for substantially larger scales than what is amenable to current general-purpose solvers. We establish linear convergence of the proximal gradient algorithm, draw contrast between the proposed proximal algorithms and alternating direction method of multipliers, and provide examples that illustrate the merits and effectiveness of our framework.

Submitted 26 December, 2019; v1 submitted 4 July, 2018; originally announced July 2018.

Comments: To appear in IEEE Trans. Automat. Control

arXiv:1803.10705 [pdf, other]

Semi-supervised learning for structured regression on partially observed attributed graphs

Authors: Jelena Stojanovic, Milos Jovanovic, Djordje Gligorijevic, Zoran Obradovic

Abstract: Conditional probabilistic graphical models provide a powerful framework for structured regression in spatio-temporal datasets with complex correlation patterns. However, in real-life applications a large fraction of observations is often missing, which can severely limit the representational power of these models. In this paper we propose a Marginalized Gaussian Conditional Random Fields (m-GCRF) structured regression model for dealing with missing labels in partially observed temporal attributed graphs. This method is aimed at learning with both labeled and unlabeled parts and effectively predicting future values in a graph. The method is even capable of learning from nodes for which the response variable is never observed in history, which poses problems for many state-of-the-art models that can handle missing data. The proposed model is characterized for various missingness mechanisms on 500 synthetic graphs. The benefits of the new method are also demonstrated on a challenging application for predicting precipitation based on partial observations of climate variables in a temporal graph that spans the entire continental US. We also show that the method can be useful for optimizing the costs of data collection in climate applications via active reduction of the number of weather stations to consider. In experiments on these real-world and synthetic datasets we show that the proposed model is consistently more accurate than alternative semi-supervised structured models, as well as models that either use imputation to deal with missing values or simply ignore them altogether.

Submitted 28 March, 2018; originally announced March 2018.

Comments: Proceedings of the 2015 SIAM International Conference on Data Mining (SDM 2015) Vancouver, Canada, April 30 - May 02, 2015

arXiv:1712.10128 [pdf, other]

doi 10.1109/TCNS.2018.2820499

Structured decentralized control of positive systems with applications to combination drug therapy and leader selection in directed networks

Authors: Neil K. Dhingra, Marcello Colombino, Mihailo R. Jovanović

Abstract: We study a class of structured optimal control problems in which the main diagonal of the dynamic matrix is a linear function of the design variable. While such problems are in general challenging and nonconvex, for positive systems we prove convexity of the $H_2$ and $H_\infty$ optimal control formulations which allow for arbitrary convex constraints and regularization of the control input. Moreover, we establish differentiability of the $H_\infty$ norm when the graph associated with the dynamical generator is weakly connected and develop a customized algorithm for computing the optimal solution even in the absence of differentiability. We apply our results to the problems of leader selection in directed consensus networks and combination drug therapy for HIV treatment. In the context of leader selection, we address the combinatorial challenge by deriving upper and lower bounds on optimal performance. For combination drug therapy, we develop a customized subgradient method for efficient treatment of diseases whose mutation patterns are not connected.

Submitted 4 March, 2018; v1 submitted 29 December, 2017; originally announced December 2017.

Comments: 11 pages, 7 figures

Journal ref: IEEE Trans. Control Netw. Syst., vol. 6, no. 1, pp. 352-362, March 2019

arXiv:1709.01610 [pdf, other]

A second order primal-dual method for nonsmooth convex composite optimization

Authors: Neil K. Dhingra, Sei Zhen Khong, Mihailo R. Jovanović

Abstract: We develop a second order primal-dual method for optimization problems in which the objective function is given by the sum of a strongly convex twice differentiable term and a possibly nondifferentiable convex regularizer. After introducing an auxiliary variable, we utilize the proximal operator of the nonsmooth regularizer to transform the associated augmented Lagrangian into a function that is once, but not twice, continuously differentiable. The saddle point of this function corresponds to the solution of the original optimization problem. We employ a generalization of the Hessian to define second order updates on this function and prove global exponential stability of the corresponding differential inclusion. Furthermore, we develop a globally convergent customized algorithm that utilizes the primal-dual augmented Lagrangian as a merit function. We show that the search direction can be computed efficiently and prove quadratic/superlinear asymptotic convergence. We use the $\ell_1$-regularized model predictive control problem and the problem of designing a distributed controller for a spatially-invariant system to demonstrate the merits and the effectiveness of our method.

Submitted 27 August, 2020; v1 submitted 5 September, 2017; originally announced September 2017.

Comments: 32 pages, 8 figures

arXiv:1411.7785 [pdf, other]

doi 10.1109/WIOPT.2015.7151124

Performance laws of large heterogeneous cellular networks

Authors: Bartlomiej Blaszczyszyn, Miodrag Jovanovic, Mohamed Kadhem Karray

Abstract: We propose a model for heterogeneous cellular networks assuming a space-time Poisson process of call arrivals, independently marked by data volumes, and served by different types of base stations (having different transmission powers) represented by the superposition of independent Poisson processes on the plane. Each station applies a processor sharing policy to serve users arriving in its vicinity, modeled by the Voronoi cell perturbed by some random signal propagation effects (shadowing). Users' peak service rates depend on their signal-to-interference-and-noise ratios (SINR) with respect to the serving station. The mutual dependence of the cells (due to the extra-cell interference) is captured via a system of cell-load equations impacting the spatial distribution of the SINR. We use this model to study in a semi-analytic way (involving only static simulations, with the temporal evolution handled by the queuing theoretic results) network performance metrics (cell loads, mean number of users) and the quality of service perceived by the users (mean throughput) served by different types of base stations. Our goal is to identify macroscopic laws regarding these performance metrics, involving averaging both over time and the network geometry. The revealed laws are validated against real field measurements in an operational network.

Submitted 19 March, 2015; v1 submitted 28 November, 2014; originally announced November 2014.

arXiv:1307.8409 [pdf, other]

How user throughput depends on the traffic demand in large cellular networks

Authors: Bartlomiej Blaszczyszyn, Miodrag Jovanovic, Mohamed Kadhem Karray

Abstract: Little's law allows one to express the mean user throughput in any region of the network as the ratio of the mean traffic demand to the steady-state mean number of users in this region. Corresponding statistics are usually collected in operational networks for each cell. Using ergodic arguments and Palm theoretic formalism, we show that the global mean user throughput in the network is equal to the ratio of these two means in the steady state of the "typical cell". Here, both means account for double averaging, over time and network geometry, and can be related to the per-surface traffic demand, base-station density and the spatial distribution of the SINR. The latter accounts for network irregularities, shadowing and idling cells via cell-load equations. We validate our approach by comparing analytical and simulation results for the Poisson network model with real-network cell measurements.

Submitted 24 March, 2014; v1 submitted 31 July, 2013; originally announced July 2013.

Journal ref: WiOpt - SpaSWiN (2014)

arXiv:1304.5034 [pdf, other]

Quality of Real-Time Streaming in Wireless Cellular Networks - Stochastic Modeling and Analysis

Authors: Bartlomiej Blaszczyszyn, Miodrag Jovanovic, Mohamed Kadhem Karray

Abstract: We present a new stochastic service model with capacity sharing and interruptions, appropriate for the evaluation of the quality of real-time streaming (RTS), such as mobile TV, in wireless cellular networks. The general model takes into account a multi-class Markovian process of call arrivals (to capture different radio channel conditions, requested streaming bit-rates and durations) and allows for a general resource allocation policy specifying which users are temporarily denied the requested fixed streaming bit-rates (put in outage) due to resource constraints. We give expressions for several important performance characteristics of the model, including mean time spent in outage and mean number of outage incidents for a typical user of a given class. These expressions involve only stationary probabilities of the (free) traffic demand process, which is a vector of independent Poisson random variables describing the number of users of different classes. In order to analyze RTS in 3GPP Long Term Evolution (LTE) cellular networks, we specify our general model assuming orthogonal user channels with the peak bit-rates close to the theoretical Shannon bound in the additive white Gaussian noise (AWGN) channel, which leads to the resource constraints in a multi-rate linear form. In this setting we consider a natural class of least-effort-served-first resource allocation policies, for which the characteristics of the model can be further evaluated using Fourier analysis of Poisson variables.
Within this class we identify and evaluate an optimal and a fair policy, the latter being suggested by LTE implementations. We also propose some intermediate policies, which make it possible to address the optimality/fairness tradeoff caused by unequal user radio-channel conditions. Our results can be used for the evaluation of the quality of RTS in LTE networks and dimensioning of these networks.

Submitted 4 March, 2014; v1 submitted 18 April, 2013; originally announced April 2013.

Comments: (06/2012)

arXiv:1302.0450 [pdf, other]

doi 10.1109/TAC.2014.2314223

Algorithms for leader selection in stochastically forced consensus networks

Authors: Fu Lin, Makan Fardad, Mihailo R. Jovanović

Abstract: We are interested in assigning a pre-specified number of nodes as leaders in order to minimize the mean-square deviation from consensus in stochastically forced networks. This problem arises in several applications including control of vehicular formations and localization in sensor networks. For networks with leaders subject to noise, we show that the Boolean constraints (a node is either a leader or it is not) are the only source of nonconvexity. By relaxing these constraints to their convex hull we obtain a lower bound on the global optimal value. We also use a simple but efficient greedy algorithm to identify leaders and to compute an upper bound. For networks with leaders that perfectly follow their desired trajectories, we identify an additional source of nonconvexity in the form of a rank constraint. Removal of the rank constraint and relaxation of the Boolean constraints yields a semidefinite program for which we develop a customized algorithm well-suited for large networks. Several examples ranging from regular lattices to random graphs are provided to illustrate the effectiveness of the developed algorithms.

Submitted 29 May, 2013; v1 submitted 2 February, 2013; originally announced February 2013.

Comments: Submitted to IEEE Transactions on Automatic Control

Journal ref: IEEE Trans. Automat. Control (2014), vol. 59, no. 7, pp. 1789-1802

arXiv:1112.4113 [pdf, other]

doi 10.1109/TAC.2011.2181790

Optimal Control of Vehicular Formations with Nearest Neighbor Interactions

Authors: Fu Lin, Makan Fardad, Mihailo R. Jovanović

Abstract: We consider the design of optimal localized feedback gains for one-dimensional formations in which vehicles only use information from their immediate neighbors. The control objective is to enhance coherence of the formation by making it behave like a rigid lattice. For the single-integrator model with symmetric gains, we establish convexity, implying that the globally optimal controller can be computed efficiently. We also identify a class of convex problems for double-integrators by restricting the controller to symmetric position and uniform diagonal velocity gains. To obtain the optimal non-symmetric gains for both the single- and the double-integrator models, we solve a parameterized family of optimal control problems ranging from an easily solvable problem to the problem of interest as the underlying parameter increases. When this parameter is kept small, we employ perturbation analysis to decouple the matrix equations that result from the optimality conditions, thereby rendering the unique optimal feedback gain. This solution is used to initialize a homotopy-based Newton's method to find the optimal localized gain. To investigate the performance of localized controllers, we examine how the coherence of large-scale stochastically forced formations scales with the number of vehicles. We establish several explicit scaling relationships and show that the best performance is achieved by a localized controller that is both non-symmetric and spatially-varying.

Submitted 17 December, 2011; originally announced December 2011.

Comments: To appear in IEEE Trans. Automat. Control; 15 pages, 10 figures

Journal ref: IEEE Trans. Automat. Control (2012), vol. 57, no. 9, pp. 2203-2218

arXiv:1112.4011 [pdf, other]

doi 10.1109/TAC.2012.2202052

Coherence in Large-Scale Networks: Dimension-Dependent Limitations of Local Feedback

Authors: Bassam Bamieh, Mihailo R. Jovanović, Partha Mitra, Stacy Patterson

Abstract: We consider distributed consensus and vehicular formation control problems. Specifically, we address the question of whether local feedback is sufficient to maintain coherence in large-scale networks subject to stochastic disturbances. We define macroscopic performance measures which are global quantities that capture the notion of coherence: a notion of global order that quantifies how closely the formation resembles a solid object. We consider how these measures scale asymptotically with network size in the topologies of regular lattices in 1, 2 and higher dimensions, with vehicular platoons corresponding to the 1-dimensional case. A common phenomenon appears where a higher spatial dimension implies a more favorable scaling of coherence measures, with a dimension of 3 being necessary to achieve coherence in consensus and vehicular formations under certain conditions. In particular, we show that it is impossible to have large coherent one-dimensional vehicular platoons with only local feedback. We analyze these effects in terms of the underlying energetic modes of motion, showing that they take the form of large temporal and spatial scales resulting in an accordion-like motion of formations. A conclusion can be drawn that in low spatial dimensions, local feedback is unable to regulate large-scale disturbances, but it can in higher spatial dimensions. This phenomenon is distinct from, and unrelated to, string instability issues which are commonly encountered in control problems for automated highways.

Submitted 16 December, 2011; originally announced December 2011.

Comments: To appear in IEEE Trans. Automat. Control; 15 pages, 2 figures

Journal ref: IEEE Trans. Automat. Control (2012), vol. 57, no. 9, pp. 2235-2249

Showing 1–39 of 39 results for author: Jovanović, M