| |
Last updated on December 4, 2020. This conference program is tentative and subject to change
Technical Program for Wednesday December 16, 2020
|
WeA01 Tutorial Session, Coordinated Universal Time (UTC) |
Add to My Program |
Direct Transcription for Dynamic Optimization: A Tutorial and Case Study on
Dual-Patient Ventilation During the COVID-19 Pandemic |
|
|
Chair: Kerrigan, Eric C. | Imperial College London |
Co-Chair: Solis-Lemus, Jose A. | School of Biomedical Engineering and Imaging Sciences, King’s College London |
Organizer: Kerrigan, Eric C. | Imperial College London |
|
13:00-13:15, Paper WeA01.1 | Add to My Program |
Direct Transcription for Dynamic Optimization: A Tutorial with a Case Study on Dual-Patient Ventilation During the COVID-19 Pandemic (I) |
|
Kerrigan, Eric C. | Imperial College London |
Nie, Yuanbo | Imperial College London |
Faqir, Omar James | Imperial College London |
Kennedy, Caroline H. | Evelina Children’s Hospital, Guy’s and St Thomas’ NHS Foundation |
Niederer, Steven A. | School of Biomedical Engineering and Imaging Sciences, King’s Co |
Solis-Lemus, Jose A. | School of Biomedical Engineering and Imaging Sciences, King’s Co |
Vincent, Peter | Imperial College London |
Williams, Steven E. | School of Biomedical Engineering and Imaging Sciences, King’s Co |
Keywords: Optimal control, Estimation, Healthcare and medical systems
Abstract: A variety of optimal control, estimation, system identification and design problems can be formulated as functional optimization problems with differential equality and inequality constraints. Since these problems are infinite-dimensional and often do not have a known analytical solution, one has to resort to numerical methods to compute an approximate solution. This paper uses a unifying notation to outline some of the techniques used in the transcription step of simultaneous direct methods (which discretize-then-optimize) for solving continuous-time dynamic optimization problems. We focus on collocation, integrated residual and Runge-Kutta schemes. These transcription methods are then applied to a simulation case study to answer a question that arose during the COVID-19 pandemic, namely: If there are not enough ventilators, is it possible to ventilate more than one patient on a single ventilator? The results suggest that it is possible, in principle, to estimate individual patient parameters sufficiently accurately, using a relatively small number of flow rate measurements, without needing to disconnect a patient from the system or needing more than one flow rate sensor. We also show that it is possible to ensure that two different patients can indeed receive their desired tidal volume, by modifying the resistance experienced by the air flow to each patient and controlling the ventilator pressure.
|
|
13:15-13:40, Paper WeA01.2 | Add to My Program |
Dual-Patient Ventilation (I) |
|
Solis-Lemus, Jose A. | School of Biomedical Engineering and Imaging Sciences, King’s Co |
Kennedy, Caroline H. | Evelina Children’s Hospital, Guy’s and St Thomas’ NHS Foundation |
Niederer, Steven A. | School of Biomedical Engineering and Imaging Sciences, King’s Co |
Vincent, Peter | Imperial College London |
Williams, Steven E. | School of Biomedical Engineering and Imaging Sciences, King’s Co |
Keywords: Healthcare and medical systems, Biomedical, Human-in-the-loop control
Abstract: Due to concerns at the availability of ventilators to care for the more unwell patients during the COVID-19 pandemic, we consider the question: In an extreme circumstance, where no other options are available, can two patients with different lung physiologies be successfully ventilated on a single ventilator? We present a model that allows one to do system identification, estimation and control of a dual-patient ventilator system. This case study is used throughout other talks in this session.
|
|
13:40-14:20, Paper WeA01.3 | Add to My Program |
Direct Transcription of Dynamic Optimization Problems (I) |
|
Nie, Yuanbo | Imperial College London |
Keywords: Numerical algorithms, Optimal control, Estimation
Abstract: We give an outline of direct methods for discretizing continuous-time dynamic optimization problems so that one can compute approximate solutions by solving a sequence of finite-dimensional optimization problems. We focus on direct collocation, integrated residual, Runge-Kutta and shooting methods. Error analysis and mesh refinement is discussed as a key part of the solution process. The dual-patient ventilator case study is used for numerical simulations.
|
|
14:20-14:45, Paper WeA01.4 | Add to My Program |
Numerical Methods for Structured Optimization Problems (I) |
|
Faqir, Omar James | Imperial College London |
Keywords: Optimization algorithms, Optimal control, Optimization
Abstract: The optimization problems arising from transcribing dynamic optimization problems are highly structured. We outline some numerical methods that could be used to efficiently exploit this structure when computing the solution to the optimization problem.The dual-patient ventilator case study is used for numerical simulations.
|
|
14:45-15:00, Paper WeA01.5 | Add to My Program |
The Past, Present and Future of Dynamic Optimization (I) |
|
Kerrigan, Eric C. | Imperial College London |
Keywords: Optimal control, Identification, Estimation
Abstract: The key points discussed in this tutorial session will be summarised. We will also briefly discuss topics not discussed in this session and outline areas for research.
|
|
WeA02 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Resource Allocation |
|
|
Chair: Kofman, Daniel | Telecom Paris |
Co-Chair: Eisen, Mark | Intel Corporation |
|
13:00-13:15, Paper WeA02.1 | Add to My Program |
Learning Constrained Resource Allocation Policies in Wireless Control Systems |
|
Lima, Vinicius | University of Pennsylvania |
Eisen, Mark | Intel Corporation |
Ribeiro, Alejandro | University of Pennsylvania |
Keywords: Networked control systems, Machine learning, Communication networks
Abstract: Emerging applications in IoT systems employ wireless communication networks to exchange data between spatially distributed components of a control system. As wireless networks are noisy and subject to packet losses - which might impact the operation of the control system - proper distribution of communication resources among components of the wireless control system sharing the communication network is essential to maintain the system operating reliably. Here, in particular, we study settings in which the decision maker must meet additional constraints on the control system while distributing communication resources. The existence of constraints and the infinite dimensionality of the problem make the resource allocation problem challenging. To reduce the dimensionality of the problem, we parameterize the resource allocation policy in terms of neural networks - high capability approximators - and leverage reinforcement learning techniques to design policies that do not require knowledge of plant dynamics or communication models. We further reformulate the resource allocation problem in the dual domain to handle the constraints, which leads naturally to a primal-dual policy gradient algorithm that alternates between updating the policy parameters via reinforcement learning iterations and updating a dual variable that enforces constraint satisfaction. We conclude the paper with numerical simulations that show the strong performance of the learned allocation policies against baseline resource allocation solutions.
|
|
13:15-13:30, Paper WeA02.2 | Add to My Program |
Distributed Time-Varying Resource Allocation Optimization Based on Finite-Time Consensus Approach |
|
Wang, Bo | Beijing Institute of Technology |
Fei, Qing | Beijjing Institute of Technology |
Wu, Qinghe | Beijing Inst. of Tech |
Keywords: Control of networks, Decentralized control, Optimal control
Abstract: In this paper, we study the distributed algorithm to solve the optimal resource allocation problem with timevarying cost functions and resources for continuous-time multiagent systems. Subject to a coupled linear equality constraint, all agents aim to minimize the sum of all local cost functions known only to each agent. Since the cost functions and resources are time varying, the optimal solutions are trajectories changing over time rather than some constants. By combining the predictioncorrection method with the finite-time nonsmooth consensus idea, we propose a distributed continuous-time algorithm that ensures that the states of all agents will track the corresponding timevarying optimal trajectories with vanishing error when the cost functions have identical Hessians. Here each agent exchanges only local information through a fixed connected undirected graph in a distributed manner. We perform a simulation for a grid-connected battery energy storage system to illustrate the effectiveness of the proposed distributed continuous-time optimal resource allocation algorithm.
|
|
13:30-13:45, Paper WeA02.3 | Add to My Program |
Toward Distributed Optimization for Critical Service Restoration with Distributed Energy Resources |
|
Liang, Junkai | North Carolina State University |
Tang, Wenyuan | North Carolina State University |
Lu, Xiaonan | Temple University |
Keywords: Optimization algorithms, Smart grid, Power systems
Abstract: Increased penetration of distributed energy resources, especially solar at a community level, can provide additional resiliency and improve the reliability of the distribution system. By altering the statuses of switches, dynamic microgrids that feature adjustable topologies and boundaries can use distributed generation to serve one or more critical service facilities within them. In this paper, the critical service restoration problem is first formulated in a distributed manner. Moreover, a heuristic distributed optimization approach is proposed, whose optimality gap is shown to be bounded. The case study compares the proposed algorithm with the centralized optimization approach.
|
|
13:45-14:00, Paper WeA02.4 | Add to My Program |
Efficient Distributed Solutions for Sharing Energy Resources at the Local Level: A Cooperative Game Approach |
|
Kiedanski, Diego | Telecom ParisTech |
Busic, Ana | Inria |
Kofman, Daniel | Telecom Paris |
Orda, Ariel | Technion |
Keywords: Game theory, Smart grid, Decentralized control
Abstract: Local energy generation as well as local energy storage represent key opportunities for energy transition. Nevertheless, their massive deployment is being delayed mainly due to cost reasons. Sharing resources at the local level enables not only reducing these costs significantly, but also to further optimize the cost of the energy exchanged with providers external to the local community. A key question that arises while sharing resources is how to distribute the obtained benefits among the various local players that cooperate. In this paper we propose a cooperative game model, where the players are the holders of energy resources (generation and storage); they cooperate in order to reduce their individual electricity costs. We prove that the core of the game is non-empty; i.e., the proposed cooperative game has a stable solution (distribution of the payoffs among the players) for the case where all players participate in a unique community, and no strict subset of players can obtain a better gain by leaving the community. We propose a formulation of this game, based on the theory of linear production games, which lead us to the two main contributions of this paper. First, we propose an efficient (with linear complexity) centralized algorithm for finding a stable payoff. Second, we provide an efficient distributed algorithm that computes an allocation in the core of the game without any requirement for the players to share any private information. The distributed algorithm requires the exchange of intermediate solutions among players. The topology of the network that enables these exchanges is closely related to the performance of the distributed algorithm. We show, by way of simulations, which are the best topologies for these communication graphs.
|
|
14:00-14:15, Paper WeA02.5 | Add to My Program |
Social Resource Allocation in a Mobility System with Connected and Automated Vehicles: A Mechanism Design Problem |
|
Chremos, Ioannis Vasileios | University of Delaware |
Malikopoulos, Andreas A. | University of Delaware |
Keywords: Game theory, Transportation networks
Abstract: In this paper, we investigate the social resource allocation in an emerging mobility system consisting of connected and automated vehicles (CAVs) using mechanism design. CAVs provide the most intriguing opportunity for enabling travelers to monitor mobility system conditions efficiently and make better decisions. However, this new reality will influence travelers' tendency-of-travel and might give rise to rebound effects, e.g., increased-vehicle-miles traveled. To tackle this phenomenon, we propose a mechanism design formulation that provides an efficient social resource allocation of travel time for all travelers. Our focus is on the socio-technical aspect of the problem, i.e., by designing appropriate socio-economic incentives, we seek to prevent potential rebound effects. In particular, we propose an economically inspired mechanism to influence the impact of the travelers' decision-making on the well-being of an emerging mobility system.
|
|
14:15-14:30, Paper WeA02.6 | Add to My Program |
On the Stability and Fairness of Submodular Allocations |
|
Kyriakis, Panagiotis | University of Southern California |
Pequito, Sergio | Rensselaer Polytechnic Institute |
Bogdan, Paul | University of Southern California |
Keywords: Optimization, Optimization algorithms
Abstract: In this paper, we focus on the allocation of goods (or resources) among competitive agents under submodular utility functions. We introduce two quantities, namely stability and fairness, to characterize an allocation. Our definition of stability is motivated by the famous stable marriage problem and measures the incentive that the agents have to deviate from the allocation. Furthermore, we quantify the fairness using the maximin fairness ratio which stems from the maximin fair shares. These shares represent an ideal scenario wherein each agent is given the option to maximize her worst possible outcome. Following that, we design an efficient greedy mbox{round-robin} algorithm for the submodular allocations problem. We prove that our algorithm gives a stable allocation and we extract a curvature-dependant maximin fairness ratio. To demonstrate the empirical performance of our algorithm, we conduct experimental studies and observe that the empirical fairness ratio is higher than 95% of the solution given by a brute-force approach, which outperforms the theoretical guarantee.
|
|
14:30-14:45, Paper WeA02.7 | Add to My Program |
A Cross-Layer Optimal Co-Design of Control and Networking in Time-Sensitive Cyber-Physical Systems |
|
Mamduhi, Mohammadhossein | Kth - Tum |
Maity, Dipankar | Georgia Institute of Technology |
Baras, John S. | University of Maryland |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Networked control systems, Optimization, Control over communications
Abstract: In the design of cyber-physical systems (CPS) where multiple heterogeneous physical systems are coupled via a communication network, a key aspect is to study how network services are distributed among the users. To answer this adequately, we consider the coupling parameters between the control and network layers, and also the time-sensitive limitations and tolerances of the individual physical systems and the network. In this article, we first describe a cross-layer model for CPS wherein multiple stochastic linear processes are coupled via a shared network that provides a diverse range of cost-prone and capacity-limited services with distinct latency characteristics. Service prices are given such that low latency services incur higher communication cost, and prices remain fixed over a constant period of time but will be adjusted by the network for the future time periods. Physical systems decide to use specific services over each time interval depending on the service prices and their own time sensitivity requirements. Considering the service availability, the network coordinates resource allocation such that physical systems are serviced the closest to their preferences. Performance of individual systems are measured by an expected quadratic cost and we formulate a social optimization problem subject to time-sensitive requirements of the physical systems and the network constraints. From the formulated optimization problem, we derive the joint optimal time-sensitive control and service allocation policies.
|
|
14:45-15:00, Paper WeA02.8 | Add to My Program |
Particle Based Optimization for Predictive Energy Efficient Data Center Management |
|
Carnerero, A. Daniel | University of Seville |
Ramirez, Daniel R. | Univ. of Sevilla |
Limon, Daniel | Universidad De Sevilla |
Alamo, Teodoro | Universidad De Sevilla |
Keywords: Control applications, Queueing systems, Predictive control for nonlinear systems
Abstract: Data centers are energy-hungry infrastructures that provide cloud computing services. The growing number of data centers in use has led to a drastic increment of the energy consumption associated to these facilities, causing environmental concerns. For that reason, efficient management strategies are needed in order to reduce the energy consumption while the quality of service is kept. This paper presents a unified management approach for the thermal and workload distribution problem in data centers, shaped as a Model Predictive Control problem. The corresponding optimization problem is intractable for conventional solvers because the model is based on multiple queues and the decision variables are a mix of integer and real valued ones. A highly parallelizable particle based optimization algorithm is proposed to solve the optimization problem. Numerical simulations are provided in order to illustrate the effectiveness of the strategy.
|
|
WeA03 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Fault Detection and Handling |
|
|
Chair: Niemann, Henrik | Technical Univ. of Denmark |
Co-Chair: Silvestre, Carlos | University of Macau |
|
13:00-13:15, Paper WeA03.1 | Add to My Program |
Fault Detection and Isolation for Linear Structured Systems |
|
Jia, Jiajia | University of Groningen |
Trentelman, Harry L. | Univ. of Groningen |
Camlibel, M. Kanat | University of Groningen |
Keywords: Fault detection, Fault diagnosis, Linear systems
Abstract: This paper deals with the fault detection and isolation (FDI) problem for linear structured systems in which the system matrices are given by zero/nonzero/arbitrary pattern matrices. In this paper, we follow a geometric approach to verify solvability of the FDI problem for such systems. To do so, we first develop a necessary and sufficient condition under which the FDI problem for a given particular linear time-invariant system is solvable. Next, we establish a necessary condition for solvability of the FDI problem for linear structured systems. In addition, we develop a sufficient algebraic condition for solvability of the FDI problem in terms of a rank test on an associated pattern matrix. To illustrate that this condition is not necessary, we provide a counterexample in which the FDI problem is solvable while the condition is not satisfied. Finally, we develop a graph-theoretic condition for the full rank property of a given pattern matrix, which leads to a graph-theoretic condition for solvability of the FDI problem.
|
|
13:15-13:30, Paper WeA03.2 | Add to My Program |
Fault Isolation in MIMO Systems Based on Active Decoupling |
|
Niemann, Henrik | Technical Univ. of Denmark |
Stoustrup, Jakob | Aalborg University |
Poulsen, Niels Kjølstad | Tech. Univ. of Denmark |
Keywords: Fault diagnosis, Control system architecture, Linear systems
Abstract: A decoupling approach for fault isolation in MIMO systems is presented in this paper. The fault isolation approach is based on a closed-loop concept, where the feedback controller is an integrated part of the set-up. The YJBK-parameterization (after Youla, Jabr, Bongiorno, and Kucera) for controllers is introduced. This allows the feedback controller to be modified by changing the YJBK matrix transfer function without changing the nominal feedback controller. Modification of the feedback controller via the YJBK transfer matrix causes the output residual response to be changed in the case of parametric faults and unchanged in the fault-free case. This facilitates fault detection. In the isolation case, the controller is modified in a way such that the residual response from the closed-loop system is independent of the modification for one specific parametric fault and else it depends on the parametric faults. This allows for any specific fault and eventually for all faults to be isolated.
|
|
13:30-13:45, Paper WeA03.3 | Add to My Program |
A Novel Online Active Fault Diagnosis Method Based on Invariant Sets |
|
YANG, Songlin | Graduate School at Shenzhen, Tsinghua University |
Xu, Feng | Tsinghua University |
Wang, Xueqian | Tsinghua University |
Liang, Bin | Tsinghua University |
Keywords: Fault diagnosis, Linear systems
Abstract: This letter proposes a novel online active fault diagnosis (AFD) method based on invariant sets for linear timeinvariant (LTI) systems with bounded disturbances and noises. In general, the system has the healthy mode and several faulty modes. During system operation, with respect to the healthy and faulty modes, the corresponding healthy and faulty output estimation sets are computed, respectively. Thus, the task of AFD can be converted to separate healthy and faulty output estimation sets, which can consequently distinguish the current system mode and implement AFD. The existing AFD methods implement the objective by designing an input sequence to separate the healthy and faulty output estimation sets. However, it is difficult to use these methods to simultaneously combine AFD and fault-tolerant control. Thus, we propose a new AFD method in this letter to online design inputs step by step, which has potential to overcome some weakness of the existing AFD methods. Particularly, we implement this AFD objective by online designing inputs at each step to maximize the distances of these healthy and faulty output estimation sets. Moreover, this online input design problem can finally be transformed into the resolution of two mixed integer programming (MIP) problems with a similar structure. At the end of this letter, a quadruple-tank benchmark system is used to illustrate the effectiveness of the proposed method by comparing with an existing set-based AFD method.
|
|
13:45-14:00, Paper WeA03.4 | Add to My Program |
Detection and Isolation of Actuator Faults and Collisions for a Flexible Robot Arm |
|
Gaz, Claudio Roberto | Sapienza - Università Di Roma |
Cristofaro, Andrea | Sapienza University of Rome |
De Luca, Alessandro | Sapienza Università Di Roma |
Keywords: Fault detection, Robotics, Flexible structures
Abstract: This paper presents a unified approach to detection and isolation of both actuator faults and unexpected collisions for a two-link robot with a flexible forearm. The proposed approach is sensorless, i.e., no dedicated exteroceptive sensors are considered, and is based on the design of residuals to be used as monitoring filters. The method has been tested by extensive simulations on the Flexarm robot used as case study. The reported results show the efficacy in detecting and isolating faults in the actuators or collisions on the robot links.
|
|
14:00-14:15, Paper WeA03.5 | Add to My Program |
Structural Methods for Distributed Fault Diagnosis of Large-Scale Systems |
|
Jung, Daniel | Linkoping University |
Keywords: Fault diagnosis, Large-scale systems, Control system architecture
Abstract: Structural analysis is a useful tool for fault diagnosability analysis to handle systems that are described by a large set of non-linear differential algebraic equations. Distributed fault diagnosis is an attractive approach for complex systems to reduce computational complexity by partitioning the system into a set of smaller subsystems and perform fault diagnosis of each subsystem. Defining these subsystems requires methods to understand how fault diagnosis properties of each subsystem relates to the properties of the whole system. Another related problem is that large and complex systems are likely to be developed by several companies where each company is developing different subsystems that can be used in different system configurations. In these situations, each subsystem will have limited model information about the other subsystems, which complicates performing structural analysis of the whole system. The main contribution in this work is extending some of the existing results in structural analysis for one system model to a distributed set of connected subsystems. The results show the relationship between structural fault diagnosis properties of the whole system and properties of the set of individual subsystems.
|
|
14:15-14:30, Paper WeA03.6 | Add to My Program |
Fault-Tolerant Tracking Control for Heterogeneous Multi-Agent Systems |
|
Pham, Van Thiem | University of Reims Champagne-Ardenne |
Nguyen, Thi Thanh Quynh | CReSTIC, University of Reims Champange Ardene |
Messai, Nadhir | Université De Reims Champagne-Ardenne |
Manamanni, Noureddine | University of Reims |
Keywords: Fault tolerant systems, Agents-based systems, Communication networks
Abstract: This paper focuses on the fault-tolerant tracking problem in networks of leader-following heterogeneous linear multi-agent systems. Firstly, a fault estimation observer is constructed to estimate simultaneously the actuator faults and the followers' states. Then, a fault-tolerant tracking controller is proposed. The latter is based on a dynamic internal reference for each agent and exploits the estimated faults. Thus, it is at first shown that the fault-tolerant tracking problem can be indirectly solved through the consensus tracking of the defined internal references. Then, a sufficient condition of consensus tracking for the internal references is proposed. Finally, a sufficient and necessary condition is derived for the fault-tolerant tracking control of linear heterogeneous multi-agent systems. The effectiveness of the proposed approach is illustrated with a numerical simulation.
|
|
14:30-14:45, Paper WeA03.7 | Add to My Program |
A General Discrete-Time Method to Achieve Resilience in Consensus Algorithms |
|
Ramos, Guilherme | Department of Electrical and Computer Engineering, Faculty of En |
Silvestre, Daniel | University of Macau |
Silvestre, Carlos | University of Macau |
Keywords: Fault tolerant systems, Fault detection, Agents-based systems
Abstract: In this paper, we approach the problem of a set of network agents reaching resilient consensus in the presence of a subset of attacked nodes. We devise a generalized method, with polynomial time complexity, which receives as input a discrete-time, synchronous-communication consensus algorithm, a dynamic network of agents, and the maximum number of attacked nodes. The distributed algorithm enables each normal node to detect and discard the values of the attacked agents while reaching the consensus of normal agents for the input consensus algorithm. Hence, the proposed method adds an extra layer of resilience to a given discrete-time and synchronous-communication consensus algorithm. Finally, we demonstrate the effectiveness of the method with experimental results, showing some attack circumstances which we can counter, where the state-of-the-art methods fail.
|
|
14:45-15:00, Paper WeA03.8 | Add to My Program |
Model Reference Sliding Mode Fault Tolerant Control of a BWB Aircraft |
|
Vile, Liam | University of Exeter |
Alwi, Halim | University of Exeter |
Edwards, Christopher | University of Exeter |
Keywords: Fault tolerant systems, Variable-structure/sliding-mode control
Abstract: In the event of actuator faults and failures the total resources available to the controller can be severely reduced. In such cases it is important for the controller to be reconfigured to avoid saturating and overloading the remaining healthy actuators. Actuator saturation can lead to loss of system performance and in some cases, even instability. This paper proposes a method of reducing the closed-loop performance requirements through a scheduled reference model, with aim to minimise any saturation in the closed-loop system in the event of serious faults. The reference model is then tracked with a robust sliding mode control allocation scheme to ensure asymptotic tracking despite actuator faults/failures and uncertainty in their detection/estimation.
|
|
WeA04 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Uncertain Systems |
|
|
Chair: Weyer, Erik | Univ. of Melbourne |
Co-Chair: Putot, Sylvie | CNRS & Ecole Polytechnique |
|
13:00-13:15, Paper WeA04.1 | Add to My Program |
An MCMC Method for Uncertainty Set Generation Via Operator-Theoretic Metrics |
|
Srinivasan, Anand | Massachusetts Institute of Technology |
Takeishi, Naoya | RIKEN |
Keywords: Uncertain systems, Statistical learning, Nonlinear systems identification
Abstract: Model uncertainty sets are required in many robust optimization problems, such as robust control and prediction with uncertainty, but there is no definite methodology to generate uncertainty sets for nonlinear dynamical systems. In this paper, we propose a method for model uncertainty set generation via Markov chain Monte Carlo. The proposed method samples from distributions over dynamical systems via metrics over transfer operators and is applicable to general nonlinear systems. We adapt Hamiltonian Monte Carlo for sampling high-dimensional transfer operators in a computationally efficient manner. We present numerical examples to validate the proposed method for uncertainty set generation.
|
|
13:15-13:30, Paper WeA04.2 | Add to My Program |
Robust Under-Approximations and Application to Reachability of Non-Linear Control Systems with Disturbances |
|
Goubault, Eric | CEA France |
Putot, Sylvie | CNRS & Ecole Polytechnique |
Keywords: Uncertain systems, Computer-aided control design, Computational methods
Abstract: We describe a set-based approach, relying on mean-value extensions, for computing guaranteed under-approximations of ranges (or images) of continuously differentiable vector-valued functions, including what we call robust ranges, i.e. ranges of functions under adversarial uncertainties. Our method is capable of computing efficiently, at a low computational cost, full n-dimensional subsets of the image of a function. As an application, we show how to compute under-approximations of robust reachable sets of non-linear controlled dynamical systems under time-varying uncertainties, which is central to many verification problems in control.
|
|
13:30-13:45, Paper WeA04.3 | Add to My Program |
A Quadratic Program Based Control Synthesis under Spatiotemporal Constraints and Non-Vanishing Disturbances |
|
Black, Mitchell | University of Michigan |
Garg, Kunal | University of Michigan-Ann Arbor |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Uncertain systems
Abstract: In this paper, we study the effect of non-vanishing disturbances on the stability of fixed-time stable (FxTS) systems. We present a new result on FxTS, which allows a positive term in the time derivative of the Lyapunov function with the aim to model bounded, non-vanishing disturbances in system dynamics. We characterize the neighborhood to which the system trajectories converge, as well as the convergence time. Then, we use the new FxTS result and formulate a quadratic program (QP) that yields control inputs which drive the trajectories of a class of nonlinear, control-affine systems to a goal set in the presence of control input constraints and non-vanishing, bounded disturbances in the system dynamics. We consider an overtaking problem on a highway as a case study, and discuss how to both set up the QP and decide when to start the overtake maneuver in the presence of sensing errors.
|
|
13:45-14:00, Paper WeA04.4 | Add to My Program |
Explorative Probabilistic Planning with Unknown Target Locations |
|
Savvas Sadiq Ali, Farhad Nawaz | University of Illinois at Urbana-Champaign |
Ornik, Melkior | University of Illinois at Urbana-Champaign |
Keywords: Uncertain systems, Autonomous systems, Automata
Abstract: Motion planning in an unknown environment demands synthesis of an optimal control policy that balances between exploration and exploitation. In this paper, we present the environment as a labeled graph where the labels of states are initially unknown, and consider a motion planning objective to fulfill a generalized reach-avoid specification given on these labels in minimum time. By describing the record of visited labels as an automaton, we translate our problem to a Canadian traveler problem on an adapted state space. We propose a strategy that enables the agent to perform its task by exploiting possible a priori knowledge about the labels and the environment and incrementally revealing the environment online. Namely, the agent plans, follows, and replans the optimal path by assigning edge weights that balance between exploration and exploitation, given the current knowledge of the environment. We illustrate our strategy on the setting of an agent operating on a two-dimensional grid environment.
|
|
14:00-14:15, Paper WeA04.5 | Add to My Program |
A Scenario Approach for Robust Optimization Over Uncertain System Identification Models |
|
Wang, Xiaopuwen | University of Melbourne |
Weyer, Erik | Univ. of Melbourne |
Keywords: Uncertain systems, Optimization, Robust control
Abstract: In this paper, we propose a scenario based method for robust optimization over uncertain system identification models. By considering linear regression models with deterministic regressors and assuming that the distribution of the noise acting on the system is known, we give an algorithm for drawing independent samples from the probability distribution which characterizes the uncertainty in the least squares estimate when a finite number of data points are used for parameter estimation. A number of these uncertain model parameters are drawn, and a min-max optimization problem is solved using these models. Under the assumption that the cost function is quasi-convex in the model parameters, we establish probabilistic guarantees on the value of the cost function when it is evaluated at the found scenario solution and the true system parameter. The true system parameter is regarded as a deterministic quantity, and the probabilities are with respect to the observed data from the system and the drawn scenarios. The approach and its theoretical properties are illustrated in a simulation example.
|
|
14:15-14:30, Paper WeA04.6 | Add to My Program |
Comparative Study of Output-Based and Error-Based ADRC Schemes in Application to Buck Converter-Fed DC Motor System |
|
Madonski, Rafal | Jinan University |
Lakomy, Krzysztof | Poznan University of Technology |
Yang, Jun | Southeast University |
Keywords: Uncertain systems, Robust control, Electrical machine control
Abstract: The majority of currently used active disturbance rejection control (ADRC) techniques is expressed in 2DOF output-based form. For relatively high-order systems, this imposes limitations on the conventional ADRC, like the necessity of having certain higher-order terms available for controller synthesis and the lack of direct compatibility with standard industrial control software, which favors simplicity and intuitiveness of feedback error-based designs. That is why, in this work, a comparative study is conducted between the conventional 2DOF output-based ADRC and the recently proposed 1DOF error-based formulation of ADRC. Through a set of hardware experiments on a converter-fed motor testbed, the error-based ADRC is shown to retain the robustness of the 2DOF design, while having a simplified controller, being free from assumptions on availability of particular signals, and with the entire observer-based control structure being represented in a practically appealing, compact form. The obtained results also show that a major design simplification can be achieved with error-based ADRC without sacrificing the performance of the original output-based form.
|
|
14:30-14:45, Paper WeA04.7 | Add to My Program |
Robust Impulsive Observer--Based Stabilization for Uncertain Nonlinear Systems with Sampled--Output |
|
Jaramillo, Oscar David | Center for Research and Advanced Studies of the National Polytec |
Castillo-Toledo, Bernardino | CINVESTAV-GDL, Mexico |
Di Gennaro, Stefano | University of L'Aquila |
Keywords: Uncertain systems, Hybrid systems, Observers for nonlinear systems
Abstract: In this paper, the design of an impulsive observer-based control, for a class of nonlinear systems with time-varying uncertainties, is proposed based on the LMI framework. The nonlinearities under consideration are assumed to satisfy local Lipschitz conditions. The observer uses sampled measurements of the system output. A time-varying Lyapunov function is used to give sufficient conditions for the existence of the impulsive observer-based control. The observer and controller gain can be obtained through feasible solutions of the LMIs proposed.The scheme presented here demonstrates that mathematically and through simulation, the proposed approach properly estimates and stabilizes all states.
|
|
14:45-15:00, Paper WeA04.8 | Add to My Program |
Guaranteed Reachability for Systems with Unknown Dynamics |
|
Ornik, Melkior | University of Illinois at Urbana-Champaign |
Keywords: Uncertain systems, Algebraic/geometric methods, Estimation
Abstract: The problem of computing the reachable set for a given system is a quintessential question in nonlinear control theory. Motivated by prior work on safety-critical online planning, this paper considers an environment where the only available information about system dynamics is that of dynamics at a single point. Limited to such knowledge, we study the problem of describing the set of all states that are guaranteed to be reachable regardless of the unknown true dynamics. We show that such a set can be underapproximated by a reachable set of a related known system whose dynamics at every state depend on the velocity vectors that are available in all control systems consistent with the assumed knowledge. Complementing the theory, we discuss a simple model of an aircraft in distress to verify that such an underapproximation is meaningful in practice.
|
|
WeA05 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Formation Control I |
|
|
Chair: Colombo, Leonardo Jesus | Universidad Autonoma De Madrid |
Co-Chair: Sun, Zhiyong | Eindhoven University of Technology (TU/e) |
|
13:00-13:15, Paper WeA05.1 | Add to My Program |
Distance-Based Formation Control with Bounded Disturbances |
|
Vu, Dung Van | Hanoi University of Science and Technology |
Trinh, Minh Hoang | Hanoi University of Science and Technology (HUST) |
Nguyen, Phuoc | Hanoi University of Science and Technology |
Ahn, Hyo-Sung | Gwangju Institute of Science and Technology (GIST) |
Keywords: Agents-based systems, Distributed control, Cooperative control
Abstract: In this paper, we study distance-based formation control of single- or double-integrator modeled agents with unknown bounded disturbances. We propose distributed control laws combining a component to eliminate the effects of the disturbances and another to achieve the desired distances in the formation. The stability of the system under the proposed control laws is analyzed based on Lyapunov stability theory of nonsmooth systems and Barbalat's lemma. Further, sufficient conditions are given to ensure the desired formation to be achieved. Finally, simulation results are given to support the analysis.
|
|
13:15-13:30, Paper WeA05.2 | Add to My Program |
Event-Based Formation Control of Networked Multi-Agent Systems Using Complex Laplacian under Directed Topology |
|
Ranjbar, Mojtaba | Tarbiat Modares University |
Beheshti, Mohammad T. H. | Univ. of Tarbiat Modares |
Bolouki, Sadegh | Tarbiat Modares University |
Keywords: Cooperative control, Agents-based systems
Abstract: In this paper, the formation control problem of multi-agent systems with directed communication graphs is considered. A novel distributed event-triggered approach, which involves complex Laplacian, is proposed to address the problem. The event condition depends on periodic samplings of agent states, which automatically eliminates the possibility of Zeno behavior. Then, it is shown that, under simple verifiable conditions, the proposed control strategy results in the desired formation of agents. Finally, the results are verified by numerical examples.
|
|
13:30-13:45, Paper WeA05.3 | Add to My Program |
A Discrete Time Model for Swarm Formation with Coordinates Coupling Matrix |
|
Fedele, Giuseppe | University of Calabria |
D'Alfonso, Luigi | GIPSTECH S.r.l |
Bono, Antonio | University of Calabria |
Keywords: Agents-based systems, Autonomous systems, Cooperative control
Abstract: In this paper a discrete time kinematic model for a swarm of agents moving in a multi-dimensional space is proposed. The model evolves using two main terms accounting for attraction of each swarm agent to a common goal position and inter-agents interactions. This last term can be tuned to obtain desired overall swarm behaviors using an interaction matrix that weighs the displacements among agents. The main model properties will be studied proving that agents aggregate in finite-time in known regions, ensuring swarm centroid convergence to a target position. The entire analysis will be performed assuming that each agent has a finite detection radius. The connections topology evolution will be studied too and it will be proved that under not restrictive assumptions on the detection range size and on the starting connections topology, the swarm reaches, in finite-time, a configuration where each agent can always detect all the other ones. Finally a laboratory experiment is developed to show the applicability of the method.
|
|
13:45-14:00, Paper WeA05.4 | Add to My Program |
A Data-Driven Method Based on Quadratic Programming for Distance-Based Formation Control of Euler-Lagrange Systems |
|
Arroyo Rodriguez, Veronica | Polytechnic University of Madrid |
Gamonal Fernandez, Manuela | Instituto De Ciencias Matematicas & Complutense University of Ma |
Moreno, Patricio | University of Buenos Aires |
Colombo, Leonardo Jesus | Universidad Autonoma De Madrid |
Keywords: Agents-based systems, Algebraic/geometric methods, Nonlinear systems identification
Abstract: Distance-based formation control of second-order autonomous agents can be seen as a physical system of particles linked by springs, whose evolution may be described by a Lagrangian function. In this letter we present a method based on quadratic programming to approximate the unknown Lagrangian function associated with the distance-based formation control of second-order agents from a set of sample data.
|
|
14:00-14:15, Paper WeA05.5 | Add to My Program |
Distance-Based Formation Control Over Directed Triangulated Laman Graphs in 2-D Space |
|
Babazadeh, Reza | Concordia University |
Selmic, Rastko | Concordia University |
Keywords: Agents-based systems, Cooperative control, Optimal control
Abstract: In this paper, the problem of distance-based formation control over directed graphs is studied. We consider the desired distance-based formation modeled as a directed graph where each edge is assigned to only one of its incident agents to preserve the desired distance. We propose a distributed control solution based on the state-dependent Riccati equation (SDRE) method that guarantees the formation's asymptotic stability. Moreover, we developed a procedure for preventing the formation's flip ambiguity where the signed area constraints are used. We also provide an extension that guarantees collision avoidance besides preventing the flip ambiguity. Simulation results are provided to illustrate the effectiveness of the proposed methods.
|
|
14:15-14:30, Paper WeA05.6 | Add to My Program |
Distributed Adaptive Event-Triggered Coordination with Discrete Updates of Controllers |
|
Cheng, Bin | Peking University |
Li, Zhongkui | Peking University |
Lv, Yuezu | Southeast University |
Keywords: Adaptive control, Cooperative control, Networked control systems
Abstract: This paper considers the distributed consensus control problem constrained by discrete communications among neighboring agents and discrete control updating. For undi- rected graphs, we progressively propose several event-triggered protocols composed of controllers and event-triggered condi- tions. For directed graphs, we include time-varying coupling gains plus monotonically increasing functions into both the two components of the protocol, in order to avoid the requirement of global information and provide extra freedom for design. It is shown that the presented event-triggered protocols can guarantee that all agents converge to a common state and Zeno behavior is excluded. Compared to the existing papers, the main contribution of this paper is that the protocols here are able to reduce the frequency of communication and control updating, and can be used in directed graphs. Numerical examples are provided to verify the effectiveness of the proposed event- triggered protocols.
|
|
14:30-14:45, Paper WeA05.7 | Add to My Program |
Formation Control of Multi-Agent Systems with Generalized Relative Measurements |
|
Sakurama, Kazunori | Kyoto University |
Keywords: Cooperative control, Agents-based systems, Distributed control
Abstract: This paper provides a unified solution to a formation control problem with generalized relative measurements. First, we generalize an expression of relative measurements over local frames by introducing a wide class of transformation of the frames. The transformation is provided in an arbitrary subset of a semidirect product, which enables us to describe various types of practical measurements including uncertainty. Second, we derive a strict condition on the agent configuration obtained under the class of relative measurements. Especially, the configuration is described with ambiguity caused by the measurement. Here, we design a distributed controller consisting of clique-wise and agent-wise functions, which respectively work to coordinate the agents and to remove redundant ambiguity from the configuration. As a result, the best performance is guaranteed in terms of providing the smallest ambiguity under appropriate coordination. Finally, the effectiveness of the designed controller is demonstrated through numerical examples.
|
|
14:45-15:00, Paper WeA05.8 | Add to My Program |
On Global Convergence of Area-Constrained Formations of Hierarchical Multi-Agent Systems |
|
Sugie, Toshiharu | Osaka University |
Tong, Fei | Kyoto University |
Anderson, Brian D.O. | Australian National University |
Sun, Zhiyong | Eindhoven University of Technology (TU/e) |
Keywords: Networked control systems, Agents-based systems, Cooperative control
Abstract: This paper is concerned with a formation shaping problem for point agents in a two-dimensional space, where control avoids the possibility of reflection ambiguities. One solution for this type of problems was given first by considering a potential function which consists of both the distance error and the signed area terms for three or four agents. Then, by exploiting a hierarchical control strategy with such potential functions, the method was extended to any number of agents recently. However, a specific gain on the signed area term must be employed there, and it does not guarantee the global convergence. To overcome this issue, this paper provides a necessary and sufficient condition for the global convergence, subject to the constraint that the desired formation consists of isosceles triangles only. This clarifies the admissible range of the gain on the signed area for this case. In addition, as for formations consisting of general triangles, it is shown when high gain on the signed area is admissible for global convergence.
|
|
WeA06 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Platooning and Intersections |
|
|
Chair: Di Benedetto, Maria Domenica | University of L'Aquila |
Co-Chair: Evangelou, Simos Andreas | Imperial College |
|
13:00-13:15, Paper WeA06.1 | Add to My Program |
On the Utilization of Macroscopic Information for String Stability of a Vehicular Platoon |
|
Mirabilio, Marco | University of L'Aquila |
Iovine, Alessio | UC Berkeley |
De Santis, Elena | University of L'Aquila |
Di Benedetto, Maria Domenica | University of L'Aquila |
Pola, Giordano | University of L'Aquila |
Keywords: Traffic control, Autonomous vehicles, Stability of nonlinear systems
Abstract: The use of macroscopic information for the controlof a vehicular platoon composed of autonomous vehicles is investigated. A mesoscopic control law is provided, and String Stability is proved by Lyapunov functions and Input-to-State Stability (ISS) concepts. Simulations are implemented in order to validate the controller and to show the efficacy of the proposed approach for mitigating traffic oscillations.
|
|
13:15-13:30, Paper WeA06.2 | Add to My Program |
Traffic Control Via Platoons of Intelligent Vehicles for Saving Fuel Consumption in Freeway Systems |
|
Piacentini, Giulia | University of Pavia |
Goatin, Paola | Inria |
Ferrara, Antonella | University of Pavia |
Keywords: Traffic control, Autonomous vehicles
Abstract: In this paper a coupled PDE-ODE model describing the interaction between the bulk traffic flow and a platoon of connected vehicles is adopted to develop a control action aiming at reducing the fuel consumption of the overall traffic flow. The platoon is modeled as a capacity restriction acting on the surrounding traffic. The trajectory of the initial and final points of the platoon are optimized by means of a model predictive control strategy, acting on the speeds of the front-end and back-end of the platoon, thus resulting in controlling both the speed and the length of the platoon. The approach is assessed in simulations.
|
|
13:30-13:45, Paper WeA06.3 | Add to My Program |
Reducing Time Headway in Platoons under the MPF Topology When Using Sensors and Wireless Communications |
|
Abolfazli, Elham | Aalto University |
Besselink, Bart | University of Groningen |
Charalambous, Themistoklis | Aalto University |
Keywords: Networked control systems, Autonomous vehicles, Delay systems
Abstract: In platoons under the multiple-predecessor following (MPF) topology, the exchange of information is usually assumed to be via Vehicle-to-Vehicle (V2V) communication links. Communication delays, however, deteriorate internal and string stability and, as a consequence, to guarantee stability of the platoon the time headway between vehicles should be increased. Autonomous vehicles are nowadays equipped with a multitude of sensors, the three primary being radar, lidar and camera. The combination of such sensors allows a vehicle, among others, to detect the distance and speed of nearby objects. In this paper, we incorporate the available sensors of vehicles to anticipate the communication delays vehicles experience with their preceding vehicles, thus improving the stability conditions and subsequently reducing the time headway. More specifically, by incorporating sensors we alleviate the communication delays between neighboring vehicles. We demonstrate that emph{(i)} the system is internally stable irrespective of the size of the communication delays, and emph{(ii)} the time headway can be reduced, by deriving a sufficient condition which provides a lower bound on the time headway that guarantees string stability. With this new lower bound on the time headway, platoons do not need to massively increase the time headway in order to compensate for the effects of communications delays. Simulations demonstrate the effectiveness of the proposed lower bound.
|
|
13:45-14:00, Paper WeA06.4 | Add to My Program |
Optimal Motion Control for Connected and Automated Electric Vehicles at Signal-Free Intersections |
|
Pan, Xiao | Imperial College London |
Chen, Boli | University College London |
Evangelou, Simos Andreas | Imperial College |
Timotheou, Stelios | University of Cyprus |
Keywords: Automotive control, Automotive systems, Autonomous vehicles
Abstract: Traffic congestion is one of the major issues for urban traffic networks. The connected and autonomous vehicles (CAV) is an emerging technology that has the potential to address this issue by improving safety, efficiency, and capacity of the transportation system. In this paper, the problem of optimal trajectory planning of battery-electric CAVs in the context of cooperative crossing of an unsignalized intersection is addressed. An optimization-based centralized intersection controller is proposed to find the optimal velocity trajectory of each vehicle so as to minimize electric energy consumption and traffic throughput. Solving the underlying optimization problem for a group of CAVs is not straightforward because of the nonlinear and nonconvex dynamics, especially when the powertrain model is explicitly modeled. In order to ensure a rapid solution search and a unique global optimum, the optimal control problem (OCP) is reformulated via convex modeling techniques. Several simulation case studies are presented to show the effectiveness of the proposed approach and the trade-off between energy consumption and traffic throughput is illustrated.
|
|
14:00-14:15, Paper WeA06.5 | Add to My Program |
String Stable Integral Control of Vehicle Platoons with Actuator Dynamics and Disturbances |
|
Froes Silva, Guilherme | Queensland University of Technology |
Donaire, Alejandro | The University of Newcastle |
McFadyen, Aaron | Queensland University of Technology |
Ford, Jason John | Queensland Univeristy of Technology |
Keywords: Agents-based systems, Automotive control, Decentralized control
Abstract: This paper presents the design of an integral controller for vehicle platoons with actuator dynamics. The proposed controller ensures string stability with disturbances and simultaneously compensates for constant disturbances through integral action. Sufficient conditions for string stability are satisfied by the use of a suitable state transformation. The proposed controller guarantees disturbance string stability for a prescribed time constant of the actuator dynamics, and we show through simulation that platoons with faster dynamics are also made disturbance string stable.
|
|
14:15-14:30, Paper WeA06.6 | Add to My Program |
Coordinated Lateral and Longitudinal Vehicle-Following Control of Connected and Automated Vehicles Considering Nonlinear Dynamics |
|
Wang, Yulei | Tongji University |
Bian, Ning | Dongfeng Technical Center, Dongfeng Motor Corporation |
Zhang, Lin | Tongji University |
Chen, Hong | Jilin University, Campus NanLing |
Keywords: Autonomous vehicles, Automotive control, Feedback linearization
Abstract: In this paper, the problem of coordinated lateral and longitudinal vehicle-following control for connected and automated vehicles (CAVs) is investigated by considering nonlinear dynamics. Based on a nonlinear model of longitudinal and lateral vehicle dynamics and vehicle-to-vehicle (V2V) communication, we propose a triple-step nonlinear control law for a CAV to follow the trajectory of preceding vehicle. The stability regarding the time gap spacing and velocity errors is discussed via Lyapunov theory. Simulation results illustrate the effectiveness of the proposed control for curved lanes.
|
|
14:30-14:45, Paper WeA06.7 | Add to My Program |
Distributed Nonlinear Model Predictive Control and Metric Learning for Heterogeneous Vehicle Platooning with Cut-in/Cut-Out Maneuvers |
|
Basiri, Mohammad Hossein | University of Waterloo |
Ghojogh, Benyamin | University of Waterloo |
Azad, Nasser Lashgarian | University of Waterloo |
Fischmeister, Sebastian | University of Waterloo |
Karray, Fakhri | University of Waterloo |
Crowley, Mark | University of Waterloo |
Keywords: Automotive control, Predictive control for nonlinear systems, Optimization
Abstract: Vehicle platooning has been shown to be quite fruitful in the transportation industry to enhance fuel economy, road throughput, and driving comfort. Model Predictive Control (MPC) is widely used in literature for platoon control to achieve certain objectives, such as safely reducing the distance among consecutive vehicles while following the leader vehicle. In this paper, we propose a Distributed Nonlinear MPC (DNMPC), based upon an existing approach, to control a heterogeneous dynamic platoon with unidirectional topologies, handling possible cut-in/cut-out maneuvers. The introduced method guarantees a collision-free driving experience while tracking the desired speed profile and maintaining a safe desired gap among the vehicles. The time of convergence in the dynamic platooning is derived based on the time of cut-in and/or cut-out maneuvers. In addition, we analyze the level of improvement of driving comfort, fuel economy, and absolute and relative convergence of the method by using distributed metric learning and distributed optimization with Alternating Direction Method of Multipliers (ADMM). Simulation results on a dynamic platoon with cut-in and cut-out maneuvers and with different unidirectional topologies show the effectiveness of the introduced method.
|
|
14:45-15:00, Paper WeA06.8 | Add to My Program |
Platoon Control Based on Predecessor and Delayed Leader Information Via Minimized Headway Times |
|
Koroglu, Hakan | University of Twente |
Falcone, Paolo | Chalmers University of Technology |
Keywords: Autonomous vehicles, Delay systems, Cooperative control
Abstract: The platoon control problem is considered under a recently proposed leader and predecessor following scheme with a third-order vehicle dynamics. The scheme is investigated with a simple PD-type controller in the face of delayed leader information. It is shown that string stability can be achieved only when the headway parameter is chosen larger than the communication delay. The use of larger headway parameters is also shown to be necessary to avoid the amplification of the acceleration signals backward along the platoon. Guidelines are given for the choice of controller gains and the headway parameter in a way to achieve desirable platoon behavior.
|
|
WeA07 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Estimation |
|
|
Chair: Ushirobira, Rosane | Inria |
Co-Chair: Asiri, Sharefa | King Abdulaziz University (KAU) |
|
13:00-13:15, Paper WeA07.1 | Add to My Program |
Selection of Modulating Functions' Design Parameters for Estimation Problems |
|
Asiri, Sharefa | King Abdulaziz University (KAU) |
Liu, Da-Yan | INSA Centre Val De Loire, Campus De Bourges |
Laleg-Kirati, Taous-Meriem | King Abdullah University of Science and Technology (KAUST) |
Keywords: Estimation, Algebraic/geometric methods, Numerical algorithms
Abstract: In this paper, an effective algorithm for selecting the design parameters in the modulating functions-based method (MFBM) is introduced. The appropriate selection of these parameters improves the performance of the estimation approach and reduces the computational cost. The effectiveness and robustness of the algorithm are shown through numerical simulations in a noisy environment.
|
|
13:15-13:30, Paper WeA07.2 | Add to My Program |
Iterative H∞-Norm Estimation Using Cyclic-Prefixed Signals |
|
Müller, Matias I. | KTH Royal Institute of Technology |
Rojas, Cristian R. | KTH Royal Institute of Technology |
Keywords: Estimation, Statistical learning, Computational methods
Abstract: The problem of estimating the largest gain of an unknown linear and time-invariant filter is studied, known as the H∞-norm estimation problem. The approach presented in this paper is iterative and corresponds to the combination of two state-of-the-art methods: Power Iterations and Weighted Thompson Sampling. The combination is done by means of a well-known technique in communications known as a cyclic prefix, in which the last points of a signal are prepended to it. This allows to take considerably more exact measurements of the frequency response of the system at a set of equispaced frequencies. The discussion is complemented with a simulation study, showing that the proposed algorithm has an increased speed of convergence to the quantity of interest.
|
|
13:30-13:45, Paper WeA07.3 | Add to My Program |
Estimation of Heteroscedastic Multilinear Systems |
|
Wang, Mingliang | KTH Royal Institute of Technology |
Jacobsen, Elling | Royal Inst of Tech - KTH |
Chotteau, Veronique | KTH Royal Institute of Technology |
Hjalmarsson, Håkan | KTH Royal Inst. of Tech |
Keywords: Estimation, Nonlinear systems identification, Statistical learning
Abstract: In this paper, we propose an estimation method for heteroscedastic multilinear systems. The system consists of a multilinear map of latent functions and an input-dependent noise process. We assume Gaussian-process priors on the unknowns to embed non-parametric models. This leads to a hierarchical model called heteroscedastic multilinear Gaussian processes which do not admit closed-form posterior and predictive distributions. The model is treated in an empirical Bayes fashion where the hyperparameters are estimated by maximizing the marginal distribution. To achieve that, we use a Monte Carlo expectation maximization method based on a Gibbs sampling algorithm. The predictive inference is also introduced where the mean is used as an approximation of the unknown functions. The performance of the proposed method is illustrated in a simulation study.
|
|
13:45-14:00, Paper WeA07.4 | Add to My Program |
Towards State Estimation of Persidskii Systems |
|
Mei, Wenjie | Inria |
Efimov, Denis | Inria |
Ushirobira, Rosane | Inria |
Keywords: Estimation, Observers for nonlinear systems
Abstract: A state estimation scheme for a class of Persidskii systems is introduced in this paper. Two distinct sets of conditions are formulated for robust stability and convergence of the state estimation error using the theories of input-to-output stability (IOS) and input-to-state stability (ISS). These conditions for the nonlinear error dynamics are established in the form of linear matrix inequalities. Two numerical examples are presented to illustrate the effectiveness of the proposed results.
|
|
14:00-14:15, Paper WeA07.5 | Add to My Program |
Differentially Private Parameter Estimation: Optimal Noise Insertion and Data Owner Selection |
|
Cao, Xuanyu | University of Illinois at Urbana-Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Estimation, Control Systems Privacy, Optimization
Abstract: In this paper, we study differentially private parameter estimation in which a data acquisitor (DA) accesses data (or signals) from multiple privacy-aware data owners (DOs) to estimate some random parameters. To ensure differential privacy, the DOs add Laplace noises to their private signals and only reveal the noisy signals to the DA. Our goal is to add optimal amount of noises (measured by their inverse variances) so that the mean squared error (MSE) of the DA's estimate is minimized while differential privacy is satisfied. In the general case, the optimal private estimation can be formulated as a semidefinite program (SDP), which can be readily solved by off-the-shelf optimization methods. In the special case where different DOs have uncorrelated signals, the optimization problem is decomposed across DOs and can be solved very efficiently in almost closed-form. We observe that, in the optimal solution, the DOs should add more noises to the signal entries that are less useful for estimation. Further, when the DA has DO selection constraint (e.g., due to limited budget), a relaxed SDP is put forth to compute a suboptimal solution with superb empirical performance. Finally, several numerical examples are presented.
|
|
14:15-14:30, Paper WeA07.6 | Add to My Program |
Near-Optimal MAP Estimation for Markov Jump Linear Systems Using Relaxed Dynamic Programming |
|
Andrien, Alex Rudolf Petrus | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Estimation, Switched systems, Markov processes
Abstract: Computing the optimal joint maximum a posteriori probability (JMAP) estimate of the state and mode of a Markov jump linear system (MJLS) is known to be a computationally intractable problem. This letter provides a novel approximate method for such a problem that guarantees to be within a pre-specified bound of the optimal estimate. The proposed method builds upon relaxed dynamic programming. Through numerical examples, we show that this method can lead to better estimates with less computations than previous suboptimal methods proposed in the literature.
|
|
14:30-14:45, Paper WeA07.7 | Add to My Program |
Robust Moving Horizon State Estimation for Uncertain Linear Systems Using Linear Matrix Inequalities |
|
Georgiou, Anastasis | Imperial College London |
Tahir, Furqan | Perceptive Engineering Limited |
Evangelou, Simos Andreas | Imperial College |
Jaimoukha, Imad M. | Imperial College London |
Keywords: Estimation, Uncertain systems, LMIs
Abstract: This paper investigates the problem of state estimation for linear-time-invariant (LTI) discrete-time systems subject to structured feedback uncertainty and bounded disturbances. The proposed Robust Moving Horizon Estimation (RMHE) scheme computes at each sample time tight bounds on the uncertain states by solving a linear matrix inequality (LMI) optimization problem based on the available noisy input and output data. In comparison with conventional approaches that use offline calculation for the estimation, the suggested scheme achieves an acceptable level of performance with reduced conservativeness, while the online computational time is maintained relatively low. The effectiveness of the proposed estimation method is assessed via a numerical example.
|
|
14:45-15:00, Paper WeA07.8 | Add to My Program |
A Compact CRB for the Single Source Conditional Signal Model with Application to Delay-Doppler-Phase Estimation of Band-Limited Signals |
|
Ortega, Lorenzo | Telecommunications for Space and Aeronautics Lab (TéSA) |
Medina, Daniel | German Aerospace Center (DLR) |
Vilà-Valls, Jordi | University of Toulouse-ISAE |
Vincent, Francois | University of Toulouse-ISAE |
Chaumette, Eric | University of Toulouse-ISAE |
Keywords: Estimation
Abstract: The derivation of tight estimation lower bounds is a key player to design and assess the performance of new estimators. In this contribution, we derive a new compact Cramér-Rao bound (CRB) for the conditional signal model, where the deterministic parameter's vector includes a real positive amplitude and the signal phase. Then, such CRB is particularized to the delay, Doppler, phase and amplitude estimation with band-limited (narrowband) signals, where transmitter and receiver are in relative uniform radial movement. The latter expression is especially easy to use because it only depends on the signal samples. We provide illustrative results for a representative Global Navigation Satellite System positioning example.
|
|
WeA08 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Kalman Filtering |
|
|
Chair: Zorzi, Mattia | University of Padova |
Co-Chair: Murata, Masaya | Japan Aerospace Exploration Agency |
|
13:00-13:15, Paper WeA08.1 | Add to My Program |
An Extended Kalman Filter for Data-Enabled Predictive Control |
|
Alpago, Daniele | University of Padova |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Lygeros, John | ETH Zurich |
Keywords: Predictive control for linear systems, Stochastic systems, Kalman filtering
Abstract: The literature dealing with data-driven analysis and control problems has significantly grown in the recent years. Most of the literature deals with linear time-invariant (LTI) systems in which the uncertainty (if any) is assumed to be deterministic and bounded; relatively little attention has been devoted to stochastic LTI systems. As a first step in this direction, we propose to equip the recently introduced Data-enabled Predictive Control (DeePC) algorithm with a data-based Extended Kalman Filter (EKF) to make use of additional available input-output data for reducing the effect of noise, without increasing the computational load of the optimization procedure.
|
|
13:15-13:30, Paper WeA08.2 | Add to My Program |
Extended, Unscented Kalman, and Sigma Point Multiple Distribution Estimation Filters for Nonlinear Discrete State-Space Models |
|
Murata, Masaya | Japan Aerospace Exploration Agency |
Kawano, Isao | Japan Aerospace Exploration Agency |
Inoue, Koichi | Japan Aerospace Exploration Agency |
Keywords: Filtering, Estimation, Kalman filtering
Abstract: The extended and unscented Kalman multiple distribution estimation filters (EKMDEF/UKMDEF) were recently proposed for nonlinear continuous-discrete state-space models and their superior filtering accuracy was shown in the simulation of satellite reentry. The EKMDEF/UKMDEF is based on the multiple distribution estimation (MDE) for the filtered state probability density function (PDF) and this paper provides its alternate derivation with a comparison of the Gaussian sum filter (GSF). This result sheds light on the relationship between the EKMDEF/UKMDEF and the GSF and a new filter that is more stable than the EKMDEF/UKMDEF can be designed. The performance of the proposed filter is examined for nonlinear discrete state-space models using benchmark simulation problems and is compared with those of the representative filters including particle filters (PFs).
|
|
13:30-13:45, Paper WeA08.3 | Add to My Program |
Robust Tracking under Measurement Model Mismatch Via Linearly Constrained Extended Kalman Filtering |
|
Ortega, Lorenzo | Telecommunications for Space and Aeronautics Lab (TéSA) |
Vilà-Valls, Jordi | University of Toulouse-ISAE |
Chaumette, Eric | University of Toulouse-ISAE |
Pages, Gael | University of Toulouse-ISAE |
Vincent, Francois | University of Toulouse-ISAE |
Keywords: Kalman filtering, Filtering
Abstract: Standard state estimation techniques, ranging from the linear Kalman filter to nonlinear sigma-point or particle filters, assume a perfectly known system model, that is, process and measurement functions and system noise statistics (both the distribution and its parameters). This is a strong assumption which may not hold in practice, reason why several approaches have been proposed for robust filtering. In the context of linear filtering, a solution to cope with a possible system matrices mismatch is to use linear constraints. In this contribution we further explore the extension and use of recent results on linearly constrained Kalman filtering (LCKF) for robust tracking/localization under measurement model mismatch. We first derive the natural extension of the LCKF to nonlinear systems, and its use to mitigate parametric modelling errors in the nonlinear measurement function. A tracking problem where a set of sensors at possibly mismatched (unknown to a certain extent) positions track a moving object from time of arrival measurements is used to support the discussion.
|
|
13:45-14:00, Paper WeA08.4 | Add to My Program |
Low-Rank Kalman Filtering under Model Uncertainty |
|
Yi, Shenglun | Beijing Institute of Technology |
Zorzi, Mattia | University of Padova |
Keywords: Kalman filtering, Filtering, Uncertain systems
Abstract: We consider a robust filtering problem where the nominal state space model is not reachable and different from the actual one. We propose a robust Kalman filter which solves a dynamic game: one player selects the least-favorable model in a given ambiguity set, while the other player designs the optimum filter for the least-favorable model. It turns out that the robust filter is governed by a low-rank risk sensitive-like Riccati equation. Finally, simulation results show the effectiveness of the proposed filter.
|
|
14:00-14:15, Paper WeA08.5 | Add to My Program |
Robust Kalman Filtering with Probabilistic Uncertainty in System Parameters |
|
Kim, Sunsoo | Texas A&M University |
Deshpande, Vedang M. | Texas A&M University |
Bhattacharya, Raktim | Texas A&M |
Keywords: Kalman filtering, Uncertain systems, Estimation
Abstract: In this paper, we propose a robust Kalman filtering framework for systems with probabilistic uncertainty in system parameters. We consider two cases, namely discrete time systems, and continuous time systems with discrete measurements. The uncertainty, characterized by mean and variance of the states, is propagated using conditional expectations and polynomial chaos expansion framework. The results obtained using the proposed filter are compared with existing robust filters in the literature. The proposed filter demonstrates better performance in terms of estimation error and rate of convergence.
|
|
14:15-14:30, Paper WeA08.6 | Add to My Program |
Exploiting Linear Substructure in Linear Regression Kalman Filters |
|
Greiff, Marcus Carl | Lund University |
Robertsson, Anders | LTH, Lund University |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Keywords: Kalman filtering, Estimation, Computational methods
Abstract: We exploit knowledge of linear substructure in the linear-regression Kalman filters (LRKFs) to simplify the problem of moment matching. The theoretical results yield quantifiable and significant computational speedups at no cost of estimation accuracy, assuming partially linear estimation models. The results apply to any symmetrical LRKF, and reductions in computational complexity are stated as a function of the cubature rule, the number of linear and nonlinear states in the estimation model respectively. The implications for the filtering problem are illustrated by several numerical examples.
|
|
14:30-14:45, Paper WeA08.7 | Add to My Program |
A Hybrid, Coupled Approach to the Continuous-Discrete Kalman Filter |
|
Patel, Zubeida | University of Cape Town |
Boje, Edward | University of Cape Town |
Keywords: Kalman filtering
Abstract: In this paper we present a novel approach to continuous-discrete (CD) Kalman filtering. Unlike the EKF or other hybrid UKF-EKF filters, the novel approach does not require direct calculation of system Jacobians and instead uses unscented transforms (UTs) to extract a pair of matrices, each made up of a linear combination of derivatives (with respect to the state), that are used in its place. More specifically, they are used (1) in parts of the filtering process, and (2) in the linearly implicit numerical integration scheme of the filter’s state propagation stage. Extracting these matrices from the filter’s UTs and using them in both the filtering process and model simulation, or what we refer to as coupling the filter to model simulation, avoids having to calculate further function evaluations for standard implicit methods, making the process of state estimation for stiff systems more efficient. Another benefit of the proposed approach is that it offers UKF accuracy levels but improved numerical stability over the UKF. This is because it uses symmetric and positive-definite(PD) representations of the state covariance propagation and measurement update equations.
|
|
14:45-15:00, Paper WeA08.8 | Add to My Program |
Sequential Coordinates Conversion and Decoupled Linear Tracking for Airborne AESA Radars |
|
Jung, Bo-Young | Handong Global University |
Ham, Dahye | Handong Global University |
Ra, Won-Sang | Handong Global University |
Keywords: Kalman filtering, Filtering, Aerospace
Abstract: This paper deals with the problem of tracking a maneuvering target using an airborne electronically scanned array radar. Conventional target-tracking filters produce biased target state estimates because the angular motion of a radar platform complicates the cross-correlations between the converted measurements (CMs). As a practical alternative to this issue, the target tracking filter is designed in the predicted line-of-sight Cartesian coordinate system (PLCCS). Provided that the {it a priori} state estimation error is small enough, the statistics of the CMs in PLCCS can be approximated accurately despite using the first and second moments of the radar measurement noises only. The resulting debiasing term leads to a satisfactory target tracking performance of the proposed filter. Furthermore, by exploiting the diagonality of the noise covariance matrix derived in the PLCCS, the proposed tracking filter is decoupled into each axis to induce a suitable real-time implementation. Through simulations, the enhanced performance of the suggested method is demonstrated.
|
|
WeA09 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Optimization Algorithms I |
|
|
Chair: Prandini, Maria | Politecnico Di Milano |
Co-Chair: Tsumura, Koji | The University of Tokyo |
|
13:00-13:15, Paper WeA09.1 | Add to My Program |
Accelerated Multi-Agent Optimization Method Over Stochastic Networks |
|
Ananduta, Wicak | TU Delft |
Ocampo-Martinez, Carlos | Universitat Politècnica De Catalunya (UPC) |
Nedich, Angelia | Arizona State University |
Keywords: Optimization algorithms, Agents-based systems, Control over communications
Abstract: We propose a distributed method to solve a multi-agent optimization problem with strongly convex cost function and equality coupling constraints. The method is based on Nesterov's accelerated gradient approach and works over stochastically time-varying communication networks. We consider the standard assumptions of Nesterov's method and show that the sequence of the expected dual values converge toward the optimal value with the rate of O(1/k2). Furthermore, we provide a simulation study of solving an optimal power flow problem with a well-known benchmark case.
|
|
13:15-13:30, Paper WeA09.2 | Add to My Program |
A Decentralized Algorithm for Large Scale Min-Max Problems |
|
Mukherjee, Soham | Indian Institute of Technology Kharagpur |
Chakraborty, Mrityunjoy | IIT Kharagpur |
Keywords: Optimization algorithms, Control of networks, Decentralized control
Abstract: We consider a distributed saddle point problem, in which a collection of nodes collaboratively optimize a sum of local component functions through local computations and information exchange with neighbouring nodes. To solve this problem, we propose a decentralized algorithm based on the Extragradient method, whose centralized implementation has been shown to achieve good performance on a wide range of minmax problems. We show that our proposed method achieves linear convergence under suitable assumptions and explicitly characterize how the convergence rate depends on the condition number and the spectral gap of the communication graph. We also present numerical simulations that corroborate our theoretical results.
|
|
13:30-13:45, Paper WeA09.3 | Add to My Program |
Private and Hot-Pluggable Distributed Averaging |
|
Donato Ridgley, Israel | Northwestern University |
Freeman, Randy | Northwestern Univ |
Lynch, Kevin M. | Northwestern University |
Keywords: Optimization algorithms, Distributed control, Network analysis and control
Abstract: Some distributed optimization applications require privacy, meaning that the values of certain parameters local to a node should not be revealed to other nodes in the network during the joint optimization process. A special case is the problem of private distributed averaging, in which a network of nodes computes the global average of individual node reference values in a distributed manner while preserving the privacy of each reference. We present simple iterative methods that guarantee accuracy (i.e., the exact asymptotic computation of the global average) and privacy (i.e., no node can estimate another node's reference value). To achieve this, we require that the digraph modeling the communication between nodes satisfy certain topological conditions. Our method is hot-pluggable (meaning no reinitialization of the averaging process is required when the network changes or a node enters or leaves, when there is a communication or computation fault, or when a node's reference value changes); it does not require an initial scrambling phase; it does not inject noise or other masking signals into the distributed computation; it does not require random switching of edge weights; and it does not rely on homomorphic encryption.
|
|
13:45-14:00, Paper WeA09.4 | Add to My Program |
Distributed Maximization of Submodular and Approximately Submodular Functions |
|
Ye, Lintao | Purdue University |
Sundaram, Shreyas | Purdue University |
Keywords: Optimization algorithms, Distributed control, Control of networks
Abstract: We study the problem of maximizing a submodular function, subject to a cardinality constraint, with a set of agents communicating over a connected graph. We propose a distributed greedy algorithm that allows all the agents to converge to a near-optimal solution to the global maximization problem using only local information and communication with neighbors in the graph. The near-optimal solution approaches the (1-1/e) approximation of the optimal solution to the global maximization problem with an additive factor that depends on the number of communication steps in the algorithm. We then analyze convergence guarantees of the proposed algorithm. This analysis reveals a tradeoff between the number of communication steps and the performance of the algorithm. Finally, we extend our analysis to nonsubmodular settings, using the notion of approximate submodularity.
|
|
14:00-14:15, Paper WeA09.5 | Add to My Program |
Preconditioned Distributed Trajectory Optimization Algorithm Using Differential Dynamic Programming |
|
Wang, Yunzhuo | The University of Tokyo |
Tsumura, Koji | The University of Tokyo |
Keywords: Optimization algorithms, Distributed control, Large-scale systems
Abstract: Trajectory optimization and model predictive control is demanding but challenging for distributed and time-critical system that consists of a large number of dynamic subsystems with sparse physical interactions. The classic dual gradient ascent method suffers from the slow convergence when the system is not well-scaled. This paper proposes a Jacobi-preconditioned dual gradient ascent method that fully exploits the idea behind Differential Dynamic Programming to compute the ascent direction in a distributed and recurrent manner at a linear time-cost with respect to the length of the time horizon. Moreover, we propose a method to compute a fixed step size for the preconditioned dual gradient ascent step that can guarantee global convergence property under certain assumptions. A numerical experiment shows that our proposed algorithm improves performance and robustness to ill-scaled problems over the ordinary non-preconditioned dual ascent algorithm. This algorithm has great potential applications in power grid, chemical plants, and cooperative systems of drones.
|
|
14:15-14:30, Paper WeA09.6 | Add to My Program |
D-DistADMM: A O(1/k) Distributed ADMM for Distributed Optimization in Directed Graph Topologies |
|
Khatana, Vivek | University of Minnesota, Twin-Cities |
Salapaka, Murti V. | University of Minnesota, Minneapolis |
Keywords: Optimization algorithms, Machine learning, Distributed control
Abstract: We focus on the problem of minimizing a finite sum f(x) = sum_{i=1}^n f_i(x) of functions of n functions f_i, where f_i are convex and available only locally to an agent i. The n agents are connected in a directed network mathcal{G}(mathbf{V},mathbf{E}), where each agent i can only communicate with agents in its neighborhood determined by mathcal{G}(V,E). In this article, we present the Directed-Distributed Alternating Direction Method of Multiplier (D-DistADMM) Algorithm, which is an Alternating Direction Method of Multiplier (ADMM) based scheme and utilizes a finite-time ``approximate'' consensus method to solve the above optimization problem distributively. At each iteration of the proposed scheme the agents solve their local optimization problem and utilize an approximate consensus protocol to update a local estimate of the global optimization variable. We show that for convex and not-necessarily differentiable objective functions f_i's the proposed textit{D-DistADMM} method converges at a rate O(1/k). We further demonstrate the applicability of our algorithm by solving a distributed least-squares problem.
|
|
14:30-14:45, Paper WeA09.7 | Add to My Program |
A Distributed Dual Proximal Minimization Algorithm for Constraint-Coupled Optimization Problems |
|
Falsone, Alessandro | Politecnico Di Milano |
Prandini, Maria | Politecnico Di Milano |
Keywords: Optimization algorithms, Distributed control, Agents-based systems
Abstract: We address constraint-coupled optimization for a system composed of multiple cooperative agents communicating over a time-varying network. We propose a distributed proximal minimization algorithm that is guaranteed to converge to an optimal solution of the optimization problem, under suitable convexity and connectivity assumptions. The performance of the introduced algorithm is shown on a numerical example of a charging scheduling problem for a fleet of plug-in electric vehicles.
|
|
14:45-15:00, Paper WeA09.8 | Add to My Program |
Overlapping Schwarz Decomposition for Constrained Quadratic Programs |
|
Shin, Sungho | University of Wisconsin-Madison |
Anitescu, Mihai | Argonne National Laboratory |
Zavala, Victor M. | University of Wisconsin-Madison |
Keywords: Optimization algorithms, Large-scale systems, Control of networks
Abstract: We present an overlapping Schwarz decomposition algorithm for constrained quadratic programs (QPs). Schwarz algorithms have been traditionally used to solve linear algebra systems arising from partial differential equations, but we have recently shown that they are also effective at solving structured optimization problems. In the proposed scheme, we consider QPs whose algebraic structure can be represented by graphs. The graph domain is partitioned into overlapping subdomains (yielding a set of coupled subproblems), solutions for the subproblems are computed in parallel, and convergence is enforced by updating primal-dual information in the overlapping regions. We show that convergence is guaranteed if the overlap is sufficiently large and that the convergence rate improves exponentially with the size of the overlap. Convergence results rely on a key property of graph-structured problems that is known as exponential decay of sensitivity. Here, we establish conditions under which this property holds for constrained QPs (as those found in network optimization and optimal control), thus extending existing work that addresses unconstrained QPs. The numerical behavior of the Schwarz scheme is demonstrated by using a DC optimal power flow problem defined over a network with 9,241 nodes.
|
|
WeA10 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Stochastic Optimal Control II |
|
|
Chair: Moon, Jun | Hanyang University |
Co-Chair: Karlsson, Johan | KTH Royal Institute of Technology |
|
13:00-13:15, Paper WeA10.1 | Add to My Program |
A General Framework for Bounding Approximate Dynamic Programming Schemes |
|
Liu, Yajing | National Renewable Energy Laboratory |
Chong, Edwin K. P. | Colorado State University |
Pezeshki, Ali | Colorado State University |
Zhang, Zhenliang | Intel Corp |
Keywords: Stochastic optimal control, Optimization, Discrete event systems
Abstract: For years, there has been interest in approximation methods for solving dynamic programming problems, because of the inherent complexity in computing optimal solutions characterized by Bellman’s principle of optimality. A wide range of approximate dynamic programming (ADP) methods now exists. It is of great interest to guarantee that the performance of an ADP scheme be at least some known fraction, say , of optimal. This paper introduces a general approach to bounding the performance of ADP methods, in this sense, in the stochastic setting. The approach is based on new results for bounding greedy solutions in string optimization problems, where one has to choose a string (ordered set) of actions to maximize an objective function. This bounding technique is inspired by submodularity theory, but submodularity is not required for establishing bounds. Instead, the bounding is based on quantifying certain notions of curvature of string functions; the smaller the curvatures the better the bound. The key insight is that any ADP scheme is a greedy scheme for some surrogate string objective function that coincides in its optimal solution and value with those of the original optimal control problem. The ADP scheme then yields to the bounding technique mentioned above, and the curvatures of the surrogate objective determine the value of the bound. The surrogate and its curvatures depend on the specific ADP.
|
|
13:15-13:30, Paper WeA10.2 | Add to My Program |
Robust Dual Control of Batch Processes with Parametric Uncertainty Using Proximal Policy Optimization |
|
Byun, Ha-Eun | Korea Advanced Institute of Science and Technology (KAIST) |
Kim, Boeun | University of Wisconsin-Madison |
Lee, Jay H. | Korea Advanced Institute of Science and Technology |
Keywords: Stochastic optimal control, Robust adaptive control, Machine learning
Abstract: This study presents a robust dual control method for batch processes under parametric uncertainty. Proximal policy optimization (PPO), a policy gradient reinforcement learning algorithm, is employed to construct an implicit dual controller in a computationally amenable way. The proposed control method can robustly and actively cope with uncertainties seen in a repeated sequence of batch operations by incorporating a penalty term for constraint violation into the reward function and by considering the effect of control inputs on future uncertainty. An application to a bioethanol fermentation process is discussed to demonstrate the effectiveness of the proposed control strategy. It is shown that the proposed robust dual controller has an active learning feature such that the overall performance improves compared to a certainty-equivalence based approach.
|
|
13:30-13:45, Paper WeA10.3 | Add to My Program |
Optimal Steering of Ensembles with Origin-Destination Constraints |
|
Haasler, Isabel | KTH Royal Institute of Technology |
Chen, Yongxin | Georgia Institute of Technology |
Karlsson, Johan | KTH Royal Institute of Technology |
Keywords: Stochastic optimal control, Transportation networks, Computational methods
Abstract: We consider the optimal control problem of steering a collection of agents over a network. The group behavior of an ensemble is often modeled by a distribution, and thus the optimal control problem we study can be cast as a distribution steering problem. While most existing works for steering distributions require the agents in the ensemble to be indistinguishable, we consider the setting where agents have specified origin-destination constraints. This control problem also resembles a minimum cost network flow problem with a massive number of commodities. We propose a novel optimal transport based framework for this problem and derive an efficient algorithm for solving it. This framework extends multi-marginal optimal transport theory to settings with capacity and origin-destination constraints. The proposed method is illustrated on a numerical simulation for traffic planning.
|
|
13:45-14:00, Paper WeA10.4 | Add to My Program |
A Feedback Nash Equilibrium for Linear-Quadratic Zero-Sum Stochastic Differential Games with Random Coefficients |
|
Moon, Jun | Hanyang University |
Keywords: Stochastic optimal control, Stochastic systems, Game theory
Abstract: We study the linear-quadratic zero-sum stochastic differential game with random coefficients, where the coefficients of the stochastic differential equation (SDE) are random processes and both additive and state multiplicative noise are included in the diffusion term of the corresponding SDE. By applying It^o-Kunita's formula to the quadratic random field, we develop a direct approach, also known as the completion of squares method, to characterize the explicit (feedback) Nash equilibrium and obtain the optimal game value. We show that the corresponding Nash equilibrium is linear in state characterized in terms of the stochastic Riccati differential equation and the linear backward SDE. We also verify the optimality of the Nash equilibrium by characterizing the smooth solution of the stochastic Hamilton-Jacobi-Isaacs equation that is the second-order stochastic partial differential equation obtained from dynamic programming.
|
|
14:00-14:15, Paper WeA10.5 | Add to My Program |
Harvesting Energy from a Periodic Heat Bath |
|
Fu, Rui | University of California, Irvine |
Movilla Miangolarra, Olga | UC Irvine |
Taghvaei, Amirhossein | University of California Irvine |
Chen, Yongxin | Georgia Institute of Technology |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Stochastic optimal control, Stochastic systems, Optimal control
Abstract: The context of the present paper is stochastic thermodynamics–an approach to nonequilibrium thermodynamics rooted within the broader framework of stochastic control. In contrast to the classical paradigm of Carnot engines, we herein propose to consider thermodynamic processes with periodic continuously varying temperature of a heat bath and study questions of maximal power and efficiency for two idealized cases, overdamped (first-order) and underdamped (secondorder) stochastic models. We highlight properties of optimal periodic control, derive and numerically validate approximate formulae for the optimal performance (power and efficiency)
|
|
14:15-14:30, Paper WeA10.6 | Add to My Program |
Risk-Constrained Linear Quadratic Regulators |
|
Tsiamis, Anastasios | University of Pennsylvania |
Kalogerias, Dionysios | University of Pennsylvania |
Chamon, Luiz F. O. | University of Pennsylvania |
Ribeiro, Alejandro | University of Pennsylvania |
Pappas, George J. | University of Pennsylvania |
Keywords: Stochastic optimal control, Stochastic systems, Robust control
Abstract: We propose a new risk-constrained reformulation of the standard Linear Quadratic Regulator (LQR) problem. Our framework is motivated by the fact that the classical (risk-neutral) LQR controller, although optimal in expectation, might be ineffective under relatively infrequent, yet statistically significant events. To effectively trade between average and extreme event performance, we introduce a new risk constraint, which explicitly restricts the total expected predictive variance of the state penalty by a user-prescribed level. We show that, under rather minimal conditions on the process noise~(i.e., finite fourth-order moments), the optimal risk-aware controller can be evaluated explicitly and in closed form. In fact, it is affine relative to the state, and is always internally stable regardless of parameter tuning. Our new risk-aware controller: i) pushes the state away from directions where the noise exhibits heavy tails, by exploiting the third moment~(skewness) of the noise; ii) inflates the state penalty in riskier directions, where both the noise covariance and the state penalty are simultaneously large. The properties of the proposed risk-aware LQR framework are also illustrated via indicative numerical examples.
|
|
14:30-14:45, Paper WeA10.7 | Add to My Program |
Stochastic Control with Random Coefficients under Recursive-Type Objective Functionals |
|
Moon, Jun | Hanyang University |
Kim, Yoonsoo | Gyeongsang National University |
Keywords: Stochastic optimal control, Stochastic systems, Uncertain systems
Abstract: We consider the stochastic optimal control problem with random coefficients under recursive-type objective functionals captured by backward stochastic differential equations (with random coefficients). The associated Hamilton-Jacobi-Bellman (HJB) equation obtained from the dynamic programming principle is a second-order nonlinear stochastic HJB (SHJB) equation (or stochastic PDE). The solvability of the SHJB equation, together with It^o-Kunita's formula, leads to the verification theorem that is the sufficient condition for optimality. We also show the existence and uniqueness of the (weak) solution to the SHJB equation via the Sobolev space technique. As an application, the linear-quadratic control problem is considered, for which we obtain an explicit optimal solution by applying the verification theorem.
|
|
14:45-15:00, Paper WeA10.8 | Add to My Program |
Stabilizing Optimal Density Control of Nonlinear Agents with Multiplicative Noise |
|
Bakshi, Kaivalya | Southwest Research Institute |
Theodorou, Evangelos A. | Georgia Institute of Technology |
GROVER, PIYUSH | University of Nebraska-Lincoln |
Keywords: Stochastic optimal control, Stability of nonlinear systems, Large-scale systems
Abstract: Control of continuous time dynamics with multiplicative noise is a classic topic in stochastic optimal control. This work addresses the problem of designing infinite horizon optimal controls with stability guarantees for large populations of identical, non-cooperative and non-networked agents with multi-dimensional and nonlinear stochastic dynamics excited by multiplicative noise. We provide constraints on the state and control cost functions which guarantee stability of the closed-loop system under the action of the individual optimal controls, for agent dynamics belonging to the the class of reversible diffusion processes. A condition relating the state-dependent control cost and volatility is introduced to prove the stability of the equilibrium density. This condition is a special case of the constraint required to use the path integral Feynman-Kac formula for computing the control. We investigate the connection between the stabilizing optimal control and the path integral formalism, leading us to a control law formulation expressed exclusively in terms of the desired equilibrium density.
|
|
WeA11 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Analysis and Control of Large-Scale Autonomous Networks III |
|
|
Chair: Zamani, Majid | University of Colorado Boulder |
Co-Chair: Scherpen, Jacquelien M.A. | University of Groningen |
Organizer: Noroozi, Navid | Ludwig-Maximilians-Universität München |
Organizer: Lazar, Mircea | Eindhoven University of Technology |
Organizer: Zamani, Majid | University of Colorado Boulder |
|
13:00-13:15, Paper WeA11.1 | Add to My Program |
A Passivity-Inspired Design of Power-Voltage Droop Controllers for DC Microgrids with Electrical Network Dynamics (I) |
|
Machado Martínez, Juan Eduardo | University of Groningen |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Power systems, Decentralized control, Large-scale systems
Abstract: We propose a design procedure for a power-voltage droop controller in structure-preserving DC microgrids under explicit consideration of the electrical network dynamics. Differently from most related literature, the system’s controlled output is taken as the power—not the current—injection at each generation unit, yielding a nonlinear closed-loop system. This makes the output regulation problem non-trivial, yet far more appealing in a practical setting than the usual linear current-voltage droop control. Our approach is inspired by passivity-based control design in the sense that we exploit the natural port-Hamiltonian representation of the system dynamics and its associated shifted Hamiltonian to derive a control law together with sufficient conditions on the tuning gains that guarantee global asymptotic stability. The analysis is illustrated via detailed simulations, where accurate power sharing is manifested among the distributed generation units in the presence of load variations.
|
|
13:15-13:30, Paper WeA11.2 | Add to My Program |
Rumor-Robust Decentralized Gaussian Process Learning, Fusion, and Planning for Modeling Multiple Moving Targets (I) |
|
Liu, Chang | Cornell University |
Liao, Zhihao | Cornell University |
Ferrari, Silvia | Cornell University |
Keywords: Sensor networks, Sensor fusion, Decentralized control
Abstract: This paper presents a decentralized Gaussian Process (GP) learning, fusion, and planning (RESIN) formalism for mobile sensor networks to actively learn target motion models. RESIN is characterized by both computational and communication efficiency, and the robustness to rumor propagation in sensor networks. By using the weighted exponential product rule and the Chernoff information, a rumor-robust decentralized GP fusion approach is developed to generate a globally consistent target trajectory prediction from local GP models. A decentralized information-driven path planning approach is then proposed for mobile sensors to generate informative sensing paths. A novel, constant-sized information sharing strategy is developed for sensing path coordination,and an analytical objective function is derived that significantly reduces the computational complexity of the path planning. The effectiveness of RESIN is demonstrated in simulations.
|
|
13:30-13:45, Paper WeA11.3 | Add to My Program |
Compositional Construction of Control Barrier Certificates for Large-Scale Stochastic Switched Systems |
|
Nejati, Ameneh | Technical University of Munich (TUM) |
Soudjani, Sadegh | Newcastle University |
Zamani, Majid | University of Colorado Boulder |
Keywords: Switched systems, Stochastic systems, Large-scale systems
Abstract: In this paper, we propose a compositional framework for the construction of control barrier certificates for large-scale stochastic switched systems accepting multiple control barrier certificates with some dwell-time conditions. The proposed scheme is based on a notion of so-called augmented pseudo-barrier certificates computed for each switched subsystem, using which one can compositionally synthesize state-feedback controllers for interconnected systems enforcing safety specifications over a finite-time horizon. In particular, we first leverage sufficient max-type small-gain conditions to compositionally construct augmented control barrier certificates for interconnected systems. We the quantify upper bounds on exit probabilities - the probability that an interconnected system reaches certain unsafe regions - in a finite-time horizon. We employ a technique based on a counter-example guided inductive synthesis (CEGIS) approach to search for control barrier certificates of each mode while synthesizing safety controllers providing switching signals. We demonstrate our proposed results by applying them to two different case studies.
|
|
13:45-14:00, Paper WeA11.4 | Add to My Program |
A Distributed Algorithm for Sequential Decision Making in Multi-Armed Bandit with Homogeneous Rewards (I) |
|
Zhu, Jingxuan | Stony Brook University |
Sandhu, Romeil | Stony Brook University |
Liu, Ji | Stony Brook University |
Keywords: Cooperative control, Agents-based systems
Abstract: This paper studies a distributed multi-armed bandit problem over a network of N agents, each of which can communicate its local observations only with its neighbors, where neighbor relationships are described by a connected graph mathbb G. Each agent makes a sequence of decisions on selecting an arm from M candidates, yet it only have access to local samples of the reward for each action, which is a random variable. A distributed upper confidence bound (UCB) algorithm is proposed for the agents to cooperatively learn the best decision. It is shown that when all the agents share a homogeneous distribution of each arm reward, the algorithm achieves guaranteed weak regret at O([(1+(N-1)rho_{2})^{2}log T]/N) for all N agents, where rho_2 denotes the second largest among the absolute value of all the eigenvalues of the Metropolis matrix of bbb G. A sufficient condition under which the proposed distributed algorithm learns faster than the centralized (single-agent) counterpart is provided.
|
|
14:00-14:15, Paper WeA11.5 | Add to My Program |
Aspects of Fairness in Robust, Distributed Control of Interconnected Systems |
|
Axelson-Fisk, Magnus | Uppsala University |
Knorn, Steffi | Otto-Von-Guericke University Magdeburg |
Keywords: Control of networks, Decentralized control, Large-scale systems
Abstract: This paper studies the distribution of param- eter values in large-scale interconnected systems. Using γ- robustness, a bound on the effect of external disturbances acting on subsystems can be found. However, due to the interconnected nature of the system, the same bound can be reached with different choices of distributed control parameters leading to different local control costs. A notion of fairness and a measurement of unfairness is presented to describe and measure the effects of unevenly distributed parameters in the system. The distribution of bounds on state deviations due to external disturbances is studied and conditions and algorithms are given to ensure such bounds while fairly distributing the required control efforts. Decentralized algorithms to reach fairness or reduce unfairness are presented for various assumptions on systems and constraints. The results are illustrated by examples.
|
|
14:15-14:30, Paper WeA11.6 | Add to My Program |
Exponential Stability and Local ISS for DC Networks |
|
Ferguson, Joel | University of Newcastle |
Cucuzzella, Michele | University of Groningen |
Scherpen, Jacquelien M.A. | University of Groningen |
Keywords: Nonlinear output feedback, Robust control, Lyapunov methods
Abstract: In this letter, we consider the problem of regulating the voltage of an islanded Direct Current (DC) network subject to (i) unknown ZIP-loads, i.e., nonlinear loads with the parallel combination of constant impedance (Z), current (I) and power (P) components, and (ii) unknown time-varying disturbances. Using the port-Hamiltonian framework, two decentralized passivity-based control schemes are designed. It is shown that, using the proposed controllers, the desired equilibrium is exponentially stable and local input-to-state stable (LISS) with respect to unknown time-varying disturbances.
|
|
14:30-14:45, Paper WeA11.7 | Add to My Program |
Systematic Analysis of Distributed Optimization Algorithms Over Jointly-Connected Networks |
|
Van Scoy, Bryan | Miami University |
Lessard, Laurent | Northeastern University |
Keywords: Networked control systems, Optimization algorithms, LMIs
Abstract: We consider the distributed optimization problem, where a group of agents work together to optimize a common objective by communicating with neighboring agents and performing local computations. For a given algorithm, we use tools from robust control to systematically analyze the performance in the case where the communication network is time-varying. In particular, we assume only that the network is jointly connected over a finite time horizon (commonly referred to as B-connectivity), which does not require connectivity at each time instant. When applied to the distributed algorithm DIGing, our bounds are orders of magnitude tighter than those available in the literature.
|
|
14:45-15:00, Paper WeA11.8 | Add to My Program |
A Relaxed Small-Gain Theorem for Discrete-Time Infinite Networks (I) |
|
Noroozi, Navid | Ludwig-Maximilians-Universität München |
Mironchenko, Andrii | University of Passau |
Wirth, Fabian | University of Passau |
Keywords: Network analysis and control, Large-scale systems, Lyapunov methods
Abstract: This paper provides a small-gain theorem for a so-called infinite network, i.e. a network composed of infinitely many finite-dimensional systems. Such a network is mainly motivated by addressing the scalability issue in large-but-finite networks. We develop a so-called relaxed small-gain theorem for input-to-state stability (ISS) with respect to a closed set. It is shown that every exponentially ISS network necessarily satisfies the proposed small-gain condition. Finally, we truncate the infinite network to obtain a large-but-finite network for all stability properties and performance indices obtained for its infinite counterpart are preserved, if each subsystem is individually ISS. The effectiveness of our small-gain theorem is verified by application to an urban traffic network.
|
|
WeA12 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Networked Control Systems II |
|
|
Chair: Zanon, Mario | IMT Institute for Advanced Studies Lucca |
Co-Chair: Skarin, Per | Lund University and Ericsson Research |
|
13:00-13:15, Paper WeA12.1 | Add to My Program |
Initial-Value Privacy of Linear Dynamical Systems |
|
Wang, Lei | The University of Sydney |
Manchester, Ian R. | University of Sydney |
Trumpf, Jochen | Australian National University |
Shi, Guodong | The University of Sydney |
Keywords: Linear systems, Networked control systems, Control Systems Privacy
Abstract: This paper studies initial-value privacy problems of linear dynamical systems. We consider a standard linear time-invariant system with random process and measurement noises. For such a system, eavesdroppers having access to system output trajectories may infer the system initial states, leading to initial-value privacy risks. When a finite number of output trajectories are eavesdropped, we consider a requirement that any guess about the initial values can be plausibly denied. When an infinite number of output trajectories are eavesdropped, we consider a requirement that the initial values should not be uniquely recoverable. In view of these two privacy requirements, we define differential initial-value privacy and intrinsic initial-value privacy, respectively, for the system as metrics of privacy risks. First of all, we prove that the intrinsic initial-value privacy is equivalent to unobservability, while the differential initial-value privacy can be achieved for a privacy budget depending on an extended observability matrix of the system and the covariance of the noises. Next, the inherent network nature of the considered linear system is explored, where each individual state corresponds to a node and the state and output matrices induce interaction and sensing graphs, leading to a network system. Under this network system perspective, we allow the initial states at some nodes to be public, and investigate the resulting intrinsic initial-value privacy of each individual node. We establish necessary and sufficient conditions for such individual node initial-value privacy, and also prove that the intrinsic initial-value privacy of individual nodes is generically determined by the network structure.
|
|
13:15-13:30, Paper WeA12.2 | Add to My Program |
Interesting Phenomena in the Synchronisation of Disturbed Systems |
|
Wissing, Marc | Ruhr-Universität Bochum, IAV GmbH |
Oehlschlaegel, Thimo | IAV GmbH |
Lunze, Jan | Ruhr-Universität Bochum |
Keywords: Networked control systems, Linear systems, Distributed control
Abstract: This paper presents three interesting phenomena that occur in the synchronisation of disturbed systems. The first phenomenon describes the necessity that dynamical local controllers have to be used to introduce the model of the disturbances into each of the extended agents. This result extends the Internal-Model Principle from multivariable control to multi-agent systems. A necessary and sufficient synchronisation condition is given. The second phenomenon concerns the fact that the internal disturbance models of all agents modify the synchronous trajectory of the overall system. For specific initial states the synchronous trajectory is completely fixed by the disturbance models and does not reflect the properties of the agents. This fact leads to the third phenomenon that indicates that an undisturbed virtual agent has to be introduced into the overall system to ensure that the synchronous trajectory is generated by the agent dynamics like in undisturbed multi-agent systems. The considered phenomena are illustrated by numerical examples.
|
|
13:30-13:45, Paper WeA12.3 | Add to My Program |
Optimal Control Design for Perturbed Constrained Networked Control Systems |
|
Bahraini, Masoud | Chalmers University of Technology |
Zanon, Mario | IMT Institute for Advanced Studies Lucca |
Colombo, Alessandro | Politecnico Di Milano |
Falcone, Paolo | Chalmers University of Technology |
Keywords: Networked control systems, Constrained control
Abstract: This paper focuses on an optimal control design problem for a class of perturbed networked control systems where a number of systems, subject to state and input constraints, share a communication network with limited bandwidth. We first formulate an optimal control design problem with a constant feedback gain in order to minimize the communication demand for each system while guaranteeing satisfaction of state and input constraints; we show that this optimization problem is very hard to solve. Then, we formulate the same optimal control design problem with a non-constant feedback gain; we argue that this problem is less difficult and results in a lower, or equal, communication demand in comparison to the design with the constant feedback gain. We illustrate and compare these optimal control designs by a simple example.
|
|
13:45-14:00, Paper WeA12.4 | Add to My Program |
Interplay between Resilience and Accuracy in Resilient Vector Consensus in Multi-Agent Networks |
|
Abbas, Waseem | Vanderbilt University |
Shabbir, Mudassir | Information Technology University |
Li, Jiani | Vanderbilt University |
Koutsoukos, Xenofon | Vanderbilt University |
Keywords: Networked control systems, Network analysis and control, Fault tolerant systems
Abstract: In this paper, we study the relationship between resilience and accuracy in the resilient distributed multi-dimensional consensus problem. We consider a network of agents, each of which has a state in R d. Some agents in the network are adversarial and can change their states arbitrarily. The normal (non-adversarial) agents interact locally and update their states to achieve consensus at some point in the convex hull C of their initial states. This objective is achievable if the number of adversaries in the neighborhood of normal agents is less than a specific value, which is a function of the local connectivity and the state dimension d. However, to be resilient against adversaries, especially in the case of large d, the desired local connectivity is large. We discuss that resilience against adversarial agents can be improved if normal agents are allowed to converge in a bounded region B, which means normal agents converge at some point close to but not necessarily inside C in the worst case. The accuracy of resilient consensus can be measured by the Hausdorff distance between B and C. As a result, resilience can be improved at the cost of accuracy. We propose a resilient bounded consensus algorithm that exploits the trade-off between resilience and accuracy by projecting d-dimensional states into lower dimensions and then solving instances of resilient consensus in lower dimensions. We analyze the algorithm, present various resilience and accuracy bounds, and also numerically evaluate our results.
|
|
14:00-14:15, Paper WeA12.5 | Add to My Program |
Sparse Linear Injection Attack on Multi-Agent Consensus Control Systems |
|
Tsang, Kam Fai Elvis | Hong Kong University of Science and Technology |
HUANG, Mengyu | The Hong Kong University of Science and Technology |
Johansson, Karl H. | Royal Institute of Technology |
Shi, Ling | Hong Kong University of Science and Technology |
Keywords: Networked control systems, Network analysis and control, Optimization
Abstract: This paper investigates the problem of false data injection attack on the communication channels in a multiagent system executing a consensus protocol. We formulate a non-convex optimisation problem for an attack strategy with minimal one-step attack energy to guarantee instability of the consensus system. We propose an algorithm to solve the problem efficiently. Numerical simulations are provided to illustrate the effectiveness of the attack strategy.
|
|
14:15-14:30, Paper WeA12.6 | Add to My Program |
Distributed Computation of Graph Matching in Multi-Agent Networks |
|
Tran, Quoc Van | GIST |
Sun, Zhiyong | Eindhoven University of Technology (TU/e) |
Anderson, Brian D.O. | Australian National University |
Ahn, Hyo-Sung | Gwangju Institute of Science and Technology (GIST) |
Keywords: Networked control systems, Network analysis and control, Distributed control
Abstract: This work considers the distributed computation of the one-to-one vertex correspondences between two undirected and connected graphs, which is called graph matching, over multi-agent networks. Given two isomorphic and asymmetric graphs, there is a unique permutation matrix that maps the vertices in one graph to the vertices in the other. Based on a convex relaxation of graph matching in Aflalo et al. [1], we propose a distributed computation of graph matching as a distributed convex optimization problem subject to equality constraints and a global set constraint, using a network of multiple agents whose interaction graph is connected. Each agent in the network only knows one column of each of the adjacency matrices of the two graphs, and all agents collaboratively learn the graph matching by exchanging information with their neighbors. The proposed algorithm employs a projected primaldual gradient method to handle equality constraints and a set constraint. Under the proposed algorithm, the agents’ estimates of the permutation matrix converge to the optimal permutation globally and exponentially fast. Finally, simulation results are given to illustrate the effectiveness of the method.
|
|
14:30-14:45, Paper WeA12.7 | Add to My Program |
Rollout Scheduling and Control for Disturbed Systems Via Tube MPC |
|
Wildhagen, Stefan | University of Stuttgart |
Allgöwer, Frank | University of Stuttgart |
Keywords: Networked control systems, Predictive control for linear systems, Control over communications
Abstract: Rollout control is an MPC-based control method, in which a controller is periodically activated to schedule the transmission of sensor or actuator data. Therein, a preassigned traffic specification acts as a constraint on the scheduled transmissions, ensuring that they are triggered at an admissible rate for the underlying communication network. In this paper, we extend the theory of rollout control by considering bounded disturbances on the controlled plant in the presence of state and input constraints. We use methods from tube MPC, where the error between the nominal (undisturbed) system, used as a prediction model, and the real system is kept in a robust control invariant set. This approach requires satisfaction of a emph{maximum} inter-transmission interval in closed loop, a guarantee that is not straightforward to obtain in rollout control. Our main contribution is to introduce a novel tube MPC scheme adapted for rollout control, which contains an additional constraint on the predicted transmission schedule. We show that by virtue of this schedule constraint, the inter-transmission interval is bounded in closed loop. We also establish recursive feasibility of this novel MPC scheme and show convergence of the system.
|
|
14:45-15:00, Paper WeA12.8 | Add to My Program |
A Cloud-Enabled Rate-Switching MPC Architecture |
|
Skarin, Per | Lund University and Ericsson Research |
Eker, Johan | Lund University |
Arzen, Karl-Erik | Lund Inst. of Technology |
Keywords: Networked control systems, Predictive control for linear systems, Switched systems
Abstract: A two-tier architecture for cloud-based MPC is presented consisting of a high rate MPC in the cloud and a low rate MPC on the local device. The system use the cloud MPC as the nominal controller but switches to local MPC in case of an unresponsive network. The two MPCs are designed to be as similar to each other as possible except for the sampling rate. Different alternatives for when to execute the local MPC and how to perform the switching are presented. The approach is evaluated by simulation.
|
|
WeA13 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Inference and Prediction Using Machine Learning |
|
|
Chair: Nguyen, Duc Thien | IBM |
Co-Chair: Olfat, Mahbod | UC Berkeley |
|
13:00-13:15, Paper WeA13.1 | Add to My Program |
Bayesian Safe Learning and Control with Sum-Of-Squares Analysis and Polynomial Kernels |
|
Devonport, Alex | University of California, Berkeley |
Yin, He | University of California, Berkeley |
Arcak, Murat | University of California, Berkeley |
Keywords: Learning, Uncertain systems, Estimation
Abstract: We propose an iterative method to safely learn the unmodeled dynamics of a control-affine nonlinear system using Bayesian Gaussian process (GP) models with polynomial kernel functions. The method maintains safety by ensuring that the system state stays within the region of attraction (ROA) of a stabilizing control policy while collecting data. A quadratic programming based exploration control policy is computed to keep the exploration trajectory inside an inner-approximation of the ROA and to maximize the information gained from the trajectory. A prior GP model, which incorporates prior information about the unknown dynamics, is used to construct an initial stabilizing policy. As the GP model is updated with data, it is used to synthesize a new policy and a larger ROA, which increases the range of safe exploration. The use of polynomial kernels allows us to compute ROA inner-approximations and stabilizing control laws for the model using sum-of-squares programming. We also provide a probabilistic guarantee of safety which ensures that the policy computed using the learned model stabilizes the true dynamics with high confidence.
|
|
13:15-13:30, Paper WeA13.2 | Add to My Program |
Variational Bayesian Inference for Crowdsourcing Predictions |
|
Cai, Desmond | IBM |
Nguyen, Duc Thien | IBM |
Lim, Shiau Hong | IBM Research |
Wynter, Laura | IBM Watson Research Center |
Keywords: Machine learning
Abstract: Crowdsourcing has emerged as an effective means for performing a number of machine learning tasks such as annotation and labelling of images and other data sets. In most early settings of crowdsourcing, the task involved classification, that is assigning one of a discrete set of labels to each task. Recently, however, more complex tasks have been attempted including asking crowdsource workers to assign continuous labels, or predictions. In essence, this involves the use of crowdsourcing for function estimation. We are motivated by this problem to drive applications such as collaborative prediction, that is, harnessing the wisdom of the crowd to predict quantities more accurately. To do so, we propose a Bayesian approach aimed specifically at alleviating overfitting, a typical impediment to accurate prediction models in practice. In particular, we develop a variational Bayesian technique for two different worker noise models – one that assumes workers’ noises are independent and the other that assumes workers’ noises have a latent low-rank structure. Our evaluations on synthetic and real-world datasets demonstrate that these Bayesian approaches perform significantly better than existing non-Bayesian approaches and are thus potentially useful for this class of crowdsourcing problems.
|
|
13:30-13:45, Paper WeA13.3 | Add to My Program |
Epistemic Uncertainty Quantification in State-Space LPV Model Identification Using Bayesian Neural Networks |
|
Bao, Yajie | The University of Georgia |
Mohammadpour Velni, Javad | University of Georgia |
Shahbakhti, Mahdi | University of Alberta |
Keywords: Machine learning, Linear parameter-varying systems, Neural networks
Abstract: This paper presents a variational Bayesian inference Neural Network (BNN) approach to quantify uncertainties in matrix function estimation for the state-space linear parameter-varying (LPV) model identification problem using only inputs/outputs data. The proposed method simultaneously estimates states and posteriors of matrix functions given data. In particular, states are estimated by reaching a consensus between an estimator based on past system trajectory and an estimator by recurrent equations of states; posteriors are approximated by minimizing the Kullback–Leibler (KL) divergence between the parameterized posterior distribution and the true posterior of the LPV model parameters. Furthermore, techniques such as transfer learning are explored in this work to reduce computational cost and prevent convergence failure of Bayesian inference. The proposed data-driven method is validated using experimental data for identification of a control-oriented reactivity controlled compression ignition (RCCI) engine model.
|
|
13:45-14:00, Paper WeA13.4 | Add to My Program |
Bayesian Perceptron: Towards Fully Bayesian Neural Networks |
|
Huber, Marco | University of Stuttgart |
Keywords: Machine learning, Statistical learning, Kalman filtering
Abstract: Artificial neural networks (NNs) have become the de facto standard in machine learning. They allow learning highly nonlinear transformations in a plethora of applications. However, NNs usually only provide point estimates without systematically quantifying corresponding uncertainties. In this paper a novel approach towards fully Bayesian NNs is proposed, where training and predictions of a perceptron are performed within the Bayesian inference framework in closed-form. The weights and the predictions of the perceptron are considered Gaussian random variables. Analytical expressions for predicting the perceptron's output and for learning the weights are provided for commonly used activation functions like sigmoid or ReLU. This approach requires no computationally expensive gradient calculations and further allows sequential learning.
|
|
14:00-14:15, Paper WeA13.5 | Add to My Program |
Safety Guarantees for Iterative Predictions with Gaussian Processes |
|
Polymenakos, Kyriakos | University of Oxford |
Laurenti, Luca | University of Oxford |
Patane, Andrea | University of Oxford |
Calliess, Jan-Peter | University of Oxford |
Cardelli, Luca | University of Oxford |
Kwiatkowska, Marta | University of Oxford |
Abate, Alessandro | University of Oxford |
Roberts, Stephen | University of Oxford |
Keywords: Machine learning, Uncertain systems, Formal Verification/Synthesis
Abstract: Gaussian Processes (GPs) are widely employed in control and learning because of their principled treatment of uncertainty. However, tracking uncertainty for iterative, multi-step predictions in general leads to an analytically intractable problem. While approximation methods exist, they do not come with guarantees, making it difficult to estimate their reliability and to trust their predictions. In this work, we derive formal probability error bounds for iterative predictions with GPs. Building on GP properties, we bound the probability that random trajectories lie in specific regions around the predicted values. Namely, given a tolerance epsilon > 0 , we compute regions around the predicted trajectory values, such that GP trajectories are guaranteed to lie inside them with probability at least 1-epsilon. We verify experimentally that our method tracks the predictive uncertainty correctly, even when current approximation techniques fail. Furthermore, we show how the proposed bounds can incorporate a given control law, and effectively bound the trajectories of the closed-loop system.
|
|
14:15-14:30, Paper WeA13.6 | Add to My Program |
Average Margin Regularization for Classifiers |
|
Olfat, Mahbod | UC Berkeley |
Aswani, Anil | UC Berkeley |
Keywords: Machine learning, Optimization, Subspace methods
Abstract: Adversarial robustness has become an important research topic given empirical demonstrations on the lack of robustness of deep neural networks. Unfortunately, recent theoretical results suggest that adversarial training induces a strict tradeoff between classification accuracy and adversarial robustness. In this paper, we propose and then study a new regularization for any margin classifier or deep neural network. We motivate this regularization by a novel generalization bound that shows a tradeoff in classifier accuracy between maximizing its margin and average margin. We thus call our approach an average margin (AM) regularization, and it consists of a linear term added to the objective. We theoretically show that for certain distributions AM regularization can both improve classifier accuracy and robustness to adversarial attacks. We conclude by using both synthetic and real data to empirically show that AM regularization can strictly improve both accuracy and robustness for support vector machine's (SVM's), relative to unregularized classifiers and adversarially trained classifiers.
|
|
14:30-14:45, Paper WeA13.7 | Add to My Program |
On the Sample Complexity of Data-Driven Inference of the mathcal{L}_2-Gain |
|
Sharf, Miel | Israel Institute of Technology |
Keywords: Machine learning, Modeling
Abstract: Lately, data-driven control has become a widespread area of research. A few recent big-data based approaches for data-driven control of nonlinear systems try to use classical input-output techniques to design controllers for systems for which only a finite number of (input-output) samples are known. These methods focus on using the given data to compute bounds on the mathcal{L}_2-gain or on the shortage of passivity from finite input-output data, allowing for the application of the small gain theorem or the feedback theorem for passive systems. One question regarding these methods asks about their sample complexity, namely how many input-output samples are needed to get an approximation of the operator norm or of the shortage of passivity. We show that the number of samples needed to estimate the operator norm of a system is roughly the same as the number of samples required to approximate the system in the operator norm.
|
|
14:45-15:00, Paper WeA13.8 | Add to My Program |
Learning Multiple Nonlinear Dynamical Systems with Side Information |
|
Takeishi, Naoya | RIKEN |
Kawahara, Yoshinobu | Kyushu University / RIKEN |
Keywords: Machine learning, Learning, Nonlinear systems identification
Abstract: We address the problem of learning multiple dynamical systems, which is a kind of multi-task learning (MTL). The existing methods of MTL do not apply to learning dynamical systems in general. In this work, we develop a regularization method to perform MTL for dynamical systems appropriately. The proposed method is based on an operator-theoretic metric on dynamics that is agnostic of model parametrization and applicable even for nonlinear dynamics models. We calculate the proposed MTL-like regularization by estimating the metric from trajectories generated during training. Learning time-varying systems can be regarded as a special case of the usage of the proposed method. The proposed regularizer is versatile as we can straightforwardly incorporate it into off-the-shelf gradient-based optimization methods. We show the results of experiments on synthetic and real-world datasets, which exhibits the validity of the proposed regularizer.
|
|
WeA15 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Cyber-Physical System Security |
|
|
Chair: Griffioen, Paul | Carnegie Mellon University |
Co-Chair: Mo, Yilin | Tsinghua University |
Organizer: Griffioen, Paul | Carnegie Mellon University |
Organizer: Sinopoli, Bruno | Washington University in St Louis |
Organizer: Mo, Yilin | Tsinghua University |
Organizer: Johansson, Karl H. | Royal Institute of Technology |
|
13:00-13:15, Paper WeA15.1 | Add to My Program |
Reinforcement Learning Based Approach for Flip Attack Detection (I) |
|
Liu, Hanxiao | Nanyang Technological University |
Li, Yuchao | KTH Royal Institute of Technology |
Mårtensson, Jonas | KTH Royal Institute of Technology |
Xie, Lihua | Nanyang Tech. Univ |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Attack Detection, Learning, Sensor networks
Abstract: This paper addresses the detection problem of flip attacks to sensor network systems where the attacker flips the distribution of manipulated sensor measurements of a binary state. The detector decides to continue taking observations or to stop based on the sensor measurements, and the goal is to have the flip attack recognized as fast as possible while trying to avoid terminating the measurements when no attack is present. The detection problem can be modeled as a partially observable Markov decision process (POMDP) by assuming an attack probability, with the dynamics of the hidden states of the POMDP characterized by a stochastic shortest path (SSP) problem. The optimal policy of the SSP solely depends on the transition costs and is independent of the assumed attack possibility. By using a fixed-length window and suitable feature function of the measurements, a Markov decision process (MDP) is used to approximate the behavior of the POMDP. The optimal solution of the approximated MDP can then be solved by any standard reinforcement learning methods. Numerical evaluations are given to demonstrate the effectiveness of the proposed method.
|
|
13:15-13:30, Paper WeA15.2 | Add to My Program |
Data-Driven Attack Detection for Linear Systems |
|
Krishnan, Vishaal | University of California, Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Pattern recognition and classification, Subspace methods, Machine learning
Abstract: This paper studies the attack detection problem in a data-driven and model-free setting, for deterministic systems with linear and time-invariant dynamics. Differently from existing studies that leverage knowledge of the system dynamics to derive security bounds and monitoring schemes, we treat the cases where the system dynamics, as well as the attack strategy and attack location, are unknown. We derive fundamental security limitations as a function of only the observed data and without estimating the system dynamics (in fact, no assumption is made on the identifiability of the system). In particular, (i) we derive detection limitations as a function of the informativity and length of the observation window, (ii) provide a data-driven characterization of undetectable attacks, and (iii) construct a data-driven detection monitor. Surprisingly, our results show that while data-driven monitoring requires a larger observation window to attain attack detection capability, once attained it shares the same limitations as model-based monitoring.
|
|
13:30-13:45, Paper WeA15.3 | Add to My Program |
Anomaly Detection in Systems with Periodic Outputs (I) |
|
Cuesta-Garcia, Jose Ricardo | CICESE Research Center |
Alvarez, Joaquin | CICESE |
Ruths, Justin | University of Texas at Dallas |
Pena Ramirez, Jonatan | Centro De Investigación Científica Y De Educación Superior De En |
Keywords: Attack Detection, Cyber-Physical Security, Control Systems Privacy
Abstract: This paper presents a strategy for detecting anomalies—behavior that is different from nominal—in systems with periodic outputs. Different from classical detectors available in the literature which require the computation of a residual, the detection algorithms proposed here are based on the concept of the Poincaré map and on the notion of cyclic group. The performance of the detectors is illustrated by numerical simulations and validated by experiments on an actuated mass-spring-damper oscillator.
|
|
13:45-14:00, Paper WeA15.4 | Add to My Program |
Secure State-Reconstruction Over Networks Subject to Attacks |
|
Mao, Yanwen | University of California, Los Angeles |
Diggavi, Suhas | UCLA |
Fragouli, Christina | University of California, Los Angeles |
Tabuada, Paulo | University of California at Los Angeles |
Keywords: Fault tolerant systems, Network analysis and control, Fault detection
Abstract: Secure state-reconstruction is the problem of reconstructing the state of a linear time-invariant system from sensor measurements that have been corrupted by an adversary. Whereas most work focuses on attacks on sensors, we consider the more challenging case where attacks occur on sensors as well as on nodes and links of a network that transports sensor measurements to a receiver. In this paper we provide necessary and sufficient conditions for the secure state-reconstruction problem to be solvable in the presence of attacks on sensors and on the network. Our results unify existing results in the literature that focus either on attacks on sensors, or on the network.
|
|
14:00-14:15, Paper WeA15.5 | Add to My Program |
Decentralized Event-Triggered Control in the Presence of Adversaries (I) |
|
Griffioen, Paul | Carnegie Mellon University |
Romagnoli, Raffaele | Carnegie Mellon University |
Krogh, Bruce H. | Carnegie Mellon Univ |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Decentralized control, Networked control systems, Resilient Control Systems
Abstract: Decentralized control systems are widely used in a number of situations and applications. In order for these systems to function properly and achieve their desired goals, information must be propagated between agents. However, communication between agents entails connecting to a network, potentially allowing adversaries to infiltrate the system through the network and attack multiple agents. To increase resiliency against these attacks, it is desirable for agents to operate disconnected from the network as much as possible, only communicating with other agents when it is absolutely necessary to achieve their goal or to maintain the safety of the overall system. This in turn decreases communication costs. Previous approaches to decentralized event-triggered control are mainly concerned with minimizing communication costs and therefore assume that every agent is always connected to the network with the ability to receive any information that is sent to it. In this work, we address the issue of maintaining the safety and stability of the overall system when there is no attack but some agents may be disconnected from the network and unable to receive critical information from other agents. We design an event-triggered mechanism for network connection and communication that is a function of only local information and that ensures stability for the overall system in attack-free scenarios. An algorithm describing this communication protocol is provided, and our approach is illustrated via simulation.
|
|
14:15-14:30, Paper WeA15.6 | Add to My Program |
Asymptotic Security of Control Systems by Covert Reaction: Repeated Signaling Game with Undisclosed Belief (I) |
|
Sasahara, Hampei | KTH Royal Institute of Technology |
SARITAS, Serkan | KTH Royal Institute of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Resilient Control Systems, Cyber-Physical Security, Attack Detection
Abstract: This study investigates the relationship between resilience of control systems to attacks and the information available to malicious attackers. Specifically, it is shown that control systems are guaranteed to be secure in an asymptotic manner by rendering reactions against potentially harmful actions covert. The behaviors of the attacker and the defender are analyzed through a repeated signaling game with an undisclosed belief under covert reactions. In the typical setting of signaling games, reactions conducted by the defender are supposed to be public information and the measurability enables the attacker to accurately trace transitions of the defender's belief on existence of a malicious attacker. In contrast, the belief in the game considered in this paper is undisclosed and hence common equilibrium concepts can no longer be employed for the analysis. To surmount this difficulty, a novel framework for decision of reasonable strategies of the players in the game is introduced. Based on the presented framework, it is revealed that any reasonable strategy chosen by a rational malicious attacker converges to the benign behavior as long as the reactions performed by the defender are unobservable to the attacker. The result provides an explicit relationship between resilience and information, which indicates the importance of covertness of reactions for designing secure control systems.
|
|
14:30-14:45, Paper WeA15.7 | Add to My Program |
Authenticated Computation of Control Signal from Dynamic Controllers (I) |
|
Cheon, Jung Hee | Seoul National University |
Kim, Dongwoo | Seoul National University |
Kim, Junsoo | Seoul National University |
Lee, SeungBeom | Seoul National University |
Shim, Hyungbo | Seoul National University |
Keywords: Computer/Network Security, Linear systems
Abstract: Significant concerns on networked control systems are modifications on the control signals caused by a compromise on the network or the controller, since it can cause a devastating behavior or even entire failure of the system. In this paper, we present a fundamental solution to this problem by proposing a new authenticated computation that checks the matrix-vector multiplications---the main arithmetic of a controller---and verifies the updates on the states of the controller. It enables the plant-side not only to check the computation of the controller with much less computational cost than that required for the computation itself, but also to detect any modifications on the control signals.
|
|
14:45-15:00, Paper WeA15.8 | Add to My Program |
The Effect of Behavioral Probability Weighting in a Sequential Defender-Attacker Game (I) |
|
Abdallah, Mustafa | Purdue University |
Cason, Timothy | Purdue University |
Bagchi, Saurabh | Purdue University |
Sundaram, Shreyas | Purdue University |
Keywords: Human-in-the-loop control, Game theory, Cyber-Physical Security
Abstract: We consider a setting consisting of two sites, and a sequential game between a defender and an attacker who are responsible for securing and attacking the sites, respectively. Each site has a value to the defender, and an associated probability of successful attack, which can be reduced via security investments in that site by the defender. The attacker targets the site that maximizes the expected loss for the defender (after the investments). While prior work has studied the security investments in such scenarios, in this work we consider what happens when the defender exhibits characteristics of bounded-rationality that have been identified by behavioral economics. In particular, humans have been shown to perceive probabilities in a nonlinear manner, typically overweighting low probabilities and underweighting high probabilities. We characterize how such nonlinear probability weighting affects the security investments made by the defender, and bound the inefficiency of the equilibrium investments under behavioral decision-making, compared to a non-behavioral optimal solution.
|
|
WeA16 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Switched Systems |
|
|
Chair: Tan, Ying | The University of Melbourne |
Co-Chair: Ozay, Necmiye | Univ. of Michigan |
|
13:00-13:15, Paper WeA16.1 | Add to My Program |
Switched Optimal Control and Dwell Time Constraints: A Preliminary Study |
|
Abudia, Moad | Oklahoma State University |
Harlan, Michael | Oklahoma State University |
Self, Ryan | Oklahoma State University |
Kamalapurkar, Rushikesh | Oklahoma State University |
Keywords: Switched systems, Embedded systems, Optimal control
Abstract: Most modern control systems are switched, meaning they have continuous as well as discrete decision variables. Switched systems often have constraints called dwell-time constraints, (e.g., cycling constraints in a heat pump), on the switching rate. This paper introduces an embedding-based-method to solve optimal control problems that have both discreet and continuous decision variables. unlike existing methods, the developed technique can heuristically incorporate dwell-time constraints while also preserving other state and control constrains of the problem. Simulations are run for a switched optimal control problem with and without the auxiliary cost to showcase the utility of the developed method.
|
|
13:15-13:30, Paper WeA16.2 | Add to My Program |
Ellipsoid-Based Sensor Fault Detection for Discrete-Time Switched Systems |
|
Zammali, Chaima | Conservatoire National Des Arts Et Métiers (CNAM), Cedric Lab |
VAN GORP, Jeremy | CNAM |
Wang, Zhenhua | Harbin Institute of Technology |
Ping, Xubin | Xidian University |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Keywords: Switched systems, Fault detection, Optimization
Abstract: In this paper, fault detection is studied for discrete-time switched systems subject to unknown but bounded disturbances and measurement noise. A fault detection observer with a new structure is investigated based on an L_{infty} criterion to attenuate the effects of uncertainties. The design conditions are given in terms of Linear Matrix Inequalities using Multiple Quadratic Lyapunov Functions, under an Average Dwell Time switching signal. In addition, the FD results are obtained via an ellipsoidal analysis. Finally, a numerical example is performed to illustrate the effectiveness of the proposed approach.
|
|
13:30-13:45, Paper WeA16.3 | Add to My Program |
Existence of Initial Condition Independent Stabilising Switching Functions |
|
Townsend, Christopher | University of Newcastle |
Seron, Maria M. | The University of Newcastle |
Keywords: Switched systems, Linear systems, Stability of linear systems
Abstract: We demonstrate that a switched affine system is simultaneously stabilisabile i.e. there exists an asymptotically stabilising switching function which is independent of the initial condition if and only if there exists a stable convex combination of the sub-system matrices.
|
|
13:45-14:00, Paper WeA16.4 | Add to My Program |
Path-Complete Lyapunov Functions for Continuous-Time Switching Systems |
|
Della Rossa, Matteo | LAAS CNRS |
Pasquini, Mirko | Imperial College of London |
Angeli, David | Imperial College |
Keywords: Switched systems, Lyapunov methods, Stability of nonlinear systems
Abstract: We use a graph-theory-based argument to propose a novel Lyapunov construction for continuous-time switching systems. Starting with a finite family of continuously differentiable functions, the inequalities involving these functions and the vector fields of the switching system are encoded in a direct and labeled graph. Relaying on the (path-)completeness of this graph, we introduce a signal-dependent Lyapunov function, providing sufficient conditions for stability under fixed-time or dwell-time switching hypothesis. For the case of linear systems, our conditions turn into linear matrix inequalities (LMI), and thus they are compared with previous results, via numerical examples.
|
|
14:00-14:15, Paper WeA16.5 | Add to My Program |
Almost Global Stability of Nonlinear Switched System with Stable and Unstable Subsystems |
|
Kivilcim, Aysegul | Aalborg University |
Karabacak, Özkan | Aalborg University |
Wisniewski, Rafal | Aalborg University |
Keywords: Switched systems, Lyapunov methods, Stability of nonlinear systems
Abstract: This paper presents sufficient conditions for almost global stability of nonlinear switched systems consisting of both stable and unstable subsystems. Techniques from the stability analysis of switched systems have been combined with the multiple Lyapunov density approach - recently proposed by the authors for the almost global stability of nonlinear switched systems composed of stable subsystems. By using slow switching for stable subsystems and fast switching for unstable subsystems lower and upper bounds for mode-dependent average dwell times are obtained. In addition to that, by allowing each subsystem to perform slow switching and using some restrictions on total operation time of unstable subsystems and stable subsystems, we have obtained a lower bound for an average dwell time.
|
|
14:15-14:30, Paper WeA16.6 | Add to My Program |
Implicit Invariant Sets for High-Dimensional Switched Affine Systems |
|
Wintenberg, Andrew | The University of Michigan, Ann Arbor |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Computational methods, Switched systems, Formal Verification/Synthesis
Abstract: In this paper we consider the problem of robust control invariant set computation for discrete-time switched affine systems. We consider additive disturbances and joint state-input constraints. We provide several constructions of N-step recurrent sets from which we derive an implicit description of invariant sets. This is done with a linear program characterizing set recurrence and backwards reachability with affine feedback controllers. The focus of these methods is scalability to systems with high-dimension. The performance of these methods is evaluated on a system modeling consensus of unmanned aerial vehicles (UAVs) and a system modeling the large-scale control of thermostatically controlled loads (TCLs).
|
|
14:30-14:45, Paper WeA16.7 | Add to My Program |
Scheduling Networked Control Systems under Jamming Attacks |
|
Kundu, Atreyee | Indian Institute of Science, Bangalore |
Keywords: Networked control systems, Switched systems, Stability of hybrid systems
Abstract: This paper deals with the design of scheduling policies for networked control systems whose shared networks have limited communication capacity and the controller to plant channels are vulnerable to jamming attacks. We assume that among (N) plants, only (M (< N)) plants can communicate with their controllers at any time instant, and the attack sequences follow an ((m,k))-firm model, i.e., in any (k) consecutive time instants, the control inputs sent to some or all of the plants accessing the communication network, are deactivated at most at (m (< k)) time instants. We devise a new algorithm to allocate the network to the plants periodically such that stability of each plant is preserved under the admissible attack signals. The main apparatus for our analysis is a switched systems representation of the individual plants in an NCS. We rely on matrix commutators (Lie brackets) between the stable and unstable modes of operation of the plants to guarantee stability under our scheduling policies.
|
|
14:45-15:00, Paper WeA16.8 | Add to My Program |
Stability and Robustness Analysis of Switched Vibrational Control |
|
Cheng, Xiaoxiao | The University of Melbourne |
Tan, Ying | The University of Melbourne |
Mareels, Iven | IBM |
Keywords: Stability of nonlinear systems, Robust control, Switched systems
Abstract: Vibrational control stabilizes an unstable nominal system by injecting high-frequency dither signals. However, high-frequency dithers consume much energy, and demand high power and high bandwidth control actuators. To alleviate the energy consumption issue, we propose to switch off the dither injection in a periodic manner - introducing a switched dither signal. By tuning the period of the switching appropriately, system stability can be guaranteed. Moreover, the closed loop system's stability properties are robust with respect to a class of additive disturbances. A simulation of an inverted pendulum system illustrates the theory, and shows that the control energy is significantly reduced with only minimal effect on the overall control performance as compared to classic vibrational control.
|
|
WeA17 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Constrained Control |
|
|
Chair: Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Co-Chair: Wang, Xiaofeng | University of South Carolina |
|
13:00-13:15, Paper WeA17.1 | Add to My Program |
Safe Tracking Control of an Uncertain Euler-Lagrange System with Full-State Constraints Using Barrier Functions |
|
Salehi, Iman | University of Connecticut |
Rotithor, Ghananeel | University of Connecticut |
Trombetta, Daniel | University of Connecticut |
Dani, Ashwin | University of Connecticut |
Keywords: Constrained control, Adaptive control, Robotics
Abstract: This paper presents a novel, safe tracking control design method that learns the parameters of an uncertain Euler-Lagrange (EL) system online using adaptive learning laws. A barrier function (BF) is first used to transform the full-state constrained EL-dynamics into an equivalent unconstrained dynamics. An adaptive tracking controller is then developed along with the parameter update law in the transformed state space such that the states remain bounded for all time within a prescribed bound. A stability analysis is developed that considers the EL-dynamics' uncertainty, yielding a semi-globally uniformly ultimately bounded (SGUUB) tracking error and the parameter estimation error. The controller design is validated in simulations using a two-link planar manipulator. The results show the proposed method's ability to track the reference trajectory while remaining inside each of the predefined state bounds.
|
|
13:15-13:30, Paper WeA17.2 | Add to My Program |
Gaussian Control Barrier Functions: Safe Learning and Control |
|
Khan, Mouhyemen | Georgia Institute of Technology |
Chatterjee, Abhijit | Georgia Tech |
Keywords: Constrained control, Machine learning, Robotics
Abstract: Safety is a critical component in today's autonomous and robotic systems. Many modern controllers endowed with notions of guaranteed safety properties rely on accurate mathematical models of these nonlinear dynamical systems. However, model uncertainty is always a persistent challenge weakening theoretical guarantees and compromising safety. For safety-critical systems, this is an even bigger challenge. Typically, safety is ensured by constraining the system states within a safe constraint set defined a priori by relying on the model of the system. A popular approach is to use Control Barrier Functions (CBFs) that encode safety using a smooth function. However, CBFs fail in the presence of model uncertainties. Moreover, an inaccurate model can either lead to incorrect notions of safety or worse, incur system critical failures. Addressing these drawbacks, we present a novel safety formulation that leverages properties of CBFs and positive definite kernels to design textit{Gaussian CBFs}. The underlying kernels are updated online by learning the unmodeled dynamics using Gaussian Processes (GPs). While CBFs guarantee forward invariance, the hyperparameters estimated using GPs update the kernel online and thereby adjust the relative notion of safety. We demonstrate our proposed technique on a safety-critical quadrotor on SO(3) in the presence of model uncertainty in simulation. With the kernel update performed online, safety is preserved for the system.
|
|
13:30-13:45, Paper WeA17.3 | Add to My Program |
An Explicit Reference Governor for Time-Varying Linear Constraints |
|
Hosseinzadeh, Mehdi | Washington University in St. Louis |
Sinopoli, Bruno | Washington University in St Louis |
Bobick, Aaron | Washington University in St. Louis |
Keywords: Constrained control, Lyapunov methods
Abstract: The explicit reference governor (ERG) is an add-on unit that provides constraint handling capabilities to prestabilized systems. The basic idea behind this approach is to translate state and input constraints into an upper-bound on the value of the Lyapunov function, which is then enforced by suitably manipulating the derivative of the auxiliary reference. This paper extends the ERG approach to deal with time-varying linear constraints. In particular, it is shown that under a certain condition on the rate of change of the constraints and the textit{strength} of stability of the prestabilized system (i.e., the speed of the dynamics of the prestabilized system), it is possible to deploy the ERG approach to handle time-varying constraints. To do so, components of the ERG approach are redefined, and constraints satisfaction and convergence properties are proven analytically. The effectiveness of the proposed scheme is demonstrated through a simulation study on filming a movie scene with a drone.
|
|
13:45-14:00, Paper WeA17.4 | Add to My Program |
Construction of Control Barrier Function and C^2 Reference Trajectory for Constrained Attitude Maneuvers |
|
Tan, Xiao | KTH Royal Institute of Technology |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Constrained control, Stability of nonlinear systems, Aerospace
Abstract: Constrained attitude maneuvers have numerous applications in robotics and aerospace. In our previous work, a general framework to this problem was proposed with resolution completeness guarantee. However, a smooth reference trajectory and a low-level safety-critical controller were lacking. In this work, we propose a novel construction of a C^2 continuous reference trajectory based on B'ezier curves on SO(3) that evolves within predetermined cells and eliminates previous stop-and-go behavior. Moreover, we propose a novel zeroing control barrier function on SO(3) that provides a safety certificate over a set of overlapping cells on SO(3) while avoiding nonsmooth analysis. The safety certificate is given as a linear constraint on the control input and implemented in real-time. A remedy is proposed to handle the states where the coefficient of the control input in the linear constraint vanishes. Numerical simulations are given to verify the advantages of the proposed method.
|
|
14:00-14:15, Paper WeA17.5 | Add to My Program |
NAW-NET: Neural Anti-Windup Control for Saturated Nonlinear Systems |
|
Breschi, Valentina | Politecnico Di Milano |
Masti, Daniele | IMT School for Advanced Studies Lucca |
Formentin, Simone | Politecnico Di Milano |
Bemporad, Alberto | IMT School for Advanced Studies Lucca |
Keywords: Constrained control, Neural networks, Identification for control
Abstract: One major issue in industrial control applications is how to handle input constraints due to physical limitations of the actuators. Such constraints introduce nonlinearities in the feedback loop, that are commonly tackled through anti-windup or model predictive control schemes. Since these techniques might result into poor closed-loop performance when an accurate model of the plant is not available, in this work we present an off-line strategy to learn a neural anti-windup control scheme (NAW-NET) from a set of open-loop data collected from an unknown nonlinear process. The proposed scheme, that includes a feedback controller and an anti-windup compensator, is trained to reproduce the desired closed-loop behavior while simultaneously accounting for actuator limits. The effectiveness of the approach is illustrated on a simulation example, involving the control of a Hammerstein Wiener process with saturated inputs.
|
|
14:15-14:30, Paper WeA17.6 | Add to My Program |
Constrained Control for Microgrids with Constant Power Loads |
|
Baldivieso Monasterios, Pablo Rodolfo | The University of Sheffield |
Konstantopoulos, George | The University of Sheffield |
Keywords: Constrained control, Power systems
Abstract: This paper analyses the controllability properties of microgrids connected to constant power loads subject to input and state constraints. Constraint requirements in Micro- Grids are either inherent, i.e. inverter modulation indices limitations, or imposed, such as current and voltage limitation which are safety critical. In this paper, we propose an analysis of the controllability properties of a microgrid using set theoretic notions; this analysis sheds light on the constraint admissibility properties of a microgrid with constant power loads in terms of constraint satisfaction and robustness to changes in power demands. Lastly, we provide a method of recasting the original nonlinear microgrid control problem into controlling a linear system subject to bounded additive disturbances and output constraints.
|
|
14:30-14:45, Paper WeA17.7 | Add to My Program |
Action Governor for Discrete-Time Linear Systems with Non-Convex Constraints |
|
Li, Nan | University of Michigan |
Han, Kyoungseok | Kyungpook National University |
Girard, Anouck | University of Michigan, Ann Arbor |
tseng, eric | Ford Motor Company |
Filev, Dimitre P. | Ford Motor Company |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Constrained control, Predictive control for linear systems, Autonomous systems
Abstract: This paper introduces an add-on, supervisory scheme, referred to as Action Governor (AG), for discrete-time linear systems to enforce exclusion-zone avoidance requirements. It does so by monitoring, and minimally modifying when necessary, the nominal control signal to a constraint-admissible one. The AG operates based on set-theoretic techniques and online optimization. This paper establishes its theoretical foundation, discusses its computational realization, and uses two simulation examples to illustrate its effectiveness.
|
|
14:45-15:00, Paper WeA17.8 | Add to My Program |
Adaptive Robust Quadratic Programs Using Control Lyapunov and Barrier Functions |
|
Zhao, Pan | University of Illinois Urbana-Champaign |
Mao, Yanbing | University of Illinois Urbana-Champaign |
Tao, Chuyuan | University of Illinois Urbana and Champaign |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Wang, Xiaofeng | University of South Carolina |
Keywords: Constrained control, Uncertain systems, Robust adaptive control
Abstract: This paper presents adaptive robust quadratic program (QP) based control using control Lyapunov and barrier functions for nonlinear systems subject to time-varying and state-dependent uncertainties. An adaptive estimation law is proposed to estimate the pointwise value of the uncertainties with pre-computable estimation error bounds. The estimated uncertainty and the error bounds are then used to formulate a robust QP, which ensures that the actual uncertain system will not violate the safety constraints defined by the control barrier function. Additionally, the accuracy of the uncertainty estimation can be systematically improved by reducing the estimation sampling time, leading subsequently to reduced conservatism of the formulated robust QP. The proposed approach is validated in simulations on an adaptive cruise control problem and through comparisons with existing approaches.
|
|
WeSP1 Semiplenary Session, Coordinated Universal Time (UTC) |
Add to My Program |
Glocal (Global/Local) Control: Theoretical Challenges to Practice |
|
|
Chair: Cho, Dong-il | Seoul National University |
Co-Chair: Prieur, Christophe | CNRS |
|
15:10-16:10, Paper WeSP1.1 | Add to My Program |
Glocal (Global/Local) Control: Theoretical Challenges to Practice |
|
Hara, Shinji | Tokyo Institute of Technology |
Keywords: Hierarchical control, Robust control, Biological systems
Abstract: There are many interesting dynamical systems that can be regarded as hierarchically networked systems in a variety of fields including control. One of the ideas to treat those systems properly is "Glocal (Global/Local) Control," which means that the global purpose is achieved by local actions of measurement and control cooperatively. The key for realization of glocal control is hierarchically networked dynamical systems with multiple resolutions in time and space depending on the layer, which introduce many new theoretical control challenges aiming at practical effectiveness in synthetic biology and engineering. The main issues may include how to achieve synchronization by decentralized control and how to make a compromise of global and local objectives. This talk starts with the background, the idea, and the concept of glocal control based on an understanding of IoT from the control perspective followed by presenting two research topics. The first topic is on a theoretical framework for hierarchically decentralized control of networked dynamical systems that can take account of the tradeoff between the global and local objectives to achieve the desired harmony under change of the environments. Several new ideas, by exploiting the special structure of the target systems, enable us to develop scalable control design methods based on the powerful theory in classical, modern, and robust control. The effectiveness of the new theoretical foundations on the analysis and synthesis are experimentally confirmed by applications to electric vehicle control. The second topic is on robust instability analysis for a class of uncertain nonlinear networked systems to guarantee the persistence of nonlinear oscillations in the presence of a dynamic perturbation, which is important in synthetic biology. The problem of robust instability has a very different feature from that of robust stability, and hence a new theoretical setting is needed. The theoretical results are applied to the Repressilator in synthetic biology, and the effectiveness is confirmed by numerical simulations.
|
|
WeSP2 Semiplenary Session, Coordinated Universal Time (UTC) |
Add to My Program |
Learning-Based Planning and Control: Opportunities and Challenges |
|
|
Chair: Lee, Jay H. | Korea Advanced Institute of Science and Technology |
Co-Chair: Shim, Hyungbo | Seoul National University |
|
15:10-16:10, Paper WeSP2.1 | Add to My Program |
Learning-Based Planning and Control: Opportunities and Challenges |
|
How, Jonathan, P. | MIT |
Keywords: Machine learning, Autonomous systems, Large-scale systems
Abstract: Machine learning-based techniques have recently revolutionized nearly every aspect of autonomy. In particular, deep reinforcement learning (RL) has rapidly become a powerful alternative to classical model-based approaches to decision-making, planning, and control. Despite the well-publicized successes of deep RL, its adoption in complex and/or safety-critical tasks at scale and in real-world settings is hindered by several key issues, including high sample complexity in large-scale problems, limited transferability, and lack of robustness guarantees. This talk explores our recently developed solutions that address these fundamental challenges for both single and multiagent RL. In addition, this talk highlights the complementary role that classical model-based techniques can play in synergy with data-driven methods in overcoming these issues. Real experiments with ground and aerial robots will be used to illustrate the effectiveness of the proposed techniques. The talk will conclude with an assessment of the state of the art and highlight important avenues for future research.
|
|
WeB01 COVID-19 Focus Session, Coordinated Universal Time (UTC) |
Add to My Program |
Data & Forecasting |
|
|
Chair: Beck, Carolyn L. | Univ of Illinois, Urbana-Champaign |
Co-Chair: Liu, Ji | Stony Brook University |
|
16:15-16:35, Paper WeB01.1 | Add to My Program |
On Choice of Model Complexity and Data Sources for Prediction of Ongoing Pandemics (I) |
|
Gustafsson, Fredrik | Linkoping Univ |
Jaldén, Joakim | KTH Royal Institute of Technology |
Soltesz, Kristian | Lund University |
Bernhardsson, Bo M. | Lund University |
Heimerson, Albin | Lund University |
Jidling, Carl | Uppsala University |
Lundh, Torbjörn | University of Gothenburg |
Schön, Thomas (Bo) | Uppsala University |
Spreco, Armin | Linköping University |
Bagge Carlson, Fredrik | Lund University |
Jöud, Anna | Skåne University Hospital, Lund University |
Philip, Gerlee | Chalmers University of Technology and University of Gothenburg |
Timpka, Toomas | Linköping University |
Keywords: Healthcare and medical systems, Identification, Model Validation
Abstract: We will analyse and discuss how the choice of model complexity impacts the predictive power of some epidemics models. We describe theoretical analysis of parameter identifiability and discuss the discrepancy between different simulation studies and the actual outcome during the COVID-19 pandemic. We also discuss the different information sources available to aid analysis in an ongoing pandemic, and discuss their usefulness, based on our experience from the Swedish health care system. The contribution is a cooperation between modeling experts from the major Swedish universities and Swedish health care experts with long experience, responsible for analysis during the COVID-19 pandemic and tracking the seasonal influenza for many years. We describe how this was used to aid the the planning within the Swedish health care system.
|
|
16:35-16:55, Paper WeB01.2 | Add to My Program |
An Interpretable Mortality Prediction Model for COVID-19 Patients: A Single Center Study (long Abstract) (I) |
|
Yuan, Ye | Huazhong University of Science and Technology |
Keywords: Machine learning, Healthcare and medical systems, Pattern recognition and classification
Abstract: The sudden increase in COVID-19 cases is putting high pressure on healthcare services worldwide. At this stage, fast, accurate and early clinical assessment of the disease severity is vital. To support decision making and logistical planning in healthcare systems, this study leverages a database of blood samples from 485 infected patients in the region of Wuhan, China, to identify crucial predictive biomarkers of disease mortality. For this purpose, machine learning tools selected three biomarkers that predict the mortality of individual patients more than 10 days in advance with more than 90% accuracy: lactic dehydrogenase (LDH), lymphocyte and high-sensitivity C-reactive protein (hs-CRP).Overall, this Article suggests a simple and operable decision rule to quickly predict patients at the highest risk, allowing them to be prioritized and potentially reducing the mortality rate. This talk is based on the following paper published in Nature Machine Intelligence: https://www.nature.com/articles/s42256-020-0180-7
|
|
16:55-17:15, Paper WeB01.3 | Add to My Program |
Intermittent yet Coordinated Regional Strategies Can Alleviate the COVID-19 Epidemic: A Network Model of the Italian Case (I) |
|
Della Rossa, Fabio | Politecnico Di Milano |
Salzano, Davide | University of Naples Federico II |
Di Meglio, Anna | University of Naples Federico II |
De Lellis, Francesco | University of Naples Federico II |
Coraggio, Marco | University of Naples Federico II |
Calabrese, Carmela | University of Naples Federico II |
Guarino, Agostino | University of Naples Federico II |
Cardona-Rivera, Ricardo | University of Naples Federico II |
De Lellis, Pietro | University of Naples Federico II |
Liuzza, Davide | University of Sannio |
Lo Iudice, Francesco | Università Di Napoli Federico II |
Russo, Giovanni | University of Salerno |
di Bernardo, Mario | University of Naples Federico II |
Keywords: Nonlinear systems identification, Network analysis and control, Compartmental and Positive systems
Abstract: To better capture the dynamics of the COVID-19 epidemics in Italy, as in other countries with a regional (or federal) administrative structure, it is of utmost importance to capture regional differences in mitigating the spread of the diseases, considering the heterogeneity at the regional level of the effects of national measures and local intervention strategies. Also, it is crucial to model flows of potential infected among the regions at a finer level than aggregate SIR family models allow.In this study we take Italy as a study case and propose a network model where each of the 20 administrative regions in which Italy is split into is considered as a node, edges representing flows of people to and from each region. Both short distance edges and long distance edges are considered in the model. The aim of the study is to capture at a finer level the dynamics of the epidemics both at the regional and the national level highlighting differences between regional dynamics and the aggregate effect of these at the national level. We use the proposed model to analyse the COVID-19 spread across the country, and to explore different feedback control based mitigation strategies to avoid the return of an epidemic in future waves. In so doing, we derive sufficient conditions based on applying appropriate matrix measures to the so-called next generation matrix and explore different optimal control approaches to devise intermittent regional strategies to contain and mitigate future epidemic phenomena.
|
|
17:15-17:35, Paper WeB01.4 | Add to My Program |
Panel Discussion: Why Was/Is It so Difficult to Forecast the Spread of COVID-19? (I) |
|
Pare, Philip E. | Purdue University |
Keywords: Healthcare and medical systems, Identification, Model Validation
Abstract: The ongoing COVID-19 pandemic has motivated an innumerable number of researchers, companies, and governments to attempt to forecast the spread of the disease. Scouring this vast spectrum of predictions, it is clear that there is no consensus and that capturing the behavior of the outbreak remains elusive. The majority of the data available appears to support the hypothesis that the disease follows some variant of an SIR (susceptible-infected-removed) model. It is well known that the reproduction number of the SIR model (both networked and group models) depends on the number of susceptible individuals in the population. Therefore, forecasts of the spread, and consequently the number of susceptible, are essential for understanding in what stage of the outbreak we are currently operating. Several natural questions arise: - What can be done to improve forecasts of the outbreak/avoid overfitting in order to help design and implement mitigation efforts? - When sickness and deaths are in the balance, as they are now, how should researchers approach the exploitation vs exploration tradeoff? - How can we learn from this outbreak and adapt the findings in preparation for future possible outbreaks?
|
|
WeB02 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Power Systems II |
|
|
Chair: Jacquod, Philippe | University of Applied Sciences of Western Switzerland |
Co-Chair: Duel-Hallen, Alexandra | North Carolina State University |
|
16:15-16:30, Paper WeB02.1 | Add to My Program |
A Stackelberg Security Investment Game for Voltage Stability of Power Systems |
|
An, Lu | North Carolina State University |
Chakrabortty, Aranya | North Carolina State University |
Duel-Hallen, Alexandra | North Carolina State University |
Keywords: Game theory, Power systems, Distributed control
Abstract: We formulate a Stackelberg game between an attacker and a defender of a power system. The attacker attempts to alter the load setpoints of the power system covertly and intelligently, so that the voltage stability margin of the grid is reduced, driving the entire system towards a voltage collapse. The defender, or the system operator, aims to compensate for this reduction by retuning the reactive power injection to the grid by switching on control devices, such as a bank of shunt capacitors. A modified Backward Induction method is proposed to find a cost-based Stackelberg equilibrium (CBSE) of the game, which saves the players' costs while providing the optimal allocation of both players' investment resources under budget and covertness constraints. We analyze the proposed game extensively for the IEEE 9-bus power system model and present an example of its performance for the IEEE 39-bus power system model. It is demonstrated that the defender is able to maintain system stability unless its security budget is much lower than the attacker's budget.
|
|
16:30-16:45, Paper WeB02.2 | Add to My Program |
Sparse Nonlinear Wide-Area Control of Power Systems Using Perturbed Koopman Modes |
|
Hernandez, Marcos | Autonomous University of Guadalajara |
Chakrabortty, Aranya | North Carolina State University |
Roman Messina, Arturo | Cinvestav |
Keywords: Power systems, Distributed control, Optimal control
Abstract: Power systems exhibit nonlinear dynamics when load stress is high. Conventional linear wide-area controllers will fail in such scenarios, and instead nonlinear controllers will become necessary. This paper develops a state-feedback nonlinear quadratic controller based on perturbed Koopman mode analysis (PKMA) considering a sparse wide-area communication net-work for saving communication cost while guaranteeing closed-loop stability. The sparsity is decided based on the modal residues of the generators on selected nonlinear oscillations modes. The theoretical developments behind this sparse nonlinear control are first established, including the definition of residues for Koopman modes. Bilinear effects in the controller structure arising from quadratic PKMA-based extended models, structural constraints on the quadratic gain matrix design, and second-order phenomena in the closed-loop dynamics are all explained in detail. The design is validated using the IEEE 16-machine, 68-bus power system. The advantages and drawbacks of the proposed approach are illustrated using simulation results.
|
|
16:45-17:00, Paper WeB02.3 | Add to My Program |
Primary Control Effort under Fluctuating Power Generation in Realistic High-Voltage Power Networks |
|
Tyloo, Melvyn | HES-SO |
Jacquod, Philippe | University of Applied Sciences of Western Switzerland |
Keywords: Power systems, Control of networks, Stability of nonlinear systems
Abstract: Many recent works in control of electric power systems have investigated their synchronization through global performance metrics under external disturbances. The approach is motivated by fundamental changes in the operation of power grids, in particular by the substitution of conventional power plants with new renewable sources of electrical energy. This substitution will simultaneously increase fluctuations in power generation and reduce the available mechanical inertia. It is crucial to understand how strongly these two evolutions will impact grid stability. With very few, mostly numerical exceptions, earlier works on performance metrics had to rely on unrealistic assumptions of grid homogeneity. Here we show that a modified spectral decomposition can tackle that issue in inhomogeneous power grids in cases where disturbances occur on time scales that are long compared to the intrinsic time scales of the grid. We find in particular that the magnitude of the transient excursion generated by disturbances with long characteristic times does not depend on inertia. For continental-size, high-voltage power grids, this corresponds to power fluctuations that are correlated on time scales of few seconds or more. We conclude that power fluctuations arising from new renewables will not require per se the deployment of additional rotational inertia. We numerically illustrate our results on the IEEE 118-Bus test case and a model of the synchronous grid of continental Europe.
|
|
17:00-17:15, Paper WeB02.4 | Add to My Program |
Observer-Based Excitation Control for Transient Stabilization of the Single Machine Infinite Bus System |
|
Rojas, Michael | Universidad Nacional Autonoma De Mexico |
Rueda-Escobedo, Juan G. | Brandenburg University of Technology Cottbus - Senftenberg |
Espinosa-Perez, Gerardo | Universidad Nacional Autonoma De Mexico |
Schiffer, Johannes | Brandenburg University of Technology |
Keywords: Power systems, Observers for Linear systems, Stability of nonlinear systems
Abstract: The availability of excitation controllers to en-hance transient stability has regained significant relevance in recent years, due to the unprecedented ongoing changes in power systems. Yet, the practical deployment of many reported control schemes is hampered by the fact that their implementation requires the full state vector. Our main contribution is to address this fundamental obstacle by proposing an observer-based excitation controller using modern phasor measurement technology. For this purpose, a linear time-varying observer scheme for the generator frequency and the internal voltage is derived. This observer is then combined with a classical passivity-based excitation controller. Stability of the resulting nonlinear observer-based closed-loop system is shown by deriving an ISS-based separation principle. The performance of the proposed approach is demonstrated via simulation example.
|
|
17:15-17:30, Paper WeB02.5 | Add to My Program |
A Fast Certificate for Power System Small-Signal Stability |
|
Gholami, Amin | Georgia Institute of Technology |
Sun, Andy | Georgia Institute of Technology |
Keywords: Power systems, Stability of nonlinear systems, Smart grid
Abstract: Swing equations are an integral part of a large class of power system dynamical models used in rotor angle stability assessment. Despite intensive studies, some fundamental properties of lossy swing equations are still not fully understood. In this paper, we develop a sufficient condition for certifying the stability of equilibrium points (EPs) of these equations, and illustrate the effects of damping, inertia, and network topology on the stability properties of such EPs. The proposed certificate is suitable for real-time monitoring and fast stability assessment, as it is purely algebraic and can be evaluated in a parallel manner. Moreover, we provide a novel approach to quantitatively measure the degree of stability in power grids using the proposed certificate. Extensive computational experiments are conducted, demonstrating the practicality and effectiveness of the proposal.
|
|
WeB03 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Healthcare and Medical Systems |
|
|
Chair: Di Benedetto, Maria Domenica | University of L'Aquila |
Co-Chair: Di Ferdinando, Mario | University of L'Aquila |
|
16:15-16:30, Paper WeB03.1 | Add to My Program |
Flexible Regularization Approaches for Fairness in Deep Learning |
|
Mintz, Yonatan | University of Wisconsin Madison |
Olfat, Mahbod | UC Berkeley |
Keywords: Machine learning, Healthcare and medical systems, Optimization
Abstract: Artificial Neural Networks (ANN) have been shown to be effective for many predictive tasks, such as system identification and reinforcement learning. However, as they have become more ubiquitous, there have been several examples of models exhibiting anthropomorphic bias (e.g. making predictions correlated with race or gender for unrelated tasks) due to over fitting, amplifying and systematizing bias already inherent in training data. To address this problem, we consider a novel regularization approach for deep learning, inspired by the constrained optimization literature, that directly penalizes unwanted disparities in treatment of populations proportionally to their impact on observed bias. Using this method, we can control bias at training time, as opposed to in a pre- or post-processing step; this results in concurrent out-of-sample improvements in both fairness and accuracy for some data sets. Our methods fit well into existing optimization and training approaches and can be easily generalized across network architectures and notions of fairness. We validate our methods empirically on several real world data sets that contain implicit bias. Namely we consider the impact of race on recidivism prediction, gender on income, and wine color on quality. We also consider fairness in a reinforcement learning setting by controlling the dose of Heparin while being certifiably fair with respect to the patient's insurance provider.
|
|
16:30-16:45, Paper WeB03.2 | Add to My Program |
Finite-Dimensional Periodic Event-Triggered Control of Nonlinear Time-Delay Systems with an Application to the Artificial Pancreas |
|
Borri, Alessandro | CNR-IASI |
Pepe, Pierdomenico | University of L' Aquila |
Di Loreto, Ilaria | University of L'Aquila |
Di Ferdinando, Mario | University of L'Aquila |
Keywords: Delay systems, Sampled-data control, Control applications
Abstract: This paper addresses the periodic event-triggered stabilization in the sample-and-hold sense of nonlinear time-delay systems by means of a finite-dimensional state feedback law. Differently from most related literature, no continuity assumptions are required on the state feedback available in continuous time, which can hence be designed for instance by universal constructions yielding possible discontinuities, provided that such a feedback depends on a finite number of commensurate time delays. An application to the problem of glucose regulation is then presented, showing the effectiveness of the approach taken.
|
|
16:45-17:00, Paper WeB03.3 | Add to My Program |
An Assume-Guarantee Approach to Sampled-Data Quantized Glucose Control |
|
Di Loreto, Ilaria | University of L'Aquila |
Borri, Alessandro | CNR-IASI |
Di Benedetto, Maria Domenica | University of L'Aquila |
Keywords: Hybrid systems, Control applications, Systems biology
Abstract: Contract-based design is a compositional approach that enables to satisfy specifications for complex systems by partitioning responsibilities among the component subsystems. A natural application of assume-guarantee reasoning arises in the context of biological systems, which are composed of different modules, each one responsible for some actions. This work presents an application of the assume-guarantee contracts theory to the glucose regulation system, which is a topic of major importance in diabetes treatment, in the context of the so-called Artificial Pancreas. The glucose control problem is defined and solved in a decentralized way by defining suitable contracts to be satisfied by the glucose and insulin subsystems, while the framework naturally takes into account sampling and quantization arising from the digital environment. The results show that the approach is feasible and promising for further investigation.
|
|
17:00-17:15, Paper WeB03.4 | Add to My Program |
Robust Power and Cadence Tracking on a Motorized FES Cycle with an Unknown Time-Varying Input Delay |
|
Allen, Brendon C. | University of Florida |
Stubbs, Kimberly J. | University of Florida |
Dixon, Warren E. | University of Florida |
Keywords: Healthcare and medical systems, Human-in-the-loop control, Control applications
Abstract: Functional electrical stimulation (FES) induced cycling is a common rehabilitative technique applied for those with a movement disorder. An FES cycle system is a nonlinear switched dynamic system that has a potentially destabilizing input delay between stimulation and the resulting muscle force. In this paper, a dual objective control system for a nonlinear, uncertain, switched FES cycle system with an unknown time varying input delay is developed and a Lyapunov-like dwell-time analysis is performed to yield exponential power tracking to an ultimate bound and global exponential cadence tracking. Preliminary experimental results for a single healthy individual are provided and demonstrate average power and cadence tracking errors of -0.05 +/- 0.80 W and -0.05 +/- 1.20 RPM, respectively, for a target power of 10 W and a target cadence of 50 RPM.
|
|
17:15-17:30, Paper WeB03.5 | Add to My Program |
A Converse Lyapunov-Krasovskii Theorem for the Global Asymptotic Local Exponential Stability of Nonlinear Time-Delay Systems |
|
Di Ferdinando, Mario | University of L'Aquila |
Pepe, Pierdomenico | University of L' Aquila |
Di Gennaro, Stefano | University of L'Aquila |
Keywords: Delay systems, Stability of nonlinear systems, Systems biology
Abstract: In this paper, the notion of GALES (Global Asymptotic Local Exponential Stability) is extended to nonlinear systems described by Retarded Functional Differential Equations. Necessary and sufficient Lyapunov--Krasovskii conditions ensuring the GALES of nonlinear time--delay systems are provided. The conditions related to the lower bound and to the dissipation rate of the Lyapunov--Krasovskii functional involve only the current value of the solution, making the provided tool easy to use. An example validating the results is presented.
|
|
WeB04 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Model Reduction |
|
|
Chair: Monshizadeh, Nima | University of Groningen |
Co-Chair: Lamperski, Andrew | University of Minnesota |
|
16:15-16:30, Paper WeB04.1 | Add to My Program |
Non-Asymptotic Closed-Loop System Identification Using Autoregressive Processes and Hankel Model Reduction |
|
Lee, Bruce | University of Pennsylvania |
Lamperski, Andrew | University of Minnesota |
Keywords: Closed-loop identification, Identification, Identification for control
Abstract: One of the primary challenges of system identification is determining how much data is necessary to adequately fit a model. Non-asymptotic characterizations of the performance of system identification methods provide this knowledge. Such characterizations are available for several algorithms performing open-loop identification. Often times, however, data is collected in closed-loop. Application of open-loop identification methods to closed-loop data can result in biased estimates. One method to eliminate these biases involves first fitting a long-horizon autoregressive model and then performing model reduction. The asymptotic behavior of such algorithms is well characterized, but the non-asymptotic behavior is not. This work provides a non-asymptotic characterization of one particular variant of these algorithms. More specifically, we provide non-asymptotic upper bounds on the generalization error of the produced model, as well as high probability bounds on the difference between the produced model and the finite horizon Kalman Filter.
|
|
16:30-16:45, Paper WeB04.2 | Add to My Program |
Online Estimation of the Loewner Matrices |
|
Simard, Joel David | Imperial College London |
Astolfi, Alessandro | Imperial College & Univ. of Rome |
Keywords: Identification, Model/Controller reduction, Linear systems
Abstract: In this paper two online estimation methods are considered for the determining the Loewner matrices of a linear system. Both algorithms employ the gradient descent method and sufficient persistence of excitation conditions guarantee the convergence of estimates to the correct values.
|
|
16:45-17:00, Paper WeB04.3 | Add to My Program |
Multi-Array Electron Beam Stabilization Using Block Circulant Transformation and Generalized Singular Value Decomposition |
|
Kempf, Idris | University of Oxford |
Duncan, Stephen | University of Oxford |
Goulart, Paul J. | University of Oxford |
Rehm, Guenther | Diamond Light Source |
Keywords: Model/Controller reduction, Linear systems, Control applications
Abstract: We introduce a novel structured controller design for the electron beam stabilization problem of the UK's national synchrotron light source. Because changes to the synchrotron will not allow the application of existing control approaches, we develop a novel method to diagonalize the multi-input multi-output (MIMO) system. A generalized singular value decomposition (GSVD) is used to simultaneously diagonalize the actuator response matrices, which is applicable to an arbitrary number of actuator dynamics in a cross-directional setting. The resulting decoupled systems are regulated using mid-ranged control and the controller gains derived as a function of the generalized singular values. In addition, we exploit the inherent block circulant symmetry of the system. The performance of our controller is demonstrated using simulations that involve machine data.
|
|
17:00-17:15, Paper WeB04.4 | Add to My Program |
The Enhanced Finite State Projection Algorithm, Using Conditional Moment Closure and Time-Scale Separation |
|
Kwon, Ukjin | MIT |
Naghnaeian, Mohammad | Clemson University |
Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Model/Controller reduction, Biomolecular systems, Computational methods
Abstract: The Chemical Master Equation (CME) is commonly used to describe the stochastic behavior of biomolecular systems. However, in general, the CME’s dimension is very large or infinite, so analytical or even numerical solutions may be difficult to achieve. The truncation methods such as the Finite State Projection (FSP) algorithm alleviate this issue to some extent but not completely. To further resolve the computational issue, we propose the Enhanced Finite State Projection (EFSP) algorithm, in which the ubiquitous time-scale separation is utilized to reduce the dimension of the CME. Our approach combines the original FSP algorithm and the model reduction technique that we developed, to approximate an infinite dimensional CME with a finite dimensional CME that contains the slow species only. Unlike other time-scale separation methods, which rely on the fast-species counts’ stationary conditional probability distributions, our model reduction technique relies on only the first few conditional moments of the fast-species counts. In addition, each iteration of the EFSP algorithm relies on the solution of the approximated CME that contains the slow species only, unlike the original FSP algorithm relies on the solution of the full CME. These two properties provide a significant computation advantage. The benefit of our algorithm is illustrated through a protein binding reaction example.
|
|
17:15-17:30, Paper WeB04.5 | Add to My Program |
Amidst Data-Driven Model Reduction and Control |
|
Monshizadeh, Nima | University of Groningen |
Keywords: Model/Controller reduction, Stability of linear systems
Abstract: We explore a middle ground between data-driven model reduction and data-driven control. In particular, we use snapshots collected from the system to build reduced models that can be expressed in terms of data. We illustrate how the derived family of reduced models can be used for data-driven control of the original system under suitable conditions. Finding a control law that stabilizes certain solutions of the original system as well as the one that reaches any desired state in final time are studied in detail.
|
|
WeB05 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Problems in Machine Learning |
|
|
Chair: Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
Co-Chair: George, Jemin | U.S. Army Research Laboratory |
|
16:15-16:30, Paper WeB05.1 | Add to My Program |
SPARQ-SGD: Event-Triggered and Compressed Communication in Decentralized Optimization |
|
Singh, Navjot | University of California, Los Angeles |
DATA, DEEPESH | University of California, Los Angeles |
George, Jemin | U.S. Army Research Laboratory |
Diggavi, Suhas | UCLA |
Keywords: Machine learning, Optimization, Optimization algorithms
Abstract: In this paper, we propose and analyze SPARQ-SGD, an event-triggered and compressed algorithm for decentralized training of large-scale machine learning models over a graph. Each node can locally compute a condition (event) which triggers a communication where quantized and sparsified local model parameters are sent. In SPARQ-SGD, each node takes at least a fixed number of local gradient steps and then checks if the model parameters have significantly changed compared to its last update; it communicates further compressed model parameters only when there is a significant change, as specified by a (design) criterion. We prove that SPARQ-SGD converges as O(1/nT) and O(1/sqrt(nT)) in the strongly-convex and non-convex settings, respectively, demonstrating that aggressive compression, including event-triggered communication, model sparsification and quantization does not affect the overall convergence rate compared to uncompressed decentralized training; thereby theoretically yielding communication efficiency for `free'. We evaluate SPARQ-SGD over real datasets to demonstrate significant amount of savings in communication over the state-of-the-art.
|
|
16:30-16:45, Paper WeB05.2 | Add to My Program |
Worst-Case Risk Quantification under Distributional Ambiguity Using Kernel Mean Embedding in Moment Problem |
|
Zhu, Jia-Jie | Max Planck Institute for Intelligent Systems |
Jitkrittum, Wittawat | Max Planck Institute for Intelligent Systems |
Diehl, Moritz | University of Freiburg |
Schölkopf, Bernhard | MPI for Biological Cybernetics |
Keywords: Machine learning, Optimization, Robust control
Abstract: In order to anticipate rare and impactful events, we propose to quantify the worst-case risk under distributional ambiguity using a recent development in kernel methods -- the kernel mean embedding. Specifically, we formulate the generalized moment problem whose ambiguity set (i.e., the moment constraint) is described by constraints in the associated reproducing kernel Hilbert space in a nonparametric manner. We then present the tractable approximation and its theoretical justification. As a concrete application, we numerically test the proposed method in characterizing the worst-case constraint violation probability in the context of a constrained stochastic control system.
|
|
16:45-17:00, Paper WeB05.3 | Add to My Program |
Optimal Sensor Selection for Binary Detection Based on Stochastic Submodular Optimization |
|
Hespanha, Joao P. | Univ. of California, Santa Barbara |
Garagic, Denis | BAE Systems FAST Labs |
Keywords: Machine learning, Optimization, Estimation
Abstract: We address the problem of selecting sensors for the estimation of binary random variables, so as to minimize the probability of error. This problem arises when a large number of sensors are potentially available, but only a few can actually be used for estimation purposes. While sensor selection is a combinatorial problem, we show that the optimization of an upper bound on the probability of error can be formulated as a submodular maximization for which computationally efficient algorithms can provide solutions with guaranteed performance. The submodular optimization that needs to be solved involves the computation of an expected value that generally cannot be computed in closed form, but we show that replacing the expected value by a Monte Carlo empirical mean can result in negligible loss of performance with high probability. We illustrate the use of these results in the context of detecting illegal unreported and unregulated (IUU) fishing.
|
|
17:00-17:15, Paper WeB05.4 | Add to My Program |
NSM Converges to a K-NN Regressor under Loose Lipschitz Estimates |
|
Maddalena, Emilio | École Polytechnique Fédérale De Lausanne |
Jones, Colin N. | EPFL |
Keywords: Machine learning, Nonlinear systems identification
Abstract: Although it is known that having accurate Lipschitz estimates is essential for certain models to deliver good predictive performance, refining this constant in practice can be a difficult task especially when the input dimension is high. In this work, we shed light on the consequences of employing loose Lipschitz bounds in the Nonlinear Set Membership (NSM) framework, showing that the model converges to a nearest neighbor regressor (k-NN with k=1). This convergence process is moreover not uniform, and is monotonic in the univariate case. An intuitive geometrical interpretation of the result is then given and its practical implications are discussed.
|
|
17:15-17:30, Paper WeB05.5 | Add to My Program |
Deterministic Completion of Rectangular Matrices with Measurement Noise Using Unbalanced Ramanujan Bigraphs |
|
Burnwal, Shantanu Prasad | Indian Institute of Technology Hyderabad |
Vidyasagar, Mathukumalli | Indian Institute of Technology Hyderabad |
Keywords: Machine learning, Statistical learning, Learning
Abstract: The present paper addresses the problem of matrix completion with noisy measurements using a deterministic procedure for choosing the sample set. Previous research on the matrix completion problem has been based on probabilistic sampling, with the result that the various theorems hold only with high probability. Also, most available results are for the case where the measurements are noise-free. Authors proposed to generate the sample set as the edge set of a Ramanujan graph, and sufficient condition is provided for the exact and stable recovery of the unknown matrix via nuclear norm minimization, when measurements are noise-free and noisy, respectively. In the present paper, we use the same sampling approach and perform some numerical simulations for the case of both noise-free and noisy measurements. In the process, we give a comparison between the noisy and noise-free measurements.
|
|
WeB06 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Estimation, Control, and Optimization of Automotive Systems |
|
|
Chair: Tong, Son | Siemens Digital Industries Software |
Co-Chair: Dadras, Soodeh | Utah State University |
Organizer: Siegel, Jason B. | University of Michigan |
Organizer: Dadras, Soodeh | Utah State University |
Organizer: Dadam, Sumanth | Ford Motor Company |
Organizer: Dadras, Sara | Company |
Organizer: Ahmed, Qadeer | The Ohio State University |
Organizer: Borhan, Hoseinali | Cummins Inc |
Organizer: Amini, Mohammad Reza | University of Michigan |
Organizer: Tong, Son | Siemens Digital Industries Software |
|
16:15-16:30, Paper WeB06.1 | Add to My Program |
Cooperating Modular Goal Selection and Motion Planning for Autonomous Driving (I) |
|
Ahn, Heejin | Mitsubishi Electric Research Laboratories |
Berntorp, Karl | Mitsubishi Electric Research Labs |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Automotive systems, Automotive control, Constrained control
Abstract: We present a decision making approach for autonomous driving that concurrently determines the driving mode and the motion plan that achieves the driving mode goal. To do this, we develop two cooperating modules: a mode activator and a motion planner. Based on the current mode in a non-deterministic automaton, the mode activator determines all the feasible next modes, i.e., the modes for which there exists a trajectory that reaches the associated goal. Then, the motion planner generates trajectories achieving the goals of such feasible modes, selects the next mode and trajectory that result in the best performance, and updates the current mode in the automaton. To determine the feasibility, the mode activator uses robust forward and backward reachability that accounts for the discrepancy between the simplified model used in the reachability computation and the more precise model used by the motion planner. We prove that, under normal operation, the mode activator always returns a nonempty set of feasible modes, so that the decision making algorithm is recursively feasible. We validate the algorithm in simulations and experiments using car-like laboratory-scale robots.
|
|
16:30-16:45, Paper WeB06.2 | Add to My Program |
Decision Making through Stochastic Maneuver Validation for Overtaking on Country Roads (I) |
|
Adelberger, Daniel | Johannes Kepler University Linz |
Wang, Meng | Delft University of Technology |
Del Re, Luigi | Johannes Kepler University Linz |
Keywords: Automotive systems, Optimization, Stochastic systems
Abstract: Driver assistance systems have become more and more important in recent years due to the increasing degree of automation in road traffic - especially with regard to safety. Often a driver's perception of a situation is prone to inaccuracy, particularly on country roads. Consequently, there is a high mortality rate due to frontal collisions with oncoming traffic (among others). One of the most dangerous maneuvers that leads to such conflicts is overtaking. Overtaking involves other traffic participants and therefore uncertainties exist. We cope with this challenge by introducing a system that evaluates the options a vehicle has during overtaking (completing or aborting the maneuver) using stochastic models of the surrounding traffic. The stochastic models are used to predict the movements of surrounding road users. As a next step, the general feasibility of the possible maneuvers is checked and rated. The results of this analysis can either be directly used for longitudinal control, be forwarded to a low level controller, or serve as a guideline for the decision-making process of a human driver.
|
|
16:45-17:00, Paper WeB06.3 | Add to My Program |
Estimation of Engine Combustion Instabilities for Transient Operation |
|
Jean, Maxime | IFP Energies Nouvelles |
LEROY, Thomas | IFPEN |
Redaud, Jeanne | IFP Energies Nouvelles |
Keywords: Stochastic systems, Automotive control
Abstract: The contribution of the paper is to propose an estimator of the combustion stability of spark-ignition engines operating in lean-burn conditions. The stability is here taken in the sense of the Coefficient of Variation of the Indicated Mean Effective Pressure. The proposed estimator of the Coefficient of Variation provides in real-time both an estimation of the expected value and of the confidence interval, based on the Bayesian theoretical framework. The estimator only needs a buffer of few measures, making it suitable for transient control. It could lead to the construction of a robust controller in learn-burn conditions, ensuring a better compromise between the engine efficiency and the prevention of combustion instabilities.
|
|
17:00-17:15, Paper WeB06.4 | Add to My Program |
Flexible Predictive Hybrid Powertrain Management with V2X Information (I) |
|
Deng, Junpeng | Johannes Kepler University Linz |
Meier, Florian | Johannes Kepler University Linz |
Del Re, Luigi | Johannes Kepler University Linz |
Keywords: Automotive control, Automotive systems, Optimal control
Abstract: Using knowledge of the future route and its topology is known to offer substantial fuel savings, and this is even more true for hybrid electric vehicles, as the battery use can be planned in advance, for instance to take into account coming slopes. However, traffic or other environmental conditions can force to deviate from the initial planning making it no longer optimal. In this paper, we propose a flexible double layer approach for energy management of hybrid vehicles able to cope with traffic changes. First, before departure, an expected optimal speed and powertrain state reference is computed on a cloud and sent to an on-board controller. Simple, route-specific engine on/off rules are extracted by the controller and used for an on-board fast convex optimization, which can be conducted frequently along the drive, adapting the references to take into account changes of traffic conditions over longer sections of the route as communicated by V2X. Abrupt disturbances are handled by a lower level Model Predictive Control (MPC). If the condition changes are very substantial, so that the empirical on/off rule seems questionable, the cloud can be asked to perform a full optimization again.
|
|
17:15-17:30, Paper WeB06.5 | Add to My Program |
Optimal Control of Battery Fast Charging Based-On Pontryagin’s Minimum Principle (I) |
|
Park, Saehong | University of California, Berkeley |
Lee, Donggun | University of California, Berkeley |
Ahn, Hyoung Jun | LG Chem |
Tomlin, Claire J. | UC Berkeley |
Moura, Scott | University of California, Berkeley |
Keywords: Energy systems, Optimal control, Electrochemical processes
Abstract: This paper derives provably optimal control trajectories for the Li-ion battery fast charging problem. Conventionally, battery charging protocols must satisfy safety constraints while maximizing the state of charge (SoC) level. In the literature, both computational and experimental studies promote a diversity of algorithms, including pulse charging, multistep constant currents, and more. Although these approaches yield applicable charging algorithms, the literature lacks a rigorous analytical insight into optimal charging trajectories. In this paper, we focus on the Pontryagin’s Minimum Principle for solving optimal control problem for battery fast charging. Specifically, we characterize the optimal control solution with respect to the state constraint bound. The optimal input is analytically derived for a reduced-order electrochemical model. The optimal solutions follow a Bang or Bang-Ride trajectory. Numerical simulations validate the analytical solutions.
|
|
WeB07 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Advances on Finite-Time Control and Consensus |
|
|
Chair: Zhao, Zhi-Liang | Shaanxi Normal University |
Co-Chair: Jiang, Zhong-Ping | New York University |
Organizer: Zhao, Zhi-Liang | Shaanxi Normal University |
Organizer: Jiang, Zhong-Ping | New York University |
|
16:15-16:30, Paper WeB07.1 | Add to My Program |
Fixed-Time Nash Equilibrium Seeking in Non-Cooperative Games (I) |
|
Poveda, Jorge I. | University of Colorado at Boulder |
Krstic, Miroslav | University of California, San Diego |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Optimization algorithms, Adaptive control
Abstract: We introduce a novel class of Nash equilibrium seeking dynamics for non-cooperative games with a finite number of players, where the convergence to the Nash equilibrium is bounded by a KL function with a settling time that can be upper bounded by a positive constant that is independent of the initial conditions of the players, and which can be prescribed a priori by the system designer. The dynamics are model-free, in the sense that the mathematical forms of the cost functions of the players are unknown. Instead, in order to update its own action, each player needs to have access only to real-time evaluations of its own cost, as well as to auxiliary states of neighboring players characterized by a communication graph. Stability and convergence properties are established for both potential games and strongly monotone games. Numerical examples are presented to illustrate our theoretical results.
|
|
16:30-16:45, Paper WeB07.2 | Add to My Program |
Global Output-Feedback Finite-Time Stabilization Using a Switching Technique (I) |
|
Zhao, Zhi-Liang | Shaanxi Normal University |
Ma, Pengjuan | Shaanxi Normal University |
Chen, Sen | Shaanxi Normal University |
Jiang, Zhong-Ping | New York University |
Keywords: Nonlinear output feedback, Lyapunov methods, Stability of nonlinear systems
Abstract: The problem of global finite-time stabilization of lower-triangular nonlinear systems is investigated in this paper. Unlike previous papers that either used the full-state information or assumed that the system nonlinearities satisfy some restrictive conditions, in this paper a novel solution to the global output-feedback finite-time stabilization problem for a larger class of nonlinear systems is proposed under a mild condition. A new analytic method is also developed to prove the global finite-time stability of the closed-loop systems.
|
|
16:45-17:00, Paper WeB07.3 | Add to My Program |
Practical Prescribed Time Tracking Control with User Pre Determinable Precision for Uncertain Nonlinear Systems (I) |
|
Cao, Ye | ChongQing University |
Song, Yongduan | Chongqing University |
Keywords: Adaptive control, Uncertain systems, Robust control
Abstract: In this paper we present a practical prescribed time (PPT) control for a class of uncertain nonlinear systems to achieve pre-assignable precision within a finite time that can be specified in advance irrespective of initial condition or any design parameter. Specifically, by introducing a novel time-varying constraining function, we convert the original finite-time tracking control problem into one with a deferred constraint on tracking error, then by stabilizing of which, we solve the problem with the following appealing features: 1) both settling time and tracking accuracy can be explicitly specified in advance regardless of initial conditions; 2) the proposed control is based upon regular (rather than fractional power) state feedback, thus is continuous and smooth everywhere; and 3) the results are global in that the prescribed time tracking is ensured for states starting from anywhere within physically possible domain.
|
|
17:00-17:15, Paper WeB07.4 | Add to My Program |
Model Predictive Control for Discrete-Time Linear Systems with Finite-Time Convergence (I) |
|
Zhu, Bing | Beihang University |
Zuo, Zongyu | Beihang University (aka Beijing University of Aeronautics and As |
Ding, Zhengtao | The University of Manchester |
Keywords: Predictive control for linear systems
Abstract: In this paper, a model predictive control (MPC) approach with finite-time convergence is proposed for constrained discrete-time linear systems. The proposed MPC is constructed based on a criterion of finite-time convergence for discrete-time linear systems, where its Lyapunov function decreases with a rate greater than that of exponential convergence. In the finite-time MPC, the cost function is designed particularly by using the proposed finite-time criterion. Recursive feasibility of optimization and stability of the closed-loop system can be proved within the classical MPC framework.
|
|
17:15-17:30, Paper WeB07.5 | Add to My Program |
Fixed-Time Convergent Consensus Algorithm of Networked Nonholonomic Multi-Agent Systems (I) |
|
Defoort, Michael | UPHF |
Floquet, Thierry | CNRS |
Perruquetti, Wilfrid | Ecole Centrale De Lille |
Keywords: Agents-based systems, Autonomous robots, Lyapunov methods
Abstract: In this paper, the problem of fixed-time leader-follower consensus problem of nonholonomic multi-agent systems is under study. Using the “desingularisation method” introduced in the seminal paper by J.M. Coron [6], new fixed- time controllers/observers for the double integrator system are designed. Following those results, a switching consensus protocol which guarantees the tracking errors stabilization in fixed-time which does not depend on the initial conditions of the multi-agent system is provided. Simulation results on a fleet of wheeled mobile robots show the effectiveness of the proposed scheme.
|
|
WeB08 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Game Theory and Learning |
|
|
Chair: Grammatico, Sergio | Delft Univ. of Tech |
Co-Chair: Etesami, S. Rasoul | University of Illinois at Urbana-Champaign |
|
16:15-16:30, Paper WeB08.1 | Add to My Program |
Stability of Gradient Learning Dynamics in Continuous Games: Scalar Action Spaces |
|
Chasnov, Benjamin J | University of Washington |
Calderone, Dan | University of Washington |
Acikmese, Behcet | University of Washington |
Burden, Samuel A. | University of Washington |
Ratliff, Lillian J. | University of Washington |
Keywords: Machine learning, Game theory, Optimization algorithms
Abstract: Learning processes in games explain how players grapple with one another in seeking an equilibrium. We study a natural model of learning based on individual gradients in two-player continuous games. In such games, the arguably natural notion of a local equilibrium is a differential Nash equilibrium. However, the set of locally exponentially stable equilibria of the learning dynamics do not necessarily coincide with the set of differential Nash equilibria of the corresponding game. To characterize this gap, we provide formal guarantees for the stability or instability of such fixed points by leveraging the spectrum of the linearized game dynamics. We provide a comprehensive understanding of scalar games and find that equilibria that are both stable and Nash are robust to variations in learning rates.
|
|
16:30-16:45, Paper WeB08.2 | Add to My Program |
Learning Pure Nash Equilibrium in Smart Charging Games |
|
Sohet, Benoît | EDF R&D Lab’ Paris-Saclay, Avignon University |
Hayel, Yezekael | University of Avignon |
Beaude, Olivier | EDF R&D |
Jeandin, Alban | EDF R&D, MIRE Dept, EDF Lab’ Paris-Saclay |
Keywords: Machine learning, Game theory, Smart grid
Abstract: Reinforcement Learning Algorithms (RLA) are useful machine learning tools to understand how decision makers react to signals. It is known that RLA converge towards the pure Nash Equilibria (NE) of finite congestion games and more generally, finite potential games. For finite congestion games, only separable cost functions are considered. However, non-separable costs, which depend on the choices of all players instead of only those choosing the same resource, may be relevant in some circumstances, like in smart charging games. In this paper, finite congestion games with non-separable costs are shown to have an ordinal potential function, leading to the existence of an action-dependent continuous potential function. The convergence of a synchronous RLA towards the pure NE is then extended to this more general class of congestion games. Finally, a smart charging game is designed for illustrating convergence of such learning algorithms.
|
|
16:45-17:00, Paper WeB08.3 | Add to My Program |
Nash Equilibrium Seeking under Partial-Decision Information Over Directed Communication Networks |
|
Bianchi, Mattia | Delft University of Technology |
Grammatico, Sergio | Delft Univ. of Tech |
Keywords: Optimization algorithms, Game theory, Variational methods
Abstract: We consider the Nash equilibrium problem in a partial-decision information scenario. Specifically, each agent can only receive information from some neighbors via a communication network, while its cost function depends on the strategies of possibly all agents. In particular, while the existing methods assume undirected or balanced communication, in this paper we allow for non-balanced, directed graphs. We propose a fully-distributed pseudo-gradient scheme, which is guaranteed to converge with linear rate to a Nash equilibrium, under strong monotonicity and Lipschitz continuity of the game mapping. Our algorithm requires global knowledge of the communication structure, namely of the Perron-Frobenius eigenvector of the adjacency matrix and of a certain constant related to the graph connectivity. Therefore, we adapt the procedure to setups where the network is not known in advance, by computing the eigenvector online and by means of vanishing step sizes.
|
|
17:00-17:15, Paper WeB08.4 | Add to My Program |
Multi-Agent Reinforcement Learning in Cournot Games |
|
Shi, Yuanyuan | University of Washington |
Zhang, Baosen | University of Washington |
Keywords: Game theory, Machine learning, Agents-based systems
Abstract: In this work, we study the interaction of strategic agents in continuous action Cournot games with limited information feedback. Cournot game is the essential market model for many socio-economic systems where agents learn and compete without the full knowledge of the system or each other. We consider the dynamics of the policy gradient algorithm, which is a widely adopted continuous control reinforcement learning algorithm, in concave Cournot games. We prove the convergence of policy gradient dynamics to the Nash equilibrium when the price function is linear or the number of agents is two. This is the first result (to the best of our knowledge) on the convergence property of learning algorithms with continuous action spaces that do not fall in the no-regret class.
|
|
17:15-17:30, Paper WeB08.5 | Add to My Program |
Dynamic Assortment with Limited Inventories and Set-Dependent Revenue Functions |
|
Etesami, S. Rasoul | University of Illinois at Urbana-Champaign |
Keywords: Optimization algorithms, Game theory, Optimization
Abstract: We consider an online assortment problem with [n]:={1,2,ldots,n} sellers, each holding exactly one item iin[n] with initial inventory c_iin mathbb{Z}_+, and a sequence of homogeneous buyers arriving over a finite time horizon t=1,2,ldots,m. There is an online platform whose goal is to offer a subset S_tsubseteq [n] of sellers to the arriving buyer at time t to maximize the expected revenue derived over the entire horizon while respecting the inventory constraints. Given an assortment S_t at time t, it is assumed that the buyer will select an item from S_t based on the well-known multinomial logit model, a well-justified choice model from the economic literature. In this model, the revenue obtained from selling an item i at a given time t critically depends on the assortment S_t offered at that time and is given by the Nash equilibrium of a Bertrand game among the sellers in S_t. This imposes a strong dependence/externality among the offered assortments, items revenues, and inventory levels. Despite that challenge, we devise a constant-competitive algorithm for the online assortment problem with homogeneous buyers and evaluate its performance via numerical results.
|
|
WeB09 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Optimization Algorithms II |
|
|
Chair: Graichen, Knut | University Erlangen-Nürnberg (FAU) |
Co-Chair: Fagiano, Lorenzo | Politecnico Di Milano |
|
16:15-16:30, Paper WeB09.1 | Add to My Program |
An Optimal Multistage Stochastic Gradient Method for Minimax Problems |
|
Fallah, Alireza | MIT |
Ozdaglar, Asu | MIT |
Pattathil, Sarath | MIT |
Keywords: Optimization algorithms, Machine learning, Optimization
Abstract: In this paper, we study the minimax optimization problem in the smooth and strongly convex-strongly concave setting when we have access to noisy estimates of gradients. In particular, we first analyze the stochastic Gradient Descent Ascent (GDA) method with constant stepsize, and show that it converges to a neighborhood of the solution of the minimax problem. We further provide tight bounds on the convergence rate and the size of this neighborhood. Next, we propose a multistage variant of stochastic GDA (M-GDA) that runs in multiple stages with a particular learning rate decay schedule and converges to the exact solution of the minimax problem. We show M-GDA achieves the lower bounds in terms of noise dependence without any assumptions on the knowledge of noise characteristics. We also show that M-GDA obtains a linear decay rate with respect to the error’s dependence on the initial error, although the dependence on condition number is suboptimal. In order to improve this dependence, we apply the multistage machinery to the stochastic Optimistic Gradient Descent Ascent (OGDA) algorithm and propose the M-OGDA algorithm which also achieves the optimal linear decay rate with respect to the initial error. To the best of our knowledge, this method is the first to simultaneously achieve the best dependence on noise characteristic as well as the initial error and condition number.
|
|
16:30-16:45, Paper WeB09.2 | Add to My Program |
Taming Convergence for Asynchronous Stochastic Gradient Descent with Unbounded Delay in Non-Convex Learning |
|
Zhang, Xin | Iowa State University |
Liu, Jia | Iowa State University |
Zhu, Zhengyuan | Iowa State University |
Keywords: Optimization algorithms
Abstract: Understanding the convergence performance of asynchronous stochastic gradient descent method (Async-SGD) has received increasing attention in recent years due to their foundational role in machine learning. To date, however, most of the existing works are restricted to either bounded gradient delays or convex settings. In this paper, we focus on Async-SGD and its variant Async-SGDI (which uses increasing batch size) for non-convex optimization problems with unbounded gradient delays. We prove o(1/sqrt{k}) convergence rate for Async-SGD and o(1/k) for Async-SGDI. Also, a unifying sufficient assumption for Async-SGD's convergence is proposed, which includes two major gradient delay models in the literature as special cases.
|
|
16:45-17:00, Paper WeB09.3 | Add to My Program |
On the Use of Set Membership Theory for Global Optimization of Black-Box Functions |
|
Sabug, Lorenzo Jr. | Politecnico Di Milano |
Ruiz, Fredy | Politecnico Di Milano |
Fagiano, Lorenzo | Politecnico Di Milano |
Keywords: Optimization algorithms, Optimization
Abstract: Many science and engineering applications feature non-convex optimization problems where the performance is not explicitly modeled by a cost or reward function, i.e. it is a black box. Examples include most complex design problems where experimental tests are the main method to evaluate performance of chosen values of the decision variables, in fields such as mechanics, fluid-dynamics, electromagnetics and/or magnetohydrodynamics. Solving these problems can be done iteratively: the next value of the decision variables is chosen based on the outcome generated by the previous tests. The time and resource overhead in conducting tests, however, raises the issue of most efficiently choosing the next test point according to previous observations. To approach this issue, a new global optimization strategy based on a Set Membership framework is proposed. Assuming a Lipschitz continuous cost function, the presented algorithm builds an approximation of the latter to decide whether to exploit the best result obtained so far, or to further explore the decision space. The proposed algorithm is presented and some implementation aspects are discussed. Its performance is evaluated on a set of benchmark non-convex problems and compared with those of other global optimization approaches.
|
|
17:00-17:15, Paper WeB09.4 | Add to My Program |
Safe Bayesian Optimization under Unknown Constraints |
|
Bergmann, Daniel | MTU Friedrichshafen |
Graichen, Knut | University Erlangen-Nürnberg (FAU) |
Keywords: Optimization algorithms, Uncertain systems, Machine learning
Abstract: This paper presents a safe optimization method for minimizing an unknown cost function subject to unknown inequality and equality constraints. The cost function as well as the constraints evaluation may be corrupted with Gaussian measurement noise with known uncertainty. The focus especially lies in the safe exploration of the cost function, which means that an evaluation of the cost function far away of previous evaluations is not favoured. Also evaluations in regions where the constraints are violated are undesired. This is the case, for example, in technical applications, where systems may become unstable or damaged, if constraints are violated. To this end, a combination of the expected improvement of the cost and its mean is minimized, while bounding the variance of the cost. The inequality and equality constraints are reformulated as constraints for the probability of the constraint violation. The optimization method is evaluated on numerical examples.
|
|
17:15-17:30, Paper WeB09.5 | Add to My Program |
A Convergence-Preserving Data Integrity Attack on Distributed Optimization Using Local Information |
|
Ding, Tie | Shanghai Jiao Tong University |
XU, Qianwen | Nanyang Technological University |
Zhu, Shanying | Shanghai Jiao Tong University |
Guan, Xin-Ping | Shanghai Jiao Tong University |
Keywords: Optimization algorithms, Computer/Network Security
Abstract: This paper introduces a novel data integrity attack in distributed optimization (DO) problems where all agents cooperatively minimize the sum of their cost functions except a selfish dishonest agent who misleads all agents’ states to converge to its local optimal solution by transmitting falsified information during DO iteration. Different from most existing attacks that require global information, this data integrity attack does not require any extra information and only relies on local information and communication with neighbors. It is shown that even though the dishonest agent shares falsified information, it preserves the linear convergence of the DO algorithm. This means that the attack is implemented locally and difficult to be detected by threshold value method. We provide numerical simulation to verify theoretical results as well as the performance of this attack applied to other DO algorithms. By revealing the vulnerability of DO algorithm to such data integrity attack, this paper conveys the message that besides designing exact and effective DO algorithms, it is equally important to protect the distributed network from potential malicious attacks to avoid performance loss.
|
|
WeB10 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Markov Processes I |
|
|
Chair: Zhu, Quanyan | New York University |
Co-Chair: Jain, Rahul | University of Southern California |
|
16:15-16:30, Paper WeB10.1 | Add to My Program |
Expert Selection in High-Dimensional Markov Decision Processes |
|
Rubies Royo, Vicenc | UC Berkeley |
Mazumdar, Eric | UC Berkeley |
Dong, Roy | University of Illinois at Urbana-Champaign |
Tomlin, Claire J. | UC Berkeley |
Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Markov processes, Adaptive systems, Neural networks
Abstract: In this work we present a multi-armed bandit framework for online expert selection in Markov decision processes and demonstrate its use in high-dimensional settings. Our method takes a set of candidate expert policies and switches between them to rapidly identify the best performing expert using a variant of the classical upper confidence bound algorithm, thus ensuring low regret in the overall performance of the system. This is useful in applications where several expert policies may be available, and one needs to be selected at run-time for the underlying environment.
|
|
16:30-16:45, Paper WeB10.2 | Add to My Program |
Triggered Measurements in Markov Processes for Entropy-Constrained State Estimation with Application to Precision Agriculture |
|
Li, Nan | University of Michigan |
Li, Zhaojian | Michigan State University |
Dong, Liang | Iowa State University |
Girard, Anouck | University of Michigan, Ann Arbor |
Kolmanovsky, Ilya V. | The University of Michigan |
Lu, Renfu | U.S. Department of Agriculture |
Keywords: Markov processes, Constrained control, Control applications
Abstract: This paper presents an optimization-based technique for triggering measurements in a hidden Markov model so that the number/rate of measurements is minimized while maintaining accuracy of the state estimates to satisfy a prescribed constraint. Here, the accuracy of a state estimate is characterized by the entropy of the distribution associated with the estimate. After introducing the general approach, this paper also discusses its application to precision agriculture. In particular, we consider site-specific monitoring of soil nitrate levels and show through simulations that the required number of measurements can be reduced with the proposed technique compared to conventional strategies where measurements are taken at a fixed rate.
|
|
16:45-17:00, Paper WeB10.3 | Add to My Program |
Finite Time Guarantees for Continuous State MDPs with Generative Model |
|
Sharma, Hiteshi | USC |
Jain, Rahul | University of Southern California |
Keywords: Markov processes, Iterative learning control, Uncertain systems
Abstract: In this paper, we present Online Empirical Value Learning (ONEVaL), an ‘online’ reinforcement learning algo- rithm for continuous MDPs that is ‘quasi-model-free’ (needs a generative/simulation model but not the model per se) that can compute nearly-optimal policies and comes with non- asymptotic performance guarantees including prescriptions on required sample complexity for specified performance bounds. The algorithm relies on use of a ‘fully’ randomized policy that will generate a β-mixing sample trajectory. It also relies on randomized function approximation in an RKHS for arbitrarily small function approximation error, and an ‘empirical’ estimate of value from the next state by several samples of the next state from the generative model. We demonstrate its’ good numerical performance on some benchmark problems. We note that the algorithm requires no hyper-parameter tuning, and is also robust to other concerns that seem to plague Deep RL algorithms.
|
|
17:00-17:15, Paper WeB10.4 | Add to My Program |
Censored Markov Decision Processes: A Framework for Safe Reinforcement Learning in Collaboration with External Systems |
|
Kohjima, Masahiro | NTT Corporation |
Takahashi, Masami | NTT Corporation |
Toda, Hiroyuki | NTT Corporation |
Keywords: Markov processes, Learning, Stochastic optimal control
Abstract: The importance of safe reinforcement learning (safe RL) is widely recognized for enhancing real world systems. In this study, we construct the censored Markov decision process (CeMDP), a new Markov Decision Process (MDP) framework that describes the interaction of environment, learner and external systems, e.g., human intervention or pre-designed controller for emergency response. We also theoretically analyze the relation of CeMDP to existing frameworks such as the semi-Markov decision process, MDP with Option (OMDP) and standard MDP; the analysis clarifies that CeMDP is a special case of OMDP and can, with environment redefinition, be represented by MDP. This finding allows us to design planning and reinforcement learning algorithms for CeMDP. We confirm the validity of the theory and algorithms by numerical experiments.
|
|
17:15-17:30, Paper WeB10.5 | Add to My Program |
Distributed Stabilization of Two Interdependent Markov Jump Linear Systems with Partial Information |
|
Peng, Guanze | New York University |
Chen, Juntao | New York University |
Zhu, Quanyan | New York University |
Keywords: Markov processes, Linear systems, Stability of linear systems
Abstract: In this paper, we study the stabilization of two interdependent Markov jump linear systems (MJLS) with partial information, where the interdependency arises as the transition of the mode of one system depends on the states of the other system. First, we formulate a framework for the two interdependent MJLSs to capture the interactions between various entities in the system, where the modes of the system cannot be observed directly. Instead, a signal which contains information of the modes can be obtained. Then, depending on the scope of the available system state information (global or local), we design centralized and distributed controllers, respectively, that can stochastically stabilize the overall interdependent MJLS. In addition, the sufficient stabilization conditions for the system under both types of information structure are derived. Finally, we provide a numerical example to illustrate the effectiveness of the designed controllers.
|
|
WeB11 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Distributed Optimization and Learning for Networked Systems I |
|
|
Chair: Yang, Tao | Northeastern University |
Co-Chair: Li, Na | Harvard University |
Organizer: Yang, Tao | Northeastern University |
Organizer: Uribe, Cesar | Massachusetts Institute of Technology |
Organizer: Lu, Jie | ShanghaiTech University |
Organizer: Nedic, Angelia | Alphatech Inc |
|
16:15-16:30, Paper WeB11.1 | Add to My Program |
GT-SAGA: A Fast Incremental Gradient Method for Decentralized Finite-Sum Minimization (I) |
|
Xin, Ran | Carnegie Mellon University |
Li, Boyue | Carnegie Mellon University |
Kar, Soummya | Carnegie Mellon University |
Khan, Usman A. | Tufts University |
Keywords: Stochastic systems, Optimization algorithms, Distributed control
Abstract: In this paper, we study decentralized solutions for finite-sum minimization problems when the underlying training data is distributed over a network of nodes. In particular, we describe the GT-SAGA algorithm that combines variance reduction and gradient tracking to achieve both robust performance and fast convergence. Variance reduction is implemented locally to asymptotically estimate each local batch gradient at each node, while gradient tracking fuses the local estimated gradients across the nodes. Combining variance reduction and gradient tracking thus enables linear convergence to the optimal solution of strongly-convex problems while keeping a low per-iteration computation complexity at each node. We cast the convergence and behavior of GT-SAGA and related methods in the context of certain practical tradeoffs and further compare their performance with the help of numerical experiments over both convex and non-convex problems.
|
|
16:30-16:45, Paper WeB11.2 | Add to My Program |
Linear Convergence for Distributed Optimization without Strong Convexity (I) |
|
Yi, Xinlei | KTH Royal Institute of Technology |
Zhang, Shengjun | University of North Texas |
Yang, Tao | Northeastern University |
Chai, Tianyou | Northeastern University |
Johansson, Karl H. | Royal Institute of Technology |
Keywords: Optimization algorithms, Networked control systems
Abstract: This paper considers the distributed optimization problem of minimizing a global cost function formed by a sum of local smooth cost functions by using local information exchange. Various distributed optimization algorithms have been proposed for solving such a problem. A standard condition for proving the linear convergence for existing distributed algorithms is the strong convexity of the cost functions. However, the strong convexity may not hold for many practical applications, such as least squares and logistic regression. In this paper, we propose a distributed primal-dual gradient descent algorithm and establish its linear convergence under the condition that the global cost function satisfies the Polyak-{L}ojasiewicz condition. This condition is weaker than strong convexity and the global minimizer is not necessarily unique. The theoretical result is illustrated by numerical simulations.
|
|
16:45-17:00, Paper WeB11.3 | Add to My Program |
Zeroth-Order Feedback Optimization for Cooperative Multi-Agent Systems (I) |
|
Tang, Yujie | Harvard University |
Ren, Zhaolin | Harvard University |
Li, Na | Harvard University |
Keywords: Optimization algorithms, Cooperative control, Networked control systems
Abstract: We consider a class of multi-agent optimization problems, where each agent is associated with an action vector and a local cost that depends on the joint actions of all agents, and the goal is to minimize the average of the local costs. Such problems arise in many control applications such as wind farm operation and mobile sensor coverage. In many of these applications, while we have access to (zeroth-order) information about function values, it can be difficult to obtain (first-order) gradient information. In this paper, we propose a zeroth-order feedback optimization (ZFO) algorithm based on two-point gradient estimators for the considered class of problems, and provide the convergence rate to a first-order stationary point for nonconvex problems. We complement our theoretical analysis with numerical simulations on a wind farm power maximization problem.
|
|
17:00-17:15, Paper WeB11.4 | Add to My Program |
A Distributed Proximal Primal-Dual Algorithm for Nonsmooth Optimization with Coupling Constraints (I) |
|
Wu, Xuyang | ShanghaiTech University |
Wang, He | ShanghaiTech University |
Lu, Jie | ShanghaiTech University |
Keywords: Optimization algorithms, Optimization
Abstract: This paper develops a novel algorithm that tackles a general distributed convex optimization problem with a nonsmooth objective function and a series of coupling nonlinear inequality and linear equality constraints. To address such a problem, we first design a distributed proximal algorithm for problems with coupling equality constraints only, and then extend this algorithm to problems with both inequality and equality constraints by incorporating a virtual-queue-based method. The resulting algorithm, referred to as Proximal Primal-Dual Algorithm (PPDA), is shown to achieve O(1/k) rates of convergence with respect to both optimality and feasibility. Compared to the alternative methods in the literature, PPDA eliminates their assumptions on the smoothness of the objective function and achieves a stronger convergence rate result. In addition, PPDA exhibits faster convergence in a couple of numerical examples.
|
|
17:15-17:30, Paper WeB11.5 | Add to My Program |
Towards Totally Asynchronous Primal-Dual Convex Optimization in Blocks (I) |
|
Hendrickson, Katherine | University of Florida |
Hale, Matthew | University of Florida |
Keywords: Optimization algorithms, Networked control systems, Machine learning
Abstract: We present a parallelized primal-dual algorithm for solving constrained convex optimization problems. The algorithm is "block-based," in that vectors of primal and dual variables are partitioned into blocks, each of which is updated only by a single processor. We consider four possible forms of asynchrony: in updates to primal variables, updates to dual variables, communications of primal variables, and communications of dual variables. We explicitly construct a family of counterexamples to rule out permitting asynchronous communication of dual variables, though the other forms of asynchrony are permitted, all without requiring bounds on delays. A first-order update law is developed and shown to be robust to asynchrony. We then derive convergence rates to a Lagrangian saddle point in terms of the operations agents execute, without specifying any timing or pattern with which they must be executed. These convergence rates contain a synchronous algorithm as a special case and are used to quantify an "asynchrony penalty." Numerical results illustrate these developments.
|
|
WeB12 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Social Dynamics I |
|
|
Chair: Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Co-Chair: Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
|
16:15-16:30, Paper WeB12.1 | Add to My Program |
Interplay between Homophily-Based Appraisal Dynamics and Influence-Based Opinion Dynamics: Modeling and Analysis |
|
Liu, Fangzhou | Technische Universität München |
CUI, Shaoxuan | Technical University of Munich |
Mei, Wenjun | ETH Zurich |
Dörfler, Florian | Swiss Federal Institute of Technology (ETH) Zurich |
Buss, Martin | Technische Universitaet Muenchen |
Keywords: Agents-based systems, Network analysis and control, Stability of nonlinear systems
Abstract: In social systems, the evolution of interpersonal appraisals and individual opinions are not independent processes but intertwine with each other. Despite extensive studies on both opinion dynamics and appraisal dynamics separately, no previous work has ever combined these two processes together. In this paper, we propose a novel and intuitive model on the interplay between homophily-based appraisal dynamics and influence-based opinion dynamics. We assume that individuals' opinions are updated via the influence network constructed from their interpersonal appraisals, which are in turn updated based on the individual opinions via the homophily mechanism. By theoretical analysis, we characterize the set of equilibria and some transient behavior of our model. Moreover, we establish the equivalence among the convergence of the appraisal network to social balance, the modulus consensus of individual opinions, and the non-vanishing appraisals. Monte Carlo validations further show that the non-vanishing appraisals condition holds for generic initial conditions. Compared with previous works that explain the emergence of social balance via person-to-person homophily mechanism, our model provides an alternative explanation in terms of the person-to-entity homophily mechanism. In addition, our model also describes how individuals' opinions on multiple irrelevant issues become correlated and converge to modulus consensus over time-varying influence networks.
|
|
16:30-16:45, Paper WeB12.2 | Add to My Program |
Consensus Problems on Clustered Networks |
|
De Pasquale, Giulia | University of Padova |
Valcher, Maria Elena | Universita' Di Padova |
Keywords: Agents-based systems, Networked control systems, Linear systems
Abstract: The aim of this paper is to address consensus in the context of networked agents whose interactions can be modelled by an undirected, signed, weighted, connected and clustered graph. Specifically, we assume that individuals can be split into three groups representing the decision classes on a given specific topic, for instance, the in favour, abstained and opponent agents. Interactions between agents belonging to the same cluster are cooperative, meaning that the link connecting those agents has a non-negative weight, while interactions between agents belonging to different clusters are antagonistic and therefore a non-positive weight is associated to the link connecting them. We will show that under certain regularity assumptions it is possible to devise a simple modification of DeGroot’s algorithm that ensures that the opinions of agents who cooperate converge to consensus, i.e. the opinions of agents belonging to the same cluster converge to the same decision.
|
|
16:45-17:00, Paper WeB12.3 | Add to My Program |
Spatial Properties of a Mixed Linear-Nonlinear Model for Opinion Formation in Networks |
|
Sarker, Subir | Washington State University |
Roy, Sandip | Washington State University |
Keywords: Network analysis and control, Networked control systems, Control of networks
Abstract: A standard model for opinion formation in networks, which comprises a linear model for opinion-evolution coupled with a non-linear model for self-confidence dynamics, is considered. The main contribution of this study is to characterize the spatial pattern of the model dynamics in response to a local disruption. Specifically, we show that both the linear and nonlinear models exhibit a diffusive structure, such that responses to disruptions decay in amplitude along cutsets away from the disruption source in the network's graph. The spatial characterization is then used to motivate and develop a model reduction technique for the nonlinear self-confidence dynamics. Finally, the formal results and model reduction algorithm are illustrated in an example.
|
|
17:00-17:15, Paper WeB12.4 | Add to My Program |
Expressed and Private Opinion Dynamics on Influence Networks with Asynchronous Updating |
|
Xia, Weiguo | Dalian University of Technology |
Liang, Hong | Dalian University of Technology |
Ye, Mengbin | Curtin University |
Keywords: Cooperative control, Distributed control, Agents-based systems
Abstract: In this paper, an asynchronous discrete-time opinion dynamics model on a social influence network is considered. At each time instant, a single individual activates and updates two state variables simultaneously. The individual's new private opinion is a weighted average of her current private opinion, the expressed opinions of her neighbors, and a constant prejudice. Meanwhile, the individual's new expressed opinion is equal to her current private opinion, altered due to a pressure to conform to the public opinion as perceived by the individual, being the average expressed opinion among her neighbors. We analyze the system for social networks which are rooted, and show that if no individual holds a prejudice, then a mild assumption on the activation sequence of the individuals guarantees convergence. In particular, the expressed and private opinions of all individuals converge to the same value exponentially fast, with two lower bounds on convergence speeds based on two different assumptions on the network topology. Simulations are provided to illustrate the result, and provide support to the conjecture that the system dynamics may converge even if individuals hold an existing prejudice.
|
|
17:15-17:30, Paper WeB12.5 | Add to My Program |
A Hybrid Model of Opinion Dynamics with Memory-Based Connectivity |
|
Mariano, Simone | Université De Lorraine |
Morarescu, Irinel-Constantin | CRAN, CNRS, Université De Lorraine |
Postoyan, Romain | CNRS, CRAN, Université De Lorraine |
Zaccarian, Luca | LAAS-CNRS and University of Trento |
Keywords: Agents-based systems, Stability of hybrid systems, Lyapunov methods
Abstract: Given a social network where the individuals know the identity of the other members, we present a model of opinion dynamics where the connectivity among the individuals depends on both their current and past opinions. Thus, their interactions are not only based on the present states but also on their past relationships. The model is a multi-agent system with active or inactive pairwise interactions depending on auxiliary state variables filtering the instantaneous opinions, thereby taking the past experience into account. When an interaction is (de)activated, a jump occurs, leading to a hybrid model. The proven stability properties ensure that opinions converge to local agreements/clusters as time grows. Simulation results are provided to illustrate the theoretical guarantees.
|
|
WeB13 Invited Session, Coordinated Universal Time (UTC) |
Add to My Program |
Learning-Based Control II |
|
|
Chair: Trimpe, Sebastian | Max Planck Institute for Intelligent Systems |
Co-Chair: Pappas, George J. | University of Pennsylvania |
Organizer: Muller, Matthias A. | Leibniz University Hannover |
Organizer: Schoellig, Angela P | University of Toronto |
Organizer: Trimpe, Sebastian | RWTH Aachen University |
Organizer: Zeilinger, Melanie N. | ETH Zurich |
|
16:15-16:30, Paper WeB13.1 | Add to My Program |
Control Barrier Functions for Unknown Nonlinear Systems Using Gaussian Processes (I) |
|
Jagtap, Pushpak | KTH Royal Institute of Technology |
Pappas, George J. | University of Pennsylvania |
Zamani, Majid | University of Colorado Boulder |
Keywords: Statistical learning, Formal Verification/Synthesis
Abstract: This paper focuses on the controller synthesis for unknown, nonlinear systems while ensuring safety constraints. Our approach consists of two steps, a learning step that uses Gaussian processes and a controller synthesis step that is based on control barrier functions. In the learning step, we use a data-driven approach utilizing Gaussian processes to learn the unknown control affine nonlinear dynamics together with a statistical bound on the accuracy of the learned model. In the second controller synthesis steps, we develop a systematic approach to compute control barrier functions that explicitly take into consideration the uncertainty of the learned model. The control barrier function not only results in a safe controller by construction but also provides a rigorous lower bound on the probability of satisfaction of the safety specification. Finally, we illustrate the effectiveness of the proposed results by synthesizing a safety controller for a jet engine example.
|
|
16:30-16:45, Paper WeB13.2 | Add to My Program |
A Convex Approach to Robust LQR (I) |
|
Scampicchio, Anna | University of Padova |
Pillonetto, Gianluigi | University of Padova |
Keywords: Optimal control, Linear systems, Uncertain systems
Abstract: In this paper, we propose some new convex strategies for robust optimal control. In particular, we treat the problem of designing finite-horizon linear quadratic regulator (LQR) for uncertain discrete-time systems focusing on minimax strategies. A time-invariant linear control law is obtained just solving sequentially two convex optimization problems, hence obtaining a feedback law that takes into account all the available systems samples. In the case of stabilizable systems, we also generalize our approach by including additional constraints on the closed-loop stability in the optimization scheme. Extensions to time-variant control rules are also discussed, leading to novel and intriguing connections between optimal control and multi-task learning.
|
|
16:45-17:00, Paper WeB13.3 | Add to My Program |
Controller Design Via Experimental Exploration with Robustness Guarantees |
|
Holicki, Tobias | University of Stuttgart |
Scherer, Carsten W. | University of Stuttgart |
Trimpe, Sebastian | RWTH Aachen University |
Keywords: Robust control, LMIs, Machine learning
Abstract: For a partially unknown linear systems, we present a systematic control design approach based on generated data from measurements of closed-loop experiments with suitable test controllers. These experiments are used to improve the achieved performance and to reduce the uncertainty about the unknown parts of the system. This is achieved through a parametrization of auspicious controllers with convex relaxation techniques from robust control, which guarantees that their implementation on the unknown plant is safe. This approach permits to systematically incorporate available prior knowledge about the system by employing the framework of linear fractional representations.
|
|
17:00-17:15, Paper WeB13.4 | Add to My Program |
Learning Control Barrier Functions from Expert Demonstrations (I) |
|
Robey, Alexander | University of Pennsylvania |
Hu, Haimin | University of Pennsylvania |
Lindemann, Lars | Royal Institute of Technology, KTH |
Zhang, Hanwen | University of Pennsylvania |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Tu, Stephen | University of California, Berkeley |
Matni, Nikolai | University of Pennsylvania |
Keywords: Learning, Lyapunov methods, Optimal control
Abstract: Inspired by the success of imitation and inverse reinforcement learning in replicating expert behavior through optimal control, we propose a learning based approach to safe controller synthesis based on control barrier functions (CBFs). We consider the setting of a known non-linear control affine dynamical system and assume that we have access to safe trajectories generated by an expert — a practical example of such a setting would be a kinematic model of a self-driving vehicle with safe trajectories (e.g., trajectories that avoid collisions with obstacles in the environment) generated by a human driver. We then propose and analyze an optimization based approach to learning a CBF that enjoys provable safety guarantees under suitable Lipschitz smoothness assumptions on the underlying dynamical system. A strength of our approach is that it is agnostic to the parameterization used to represent the CBF, assuming only that the Lipschitz constant of such functions can be efficiently bounded. Furthermore, if the CBF parameterization is convex, then under mild assumptions, so is our learning process. We end with extensive numerical evaluations of our results on both planar and realistic examples, using both random feature and deep neural network parameterizations of the CBF. To the best of our knowledge, these are the first results that learn provably safe control barrier functions from data.
|
|
17:15-17:30, Paper WeB13.5 | Add to My Program |
On the Robustness of Equilibria in Generalized Aggregative Games (I) |
|
Fabiani, Filippo | University of Oxford |
Margellos, Kostas | University of Oxford |
Goulart, Paul J. | University of Oxford |
Keywords: Learning, Game theory, Robust control
Abstract: We assess the robustness of the equilibria in generalized Nash equilibrium problems in aggregative form subject to linear coupling constraints affected by uncertainty with a possibly unknown probability distribution. Within a data-driven context, we apply the scenario approach paradigm to provide a-posteriori feasibility certificates for the entire set of generalized Nash equilibria of the game. Then, we show that assessing the violation probability of such set merely requires one to enumerate the constraints that ``shape'' it. For the class of aggregative games, this results in solving a feasibility problem on each active facet of the feasibility region, for which we propose a semi-decentralized, structure-preserving algorithm.
|
|
WeB14 Regular Session |
Add to My Program |
Iterative Learning Control |
|
|
Chair: Chu, Bing | University of Southampton |
Co-Chair: Wan, Yan | University of Texas at Arlington |
|
16:15-16:30, Paper WeB14.1 | Add to My Program |
Safety-Critical Online Control with Adversarial Disturbances |
|
Ramasubramanian, Bhaskar | University of Washington, Seattle |
Xiao, Baicen | University of Washington |
Bushnell, Linda | University of Washington |
Poovendran, Radha | University of Washington |
Keywords: Iterative learning control, Resilient Control Systems, Robust adaptive control
Abstract: This paper studies the control of safety-critical dynamical systems in the presence of adversarial disturbances. We seek to synthesize state-feedback controllers to minimize a cost incurred due to the disturbance while respecting the safety constraint. The safety constraint is given by a bound on an H-∞ norm, while the cost is specified as an upper bound on an H-2 norm of the system. We consider an online setting where costs at each time are revealed only after the controller at that time is chosen. We propose an iterative approach to the synthesis of the controller by solving a modified discrete-time Riccati equation. Solutions of this equation enforce the safety constraint. We compare the cost of this controller with that of the optimal controller when one has complete knowledge of disturbances and costs in hindsight. We show that the regret function, which is defined as the difference between these costs, called varies logarithmically with the time horizon. We validate our approach on a process control setup that is subject to two kinds of adversarial attacks.
|
|
16:30-16:45, Paper WeB14.2 | Add to My Program |
Iterative Learning Control for Region to Region Tracking |
|
Chu, Bing | University of Southampton |
Owens, David H. | The University of Sheffield |
Keywords: Iterative learning control
Abstract: Iterative learning control (ILC) is a high performance control design method for systems working in a repetitive manner by learning from previous experience. Most existing ILC design considers the problem where the desired reference trajectory is defined either fully on the trial duration or on a finite number of intermediate time instants. This paper further expands the applicability of ILC by studying a more general case (named region to region tracking) where there is no desired trajectory defined at all; instead only a region where the system output should reach is given. To solve this problem, a novel ILC algorithm with an norm optimal ILC step and a projection operation is developed. Convergence properties of the algorithm are analysed rigorously. It is also shown that traditional reference trajectory tracking problems can be solved as special cases of the proposed design, resulting in a well-known norm optimal ILC algorithm being recovered and a new point to point ILC algorithm. Numerical simulations are presented to demonstrate the effectiveness of the proposed approach.
|
|
16:45-17:00, Paper WeB14.3 | Add to My Program |
Distributed Iterative Learning Control for Constrained Consensus Tracking Problem |
|
Chen, Bin | University of Southampton |
Chu, Bing | University of Southampton |
Geng, Hua | Tsinghua University |
Keywords: Iterative learning control
Abstract: High precision consensus tracking of networked systems working repetitively has found important applications in various areas. To achieve the high precision tracking, iterative learning control (ILC) has recently been applied. This paper considers the constrained consensus tracking problem in ILC design. We develop a novel constrained ILC algorithm based on the successive projection framework. The resulting algorithm guarantees the satisfaction of the constraints and has appealing convergence properties: when perfect consensus tracking is possible, the tracking error norm converges monotonically to zero; otherwise, it converges monotonically to a `best fit' solution. The proposed algorithm can be applied to both homogeneous and heterogeneous systems, as well as non-minimum phase systems, which is desirable in practice. Furthermore, we provide a distributed implementation for the proposed ILC algorithm using the idea of the alternating direction method of multipliers, allowing the proposed algorithm to be applied to large scale networked systems using only local information. Convergence properties of the algorithm are analysed rigorously and numerical examples are given to demonstrate its effectiveness.
|
|
17:00-17:15, Paper WeB14.4 | Add to My Program |
Zero-Order Optimization-Based Iterative Learning Control |
|
Baumgärtner, Katrin | University of Freiburg |
Diehl, Moritz | University of Freiburg |
Keywords: Iterative learning control, Optimal control
Abstract: We consider an optimization-based iterative learning control (ILC) approach for nonlinear systems where the control input is obtained as the solution of a constrained nonlinear program (NLP). The NLP formulation is based on a - possibly nonlinear - nominal model corrected by the output error which has been observed in the previous trial. Assuming that the sensitivities of the nominal model are sufficiently close to the sensitivities of the real system, we show local convergence of the proposed ILC method to a generally suboptimal solution and derive a bound on the loss of optimality. Even though the algorithm does not require any exact sensitivity information of the true process, it can recover the optimal control input in two special cases: Assuming that the sensitivities are sufficiently similar, the optimal solution is obtained (1) if it lies at a vertex of the feasible set, i.e. it is fully determined by the constraints, or (2) if a reference tracking problem is considered and optimal tracking is feasible.
|
|
17:15-17:30, Paper WeB14.5 | Add to My Program |
Hinfty Tracking Control for Linear Discrete-Time Systems: Model-Free Q-Learning Designs |
|
Yang, Yunjie | Tsinghua University |
Wan, Yan | University of Texas at Arlington |
Zhu, Jihong | Tsinghua University |
Lewis, Frank L. | University of Texas at Arlington |
Keywords: Iterative learning control, Linear systems, Optimal control
Abstract: In this letter, a novel model-free Q-learning based approach is developed to solve the H∞ tracking problem for linear discrete-time systems. A new exponential discounted value function is introduced that includes the cost of the whole control input and tracking error. The tracking Bellman equation and the game algebraic Riccati equation (GARE) are derived. The solution to the GARE leads to the feedback and feedforward parts of the control input. A Q-learning algorithm is then developed to learn the solution of the GARE online without requiring any knowledge of the system dynamics. Convergence of the algorithm is analyzed, and it is also proved that probing noises in maintaining the persistence of excitation (PE) condition do not result in any bias. An example of the F-16 aircraft short period dynamics is developed to validate the proposed algorithm.
|
|
WeB15 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Cyber-Physical Security |
|
|
Chair: Meskin, Nader | Qatar University |
Co-Chair: Poovendran, Radha | University of Washington |
|
16:15-16:30, Paper WeB15.1 | Add to My Program |
Undetectable Cyber Attacks on Communication Links in Multi-Agent Cyber-Physical Systems |
|
Taheri, Mahdi | Concordia University |
Khorasani, Khashayar | Concordia University |
Shames, Iman | The University of Melbourne |
Meskin, Nader | Qatar University |
Keywords: Cyber-Physical Security, Attack Detection, Agents-based systems
Abstract: The objective in this paper is to study and develop conditions for a network of multi-agent cyber-physical systems (MAS) where a malicious adversary can utilize vulnerabilities in order to ensure and maintain cyber attacks undetectable. We classify these cyber attacks as undetectable in the sense that their impact cannot be observed in the generated residuals. It is shown if an agent that is the root of a rooted spanning tree in the MAS graph is under a cyber attack, the attack is undetectable by the entire network. Next we investigate if a non-root agent is compromised, then under certain conditions cyber attacks can become detectable. Moreover, a novel cyber attack that is designated as quasi-covert cyber attack is introduced that can be used to eliminate detectable impacts of cyber attacks to the entire network and maintain these attacks as undetected. Finally, an event-triggered based detector is proposed that can be used to detect the quasi-covert cyber attacks. Numerical simulations are provided to illustrate the effectiveness and capabilities of our proposed methodologies.
|
|
16:30-16:45, Paper WeB15.2 | Add to My Program |
A Nash Equilibrium-Based Moving Target Defense against Stealthy Sensor Attacks |
|
Umsonst, David | KTH Royal Institute of Technology |
SARITAS, Serkan | KTH Royal Institute of Technology |
Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Cyber-Physical Security, Game theory, Attack Detection
Abstract: This paper investigates a moving target defense strategy based on detector threshold switching against stealthy sensor attacks. We model the interactions between the attacker and the defender as a game. While the attacker wants to remain stealthy and maximize its impact, the defender wants to minimize both the cost for investigating false alarms and the attack impact. We define the moving target defense as a mixed strategy Nash equilibrium and are able to formulate an equivalent finite matrix game of the original game. We provide a necessary and sufficient condition for the existence of a moving target defense strategy. A globally optimal moving target defense strategy is obtained via a linear optimization problem by exploiting the structure of the matrix game. Simulations with a four tank system verify that by applying an optimal moving target defense strategy, the defender reduces its cost compared to the optimally chosen fixed detector threshold.
|
|
16:45-17:00, Paper WeB15.3 | Add to My Program |
Dynamic Resilient Network Games Considering Connectivity |
|
Nugraha, Yurid | Tokyo Institute of Technology |
Cetinkaya, Ahmet | National Institute of Informatics |
Hayakawa, Tomohisa | Tokyo Institute of Technology |
Ishii, Hideaki | Tokyo Institute of Technology |
Zhu, Quanyan | New York University |
Keywords: Cyber-Physical Security, Game theory, Computer/Network Security
Abstract: To address cyber security issues in multi-agent type systems, we formulate a game theoretic problem on resilient graphs, where the utilities of the players are mainly related to the connectivity of the network. An attacker is capable to disconnect part of the edges of the graph by emitting jamming signals while, in response, the defender recovers some of them by increasing the transmitted power for the communication over the corresponding edges. The players' actions are constrained by their energy for transmissions and they play this two-stage game repeatedly over time. Their strategies are characterized by the edge connectivity and the timings to start/stop their actions. A numerical example is provided to demonstrate the efficacy of the results.
|
|
17:00-17:15, Paper WeB15.4 | Add to My Program |
Privacy-Preserving Resilience of Cyber-Physical Systems to Adversaries |
|
Ramasubramanian, Bhaskar | University of Washington, Seattle |
Niu, Luyao | Worcester Polytechnic Institute |
Clark, Andrew | Worcester Polytechnic Institute |
Bushnell, Linda | University of Washington |
Poovendran, Radha | University of Washington |
Keywords: Cyber-Physical Security, Resilient Control Systems, Markov processes
Abstract: A cyber-physical system (CPS) is expected to be resilient to more than one type of adversary. In this paper, we consider a CPS that has to satisfy a linear temporal logic (LTL) objective in the presence of two kinds of adversaries. The first adversary has the ability to tamper with inputs to the CPS to influence satisfaction of the LTL objective. The interaction of the CPS with this adversary is modeled as a stochastic game. We synthesize a controller for the CPS to maximize the probability of satisfying the LTL objective under any policy of this adversary. The second adversary is an eavesdropper who can observe labeled trajectories of the CPS generated from the previous step. It could then use this information to launch other kinds of attacks. A labeled trajectory is a sequence of labels, where a label is associated to a state and is linked to the satisfaction of the LTL objective at that state. We use differential privacy to quantify the indistinguishability between states that are related to each other when the eavesdropper sees a labeled trajectory. Two trajectories of equal length will be differentially private if they are differentially private at each state along the respective trajectories. We use a skewed Kantorovich metric to compute distances between probability distributions over states resulting from actions chosen according to policies from related states in order to quantify differential privacy. Moreover, we do this in a manner that does not affect the satisfaction probability of the LTL objective. We validate our approach on a simulation of a UAV that has to satisfy an LTL objective in an adversarial environment.
|
|
17:15-17:30, Paper WeB15.5 | Add to My Program |
Covariance-Robust Dynamic Watermarking |
|
Olfat, Mahbod | UC Berkeley |
Sloan, Stephen | University of California, Berkeley |
Hespanhol, Pedro | UC Berkeley |
Porter, Matthew | University of Michigan |
Vasudevan, Ramanarayan | University of Michigan |
Aswani, Anil | UC Berkeley |
Keywords: Cyber-Physical Security, Uncertain systems, Robust control
Abstract: Attack detection and mitigation strategies for cyberphysical systems (CPS) are an active area of research, and researchers have developed a variety of attack-detection tools such as dynamic watermarking. However, such methods often make assumptions that are difficult to guarantee, such as exact knowledge of the distribution of measurement noise. Here, we develop a new dynamic watermarking method that we call covariance-robust dynamic watermarking, which is able to handle uncertainties in the covariance of measurement noise. Specifically, we consider two cases. In the first this covariance is fixed but unknown, and in the second this covariance is slowly-varying. For our tests, we only require knowledge of a set within which the covariance lies. Furthermore, we connect this problem to that of algorithmic fairness and the nascent field of fair hypothesis testing, and we show that our tests satisfy some notions of fairness. Finally, we exhibit the efficacy of our tests on empirical examples chosen to reflect values observed in a standard simulation model of autonomous vehicles.
|
|
WeB16 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Switched Linear Systems |
|
|
Chair: Dinh, Thach N. | CNAM Paris |
Co-Chair: Sun, Zhendong | Academy of Mathematics & Systems Science, CAS |
|
16:15-16:30, Paper WeB16.1 | Add to My Program |
Feedback Stabilization of Third-Order Switched Linear Control Systems |
|
Sun, Zhendong | Academy of Mathematics & Systems Science, CAS |
Keywords: Switched systems, Stability of hybrid systems
Abstract: In this paper, we address the stabilization problem for third-order continuous-time switched linear control systems. We prove that the switched system is stabilizable iff the uncontrollable sub-mode is stabilizable. For completely controllable switched systems, multi-linear feedback controllers are assigned to one subsystem in a time-varying manner. For incompletely controllable switched systems, mixed time-driven and state-driven switching laws are designed to stabilize both the controllable sub-mode and uncontrollable sub-mode. Two numerical examples are presented to exhibit the effectiveness of the proposed design methodology.
|
|
16:30-16:45, Paper WeB16.2 | Add to My Program |
Interval Estimation for Discrete-Time Switched Linear Systems Based on L_{infty} Observer and Ellipsoid Analysis |
|
Dkhil, Monia | University of Gabes, National Engineering School of Gabes, Elect |
Dinh, Thach N. | CNAM Paris |
Wang, Zhenhua | Harbin Institute of Technology |
Raïssi, Tarek | Conservatoire National Des Arts Et Métiers |
Amairi, Messaoud | National Engineering School of Gabes (ENIG) |
Keywords: Estimation, Switched systems, Linear systems
Abstract: This paper proposes a two-step interval estimation method for discrete-time switched linear systems with unknown but bounded uncertainties. Based on the ellipsoidal analysis, the proposed estimator provides upper and lower bounds of the system state with high computational analysis. The size of the obtained ellipsoids is minimized using the trace criterion. The idea of introducing an L_{infty} performance is also employed in order to improve the estimation accuracy. Its design conditions are given in terms of Linear Matrix Inequalities (LMIs). Finally, a numerical example emphasizes the effectiveness of the contribution.
|
|
16:45-17:00, Paper WeB16.3 | Add to My Program |
Statistical Consistency of Set-Membership Estimator for Linear Systems |
|
Hespanhol, Pedro | UC Berkeley |
Aswani, Anil | UC Berkeley |
Keywords: Identification, Switched systems, Uncertain systems
Abstract: Suppose we can choose from a set of linear autonomous systems with bounded process noise, the dynamics of each system are unknown, and we would like to design a stabilizing policy. The underlying question is how to estimate the dynamics of each system given that measurements of each system will be nonsequential. Though seemingly straightforward, existing proof techniques for proving statistical consistency of system identification procedures fail when measurements are nonsequential. Here, we prove that the set-membership estimator is statistically consistent even when measurements are nonsequential. We numerically illustrate its strong consistency.
|
|
17:00-17:15, Paper WeB16.4 | Add to My Program |
Quantized Stabilization of Continuous-Time Switched Linear Systems |
|
Berger, Guillaume O. | UCLouvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Networked control systems, Switched systems, Quantized systems
Abstract: In this paper, we study the problem of stabilizing continuous-time switched linear systems via mode-dependent quantized state feedback. We derive a closed-form expression for the minimal information data rate from the coder to the controller necessary to achieve stabilization of the system. In particular, it is shown that the evaluation of the minimal data rate for stabilization reduces to the computation of the Lyapunov exponent of some lifted switched linear system, obtained from the original one by using tools from multilinear algebra, and thus can benefit from well-established algorithms for the computation of the Lyapunov exponent. In a second time, drawing upon this expression, we describe a practical coder--controller that stabilizes the system, and whose data rate can be as close as desired to the optimal data rate.
|
|
17:15-17:30, Paper WeB16.5 | Add to My Program |
Finite Data-Rate Feedback Stabilization of Continuous-Time Switched Linear Systems with Unknown Switching Signal |
|
Berger, Guillaume O. | UCLouvain |
Jungers, Raphaël M. | University of Louvain |
Keywords: Networked control systems, Switched systems, Stability of hybrid systems
Abstract: In this paper, we study the problem of stabilizing switched linear systems when only limited information about the state and the mode of the system is available, which occurs in many applications involving networked switched systems (such as cyber-physical systems, IoT, etc.). First, we show that switched linear systems with arbitrary switching, i.e., with no constraint on the switching signal, are in general not stabilizable with a finite data rate. Then, drawing on this result, we restrict our attention to systems satisfying a fairly mild slow-switching assumption, in the sense that the switching signal has an average dwell time bounded away from zero. We show that under this assumption, switched linear systems that are stabilizable in the classical sense remain stabilizable with a finite data rate. A practical coder--controller that stabilizes the system is presented and its applicability is demonstrated on numerical examples.
|
|
WeB17 Regular Session, Coordinated Universal Time (UTC) |
Add to My Program |
Quantum Information and Control |
|
|
Chair: Petersen, Ian R. | Australian National University |
Co-Chair: Amini, Nina H. | CNRS, L2S, CentraleSupelec |
|
16:15-16:30, Paper WeB17.1 | Add to My Program |
Data-Driven System Identification of Linear Quantum Systems Coupled to Time-Varying Coherent Inputs |
|
Nurdin, Hendra I | UNSW Australia |
Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Chen, Jiayin | University of New South Wales |
Keywords: Quantum information and control
Abstract: In this paper, we develop a system identification algorithm to identify a model for unknown linear quantum systems driven by time-varying coherent states, based on empirical single-shot continuous homodyne measurement data of the system's output. The proposed algorithm identifies a model that satisfies the physical realizability conditions for linear quantum systems, challenging constraints not encountered in classical (non-quantum) linear system identification. Numerical examples on a multiple-input multiple-output optical cavity model are presented to illustrate an application of the identification algorithm.
|
|
16:30-16:45, Paper WeB17.2 | Add to My Program |
A Systems Theory Approach to the Synthesis of Minimum Noise Non-Reciprocal Phase-Insensitive Quantum Amplifiers |
|
Petersen, Ian R. | Australian National University |
James, Matthew R. | Australian National University |
Ugrinovskii, Valery | University of New South Wales |
Yamamoto, Naoki | Keio University |
Keywords: Quantum information and control, Linear systems
Abstract: We present a systems theory approach to finding the minimum required level of added quantum noise in a non-reciprocal phase-insensitive quantum amplifier. We also present a synthesis procedure for constructing a quantum optical non-reciprocal phase-insensitive quantum amplifier which adds the minimum level of quantum noise and achieves a required gain and bandwidth. This synthesis procedure is based on a singularly perturbed quantum system involving the broadband approximation of a Bogoliubov transformation and leads to an amplifier involving three squeezers and six beamsplitters.
|
|
16:45-17:00, Paper WeB17.3 | Add to My Program |
On the Robustness of Stabilizing Feedbacks for Quantum Spin-1/2 Systems |
|
Liang, Weichao | CY Cergy Paris Université |
Amini, Nina H. | CNRS, L2S, CentraleSupelec |
Mason, Paolo | CNRS, Laboratoire Des Signaux Et Systèmes, Supélec |
Keywords: Quantum information and control, Stochastic systems, Lyapunov methods
Abstract: In this paper, we consider stochastic master equations describing the evolution of quantum spin-1/2 systems interacting with electromagnetic fields undergoing continuous-time measurements. We suppose that the initial states and the exact values of the physical parameters are unknown. We prove that the feedback stabilization strategy considered in [1] is robust to these imperfections. This is shown by studying the asymptotic behavior of the coupled stochastic master equations describing the evolutions of the actual state and the estimated one under appropriate assumptions on the feedback controller. We provide sufficient conditions on the feedback controller and a valid domain of estimated parameters which ensure exponential stabilization of the coupled system. Furthermore, our results allow us to answer positively to [2, Conjecture 4.4] in the case of spin-1/2 systems with unknown initial states, even in presence of imprecisely known physical parameters.
|
|
17:00-17:15, Paper WeB17.4 | Add to My Program |
Continuous Measurement and Feedback Control for Enhancement of Quantum Synchronization |
|
Kato, Yuzuru | Tokyo Institute of Technology |
Nakao, Hiroya | Tokyo Institute of Technology |
Keywords: Quantum information and control
Abstract: We analyze synchronization of a quantum van der Pol (vdP) oscillator driven by a harmonic signal and demonstrate that performing continuous homodyne measurement on an additional bath linearly coupled to the oscillator and applying a feedback control to the oscillator can enhance quantum synchronization. We show that the phase coherence of the oscillator is increased by the reduction of quantum fluctuations due to the continuous measurement, but also that the measurement backaction inevitably induces fluctuations around the phase-locking point. We propose a simple feedback policy that can suppress the measurement-induced fluctuations by adjusting the frequency detuning between the oscillator and the driving signal, which leads to enhancement of quantum synchronization. We further demonstrate that the maximum phase coherence can be achieved by performing the quantum measurement on the quadrature angle at which the phase diffusion of the oscillator takes the largest value and the maximum information of the oscillator phase is attained.
|
|
17:15-17:30, Paper WeB17.5 | Add to My Program |
When Do Additional Quantum Noises Affect Controller Performance? |
|
Thien, Rebbecca Tze Yean | Australian National University |
Vuglar, Shanon Leigh | Princeton University |
Petersen, Ian R. | Australian National University |
Keywords: Quantum information and control
Abstract: The introduction of quantum noise in a system is necessary for realizing it as physical quantum system. In general, quantum noise can be detrimental to the performance of a coherent quantum feedback control system. This paper gives a necessary and sufficient condition for when a quantum system corresponding to a given Linear Time Invariant~(LTI) controller can be made physically realizable in the presence of both direct feedthrough quantum vacuum noise and additional quantum vacuum noise such that the additional quantum noise has no effect on the controller output. Furthermore, an example is presented to illustrate our result.
|
| |