Research – Shie Mannor’s Homepage

Scientific objectives

I am in the business of being of professor because I want to understand how to act and make decisions in dynamic, complex and uncertain environments. In plain language, I want to build machines (e.g., software agents) that learn, evolve, and improve over time. I work mostly in machine learning, but also in certain application domains.

Reinforcement learning
High dimensional statistics and learning
Uncertainty and risk in decision making
Learning and modeling dynamics from data
Systems that include multiple decision makers: Multi-agent/distributed/many players/adaptive systems

See my Publications page for more details.

More specific research interests

Machine Learning (theory, algorithms, and applications). High-dimensional problems with uncertainty in the data and modeling and learning dynamics (e.g., networks).
Reinforcement Learning and Markov decision processes. Theory and application of Markov decision processes. I have worked quite a bit on adaptive control and learning algorithms for (large) stochastic systems in what is known as reinforcement learning.
Learning, optimization and control under uncertainty. Robust and stochastic optimization and statistical analysis of such approaches.
Games. Stochastic, dynamic, network, and differential games; applications in networks and resource sharing.
Multi-agent systems. Especially learning in such systems (e.g., online learning and learning in games). The goal here is to design economic systems (e.g., markets) where equilibrium is also a good social outcome.
Optimization of large scale problems. Especially combinatorial optimization using heuristic and statistical methods (e.g., the Cross Entropy method) and stochastic optimization.
Power Grid. Especially in reliability, pricing, and decision making in large-scale power grids (smart grids). My approach is very much data-driven: I try to understand the actual dynamics of the grid so that I can propose concrete policies for control of the grid, as well as evaluate market mechanisms and anomalies. See, for example, the EU funded GARPUR project that looks at probabilistic reliability models for large-scale grids.
Applications. I am interested and have worked (i.e., got to a semi-commercial prototype at least or plan to) on the following eclectic list of applications: large-scale communication network optimization, power management for laptops, adaptive compression of large data bases, a learning agent for combat planes simulator, cognitive radio networks, human activity recognition and context identification on mobiles, stochastic approaches to decoding of LDPC codes.

Open Positions (updated: September 4th, 2019)

I am looking for a postdoc and a couple of graduate students to join my team. Please consider that working with me requires very strong mathematical skills and/or true hacking capabilities. Email me your resume and a brief explanation of what you want to do if you are interested.