As our daily lives increasingly depend on technology, the security and privacy of that technology becomes ever more important. Gain a critical understanding of the technological needs, threats, and ...
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Abstract Reinforcement learning (RL) is a popular paradigm for addressing sequential decision tasks in which the agent has only limited environmental feedback. Despite many advances over the past ...
Recent work has shown that deep neural networks are capable of approximating both value functions and policies in reinforcement learning domains featuring continuous state and action spaces. However, ...
BibTeX Entry @InProceedings {SDMIA15-Hausknecht, author = {Matthew Hausknecht and Peter Stone}, title = {Deep Recurrent Q-Learning for Partially Observable MDPs}, booktitle = {AAAI Fall Symposium on ...
A critical bottleneck limiting imitation learning in robotics is the lack of data. This problem is more severe in mobile manipulation, where collecting demonstrations is harder than in stationary ...
PhD students Dongmyeong Lee and Zifan Xu prepare their child-sized humanoid robots before a match at RoboCup 2025 in Salvador ...
Lili Qiu, professor of Computer Science and a leader in AI systems, wireless sensing, and real-world intelligence, was awarded a Test-of-Time award by ACM SIGMOBILE for her 2003 paper: Impact of ...
Interests: Computer architecture, reconfigurable logic, computing technologies, ubiquitous and wearable computing, privacy and personal data management, mobile devices, cloud services, datacenter ...
This note is devoted to three rules, the following of which is necessary if you want to be successful in scientific research. (If you manage to follow them, they will prove close to sufficient, but ...
Books by Robert A. van de Geijn Robert A. van de Geijn and Enrique S. Quintana-Orti, "The Science of Programming Matrix Computations." In preparation. (Last updated Sept. 10, 2005.) Using PLAPACK: ...