This is an online seminar that presents the latest advances in reinforcement learning applications and theory.

Tech companies like Google, Baidu, Alibaba, Apple, Amazon, Facebook, Tencent, and Microsoft are now actively working on deep learning methods to improve their products.

Automatic tasks decomposition and discovery. (2019).

Offered by IBM.

Deep Reinforcement Learning
Zheng Wang, Cheng Long, Gao Cong, Yiding Liu
School of Computer Science and Engineering, Nanyang Technological University, Singapore
{wang zheng, c.long, gaocong, ydliu}@ntu.edu.sg
ABSTRACT Similar trajectory search is a fundamental problem and has been well studied over the past two decades.

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications.

IEEE Trans. Neural Netw.

Doctoral thesis, Nanyang Technological University, Singapore.

Invited speakers.

reinforcement-learning spring chatbot generative-adversarial-network gan policy-gradient seq2seq image-generation sequence-to-sequence chat-bot ntu deep-q-network text-to-image actor-critic video-captioning 2018 chinese-chatbot hung-yi-lee mlds2018spring mlds

And, multimodal data from various application domains (e.g., Omics, Bioimaging, Medical Imaging, and [Brain/Body]-Machine Interfaces) are piling up which require novel data-intensive machine learning techniques.

Dr. Xu Yan
Position: Nanyang Assistant Professor, School of Electrical and Electronic Engineering
Concurrent position: Cluster Director (Smart Grid and Microgrid), Energy Research Institute @ NTU (ERI@N)
Email: xuyan@ntu.edu.sg
Office: S2-B2c-111
Office Phone: (+65) 6790-4508
Dr Xu received his B.E.
The complexity increases when the agents carrying out the operation must adapt to changing conditions or uncertainties in the environment and learn incrementally from experience.

Intelligent Reflecting Surface Assisted Anti-Jamming Communications: A Fast Reinforcement Learning Approach.

Reinforcement Learning is a subfield of Machine Learning, but is also a general purpose formalism for automated decision-making and AI.

and Ph.D. degrees from National Taiwan University (NTU), Taipei, Taiwan, in 2010 and 2012, respectively.

Hierarchical reinforcement learning (HRL) is a promising …

This project aims to propose efficient resource allocation algorithms based on DRL for 5G-enabled wireless networks.

Different models of reinforcement learning are applied for comparison.

is a novel multi-agent cooperative reinforcement learning structure.

Yen-Yu Chang is a master's student in the Electrical Engineering Department at Stanford University, working with Prof. Jure Leskovec and Prof. Pan Li. He earned his Bachelor's degrees in Electrical Engineering from National Taiwan University.

Deep Learning is a subset of Machine Learning that has applications in both Supervised and Unsupervised Learning, and is frequently used to power most of the AI applications that we use on a daily basis.

Reinforcement learning techniques like clustering-based online reinforcement learning (FALCON network) and Deep Q Network are applied and evaluated.

I am interested in the field of AI, focusing on reinforcement learning, imitation learning, and Embodied AI in a 3D environment.
At the collective or multi-agent level, a hierarchical command-and-control architecture is applied in which a Commander agent analyzes the overall situation based on the input provided by the Unit-level agents as they roam the environment.

Three different agents (Agent1, Agent2, Agent3) perform different tasks that depend on each other (e.g., explore the area/map, deliver objects to a victim, relocate the victim).

Every unit agent performs elementary tasks like navigation and survey according to the target assigned by the commander, while autonomously learning to improve its performance.

Intelligent robots operating as a team can improve the efficiency of crisis response, such as assisting search-and-rescue.

Prof. Thambipillai Srikanthan astsrikan@ntu.edu.sg

Please send me an email with your CV if you are interested.

These pages have been created for all Nottingham Trent University academics who offer teaching and learning to our students.

The philosophical foundations of AI ethics

School of Computer Science and Engineering, Nanyang Technological University
50 Nanyang Avenue, Singapore 639798
E-mail: yangliu AT ntu.edu.sg
Office Tel: +65-67906706
Fax: +65-67926559

I am currently a year 4 NTU EEE student.

Reinforcement Learning

The framework further implements a crisis detection and avoidance algorithm.

The device serves as the last point of connection between the two.
If you would like to learn more about him, …

It is shown that the MAOC method can learn an efficient coordination and allocation for different agents in the search and rescue task.

© IEEE holds the copyright of this work.

We invented a Reinforcement Learning Environment to describe the market behavior with technical analysis and finite rule-based action sets.

Network Termination Unit: A network termination unit (NTU) is a device that links the customer-premises equipment (CPE) to the public switched telephone network (PSTN).

In order to highlight an important idea noted in that post, in the RL framework, we have an agent that interacts with an environment and makes some discrete action.

Our goal is to bring you a virtual seminar (approximately) featuring the latest work in applying reinforcement learning methods in many exciting areas (e.g., health sciences, or two-sided markets).

I am looking for highly motivated Ph.D. students, research assistants, and post-doctoral researchers who have background and interests in the following research topics.

Doctoral thesis, Nanyang Technological University, Singapore. (2021).

The agents are made cooperative: they share their experiences and knowledge by developing Joint Situation Awareness, which supports and improves each individual agent's operation.

Theoretically, we present deep learning architectures for robust navigation in normal environments (e.g., man-made houses, roads) and complex environments (e.g., collapsed cities, or natural caves).

(2007-2011) degrees from Tianjin University, China, where I was supervised by Prof. Xiaohong Li and Prof. Zhiyong Feng.
Our work covers all aspects of NLP research, ranging from core NLP tasks to key downstream applications, and new machine learning methods.

Reinforcement learning (RL) is an effective learning technique for solving sequential decision-making problems.

Reinforcement learning is very flexible and can model a wide array of problems.

This workshop consists of 2 parts, theoretical and hands-on; each part should take around 1 hour. Jupyter Notebook will be needed for hands-on practice.

To enable more efficient search-and-rescue operations, the overall tasks can be decomposed hierarchically into sub-goals and sub-tasks such that they can be performed in parallel across various levels of control.

Sim Kuan Goh, Ngoc Phu Tran, Duc-Thinh Pham, Sameer Alam, Kurtulus Izzetoglu, and Vu Duong.

Reinforcement Learning. We consider a standard setup of reinforcement learning: an agent sequentially takes actions over a sequence of time steps in an environment, in order to maximize the cumulative reward.

IEEE Transactions on Wireless Communications.

Given a totally or partially unknown environment in the initial stage of operation, agents must learn cooperatively: they make collaborative decisions and adapt their behavior over time across different situations and environments to keep improving the overall payoff of the team.

This document is downloaded from DR‑NTU (https://dr.ntu.edu.sg) Nanyang Technological University, Singapore.
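
The standard reinforcement learning setup mentioned above (an agent acting over a sequence of time steps to maximize cumulative reward) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and does not come from any of the quoted works: the toy `ChainEnv` environment, the `always_right` policy, and the discount factor `gamma` are hypothetical.

```python
class ChainEnv:
    """Hypothetical toy environment: states 0..n-1 on a line.

    The agent starts in state 0 and receives reward 1.0 when it
    reaches the last state, which also ends the episode.
    """

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # action 1 moves right, any other action moves left (clipped to the chain)
        delta = 1 if action == 1 else -1
        self.state = max(0, min(self.n_states - 1, self.state + delta))
        done = self.state == self.n_states - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def run_episode(env, policy, max_steps=100):
    """Agent-environment loop: act, observe reward and new state, repeat."""
    state = env.reset()
    rewards = []
    for _ in range(max_steps):
        action = policy(state)
        state, reward, done = env.step(action)
        rewards.append(reward)
        if done:
            break
    return rewards


def discounted_return(rewards, gamma=0.9):
    """Cumulative reward G = sum over t of gamma**t * r_t."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def always_right(state):
    """Hypothetical fixed policy used only to exercise the loop."""
    return 1


rewards = run_episode(ChainEnv(n_states=5), always_right)
print(len(rewards), discounted_return(rewards, gamma=0.9))
```

A learning algorithm would replace `always_right` with a policy that is adjusted from the observed rewards; the loop itself stays the same.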
Number of steps until completion of the whole main Search & Rescue task for MAHRL (Multi-Agent Hierarchical Reinforcement Learning) without termination until the task is achieved, MAHRL with various fixed termination periods (every 100, 50, 10, and 5 steps), and the proposed adaptive termination with Multi-Agent Option Critic (MAOC).

Battery Management for Automated Warehouses via Deep Reinforcement Learning
Yanchen Deng 1, Bo An, Zongmin Qiu 2, Liuxi Li, Yong Wang 2, and Yinghui Xu 2
1 School of Computer Science and Engineering, Nanyang Technological University {ycdeng, boan}@ntu.edu.sg
2 Cainiao Smart Logistics Network …

Hence, a greater understanding of the theory can potentially impact many other fields, including control (via continuous extensions of RL), online learning (by modelling online learning as RL over a simple environment), and

duanjiafei@hotmail.sg…

I am also an A*STAR scholar looking to do a PhD in the field of robotics and reinforcement learning.

After that, the environment responds with a reward and a new state.

Hsuan-Tien Lin (NTU CSIE) Machine Learning Foundations

AIAA/IEEE Digital Avionics Systems Conference (DASC): Multi-aircraft Cooperative Conflict Resolution by Multi-agent Reinforcement Learning.

Flexible Learning: From September 2020, NTU will be offering a mix of online and on-campus learning.

This is an introductory workshop to Reinforcement Learning (RL).

Computational game theory
arXiv:2012.06834v1 [eess.SY] 12 Dec 2020

Deep Reinforcement Learning for Tropical Air Free-Cooled Data Center Control
DUC VAN LE, Computer Science and Engineering, Nanyang Technological University, Singapore
RONGRONG WANG, Computer Science and Engineering, Nanyang Technological University, Singapore
YINGBO LIU, Computer Science and Engineering, Nanyang Technological University…

This course aims to provide an introductory but broad perspective of machine learning fundamental methodologies, and show how to apply machine learning techniques to real-world applications.

Nanyang Technological University
Office: Blk N4, 02c-116, 50 Nanyang Ave, Singapore 639798
Tel: +65 67906277.

However, the task is still challenging when the environment is partially or totally unknown and exploration must be conducted efficiently to reduce interference among the agents that may affect the overall performance.

He worked with Prof. Ho-Lin Chen, Prof. Shou-De Lin, and Prof. Hung-Yi Lee during his undergraduate studies.

李宏毅 (Hung-yi Lee) received the M.S.

Based on 100x100 grid world.

2020 Best Paper Award - Best Paper Award (BPA) winner of ACM DroneCom 2020

Reinforcement Learning Day 2021 will provide an opportunity for different research communities to learn from each other and build on the latest knowledge in reinforcement learning and related disciplines.

Doctoral thesis, Nanyang Technological University, Singapore.

The structure is inspired by a solution concept in game theory called correlated equilibrium [1] in which the predefined signals received by the agents guide their actions.
Research in the Niv lab focuses on the neural and computational processes underlying reinforcement learning and decision-making.

Animal Unit. Housing over 250 animals and more than 70 species on an idyllic 200-hectare farm and woodland estate, there's no better environment for the study of small and larger animals than the animal unit at our Brackenhurst Campus.

His research interests include blockchain, edge/fog computing, Internet of Things (IoT), cyber-physical systems (CPS), signal processing, AI security, adversarial machine learning, federated learning, reinforcement learning, and data privacy.

Learning a chat-bot - Reinforcement Learning
• By this approach, we can generate a lot of dialogues.

Using option learning to learn how to switch from one (sub)task to another, or when to terminate it.

The main aim of the project is to develop a model of autonomous agents that can navigate and explore a dynamic real-time environment for search-and-rescue operations.

AIAA/IEEE Digital Avionics Systems Conference (DASC), IEEE.

It is relevant for anyone pursuing a career in AI or Data Science.

In this project, the work is focused on search-and-rescue tasks in an enclosed environment (like a building with walls, doors, furniture, rubble, debris, people, etc.)

This course introduces you to two of the most sought-after disciplines in Machine Learning: Deep Learning and Reinforcement Learning.
In particular, recent research in deep learning (DL), reinforcement learning (RL), and their combination (deep RL) promises to revolutionize the future of artificial intelligence.
