ePrints@IISc

Browse by Author

Number of items: 8.

Conference Proceedings

Joseph, Ajin George and Bhatnagar, Shalabh (2017) An Incremental Fast Policy Search Using a Single Sample Path. In: PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, DEC 05-08, 2017, Kolkata, INDIA, pp. 3-10.

Joseph, Ajin George and Bhatnagar, Shalabh (2017) Bounds for Off-policy Prediction in Reinforcement Learning. In: International Joint Conference on Neural Networks (IJCNN), MAY 14-19, 2017, Anchorage, AK, pp. 3991-3997.

Joseph, Ajin George and Bhatnagar, Shalabh (2017) A Model based Search Method for Prediction in Model-free Markov Decision Process. In: International Joint Conference on Neural Networks (IJCNN), MAY 14-19, 2017, Anchorage, AK, pp. 170-177.

Joseph, Ajin George and Bhatnagar, Shalabh (2016) A Randomized Algorithm for Continuous Optimization. In: Winter Simulation Conference (WSC), DEC 11-14, 2016, Arlington, VA, pp. 907-918.

Joseph, Ajin George and Bhatnagar, Shalabh (2016) Revisiting the Cross Entropy Method with Applications in Stochastic Global Optimization and Reinforcement Learning. In: 22nd European Conference on Artificial Intelligence (ECAI), AUG 29-SEP 02, 2016, Hague, NETHERLANDS, pp. 1026-1034.

Joseph, Ajin George and Bhatnagar, Shalabh (2016) A Stochastic Approximation Algorithm for Quantile Estimation. In: 22nd International Conference on Neural Information Processing (ICONIP), NOV 09-12, 2015, Istanbul, TURKEY, pp. 311-319.

Journal Article

Joseph, Ajin George and Bhatnagar, Shalabh (2018) An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method. In: MACHINE LEARNING, 107 (8-10). pp. 1385-1429.

Joseph, Ajin George and Bhatnagar, Shalabh (2018) An incremental off-policy search in a model-free Markov decision process using a single sample path. In: MACHINE LEARNING, 107 (6). pp. 969-1011.

This list was generated on Sun Dec 8 08:44:34 2019 IST.