ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Browse by Author

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | No Grouping
Number of items: 66.

Book Chapter

Kolavali, Sudha Rani and Bhatnagar, Shalabh (2009) Ant Colony Optimization Algorithms for Shortest Path Problems. [Book Chapter]

Conference Paper

Lakshmanan, K and Bhatnagar, Shalabh (2012) A novel Q-learning algorithm with function approximation for constrained Markov decision processes. In: 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 1-5 Oct. 2012 , Monticello, IL, USA.

Ghoshdastidar, Debarghya and Dukkipati, Ambedkar and Bhatnagar, Shalabh (2012) q-Gaussian based Smoothed Functional Algorithms for Stochastic Optimization. In: IEEE International Symposium on Information Theory, JUL 01-06, 2012 , Cambridge, MA .

Prashanth, LA and Bhatnagar, Shalabh (2011) Reinforcement learning with average cost for adaptive control of traffic lights at intersections. In: 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), 5-7 Oct. 2011, Washington, DC, USA.

Lakshmanan, K and Bhatnagar, Shalabh (2011) Smoothed functional and Quasi-Newton algorithms for routing in multi-stage queueing network with constraints. In: ICDCIT'11 Proceedings of the 7th international conference on Distributed Computing and Internet Technology, 2011, Heidelberg.

Prashanth, LA and Bhatnagar, Shalabh and Desai, Nirmit and Prasad, HL and Dasgupta, Gargi (2011) Stochastic optimization for adaptive labor staffing in service systems. In: ICSOC'11 Proceedings of the 9th international conference on Service-Oriented Computing, 2011, Heidelberg.

Velusamy, Sudha and Bhatnagar, Shalabh and Basavaraja, S and Sridhar, V (2008) SPSA Based Feature Relevance Estimation For Video Retrieval. In: 2008 IEEE 10th Workshop on Multimedia Signal Processing, OCT 08-10, 2008, Cairns, Australia, pp. 602-607.

Reddy, Ramana G and Bhatnagar, Shalabh (2008) An Efficient and Optimized Bluetooth Scheduling Algorithm for Scatternets. In: Proceedings of IEEE ANTS 2008 (Advanced Networks and Tele-communication Systems), Mumbai, 15-17 Dec. 2008 , Mumbai.

Reddy, G Ramana and Bhatnagar, Shalabh and Rakesh, V and Chaturvedi, Vijay Prakash (2008) An efficient algorithm for scheduling in bluetooth piconets and scatternets. In: 2nd International Symposium on Advanced Networks and Telecommunication Systems, DEC 15-17, 2008, Mumbai.

Bhatnagar, Shalabh and Sutton, Richard S and Ghavamzadeh, Mohammad and Lee, Mark (2007) Incremental natural-gradient actor-critic algorithms. In: Proceedings of 21st Annual Conference on Neural Information Processing Systems (NIPS-2007), Vancouver, Canada,, Dec. 2007, Vancouver, Canada.

Mohan Babu, K and Bhatnagar, Shalabh (2007) Two-timescale Q-learning Algorithms with an Application to Routing in Networks. In: International Conference on Advances in Control and Optimization of Dynamical Systems, ACODS- Bangalore, Feb. 2007, Bangalore.

Mishra, Vivek and Bhatnagar, Shalabh and Hemachandra, N (2007) Discrete parameter simulation optimization algorithms with applications to admission control with dependent service times. In: 46th IEEE Conference on Decision and Control, DEC 12-14, 2007, New Orleans, LA.

Velusamy, Sudha and Gopal, Lakshmi and Sridhar, V and Bhatnagar, Shalabh (2007) Fuzzy Clustering Based Ad Recommendation for TV Programs. In: Proceedings of the Fifth European Conference, EuroITV (Published in Interactive TV: A Shared Experience, Eds. P.Cesar, K.Chorianopoulos and J.F.Jensen, LNCS 4471, Springer, 2007), Amsterdam, Netherlands, Amsterdam.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Network flow-control using asynchronous stochastic approximation. In: 46th IEEE Conference on Decision and Control, DEC 12-14, 2007, New Orleans, LA.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Parametrized actor-critic algorithms for finite-horizon MDPs. In: American Control Conference 2007, JUL 09-13, 2007, New York,.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Solving MDPs using two-timescale simulated annealing with multiplicative weights. In: American Control Conference 2007, JUL 09-13, 2007, New York, NY.

Chaturvedi, Vijay Prakash and Rakesh, V and Bhatnagar, Shalabh (2007) An efficient and optimized bluetooth scheduling algorithm for piconets. In: 4th International Conference on Distributed Computing and Internet Technology, DEC 17-20, 2007, Bangalore.

Vemu, Koteswara Rao and Bhatnagar, Shalabh and Hemachandra, N (2007) An optimal weighted-average congestion based pricing scheme for enhanced QoS, , LNCS 4882, 2007. In: ICDCIT'07 Proceedings of the 4th international conference on Distributed computing and internet technology , Dec. 17-20, 2007, Heidelberg.

Vemu, Koteswara Rao and Bhatnagar, Shalabh and Hemachandra, N (2007) An optimal weighted-average congestion based pricing scheme for enhanced QoS. In: 4th International Conference on Distributed Computing and Internet Technology, DEC 17-20, 2007, Bangalore, I.

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2006) An Actor-Critic Algorithm for Finite Horizon Markov Decision Processes. In: Proceedings of the 45th IEEE Conference on Decision & Control Manchester Grand Hyatt Hotel, December 13-15, 2006, San Diego, CA.

Sharma, Diksha and Bhatnagar, Shalabh (2006) Optimal Parameterized Policies for Resource Allocation in Communication Networks. In: Proceedings of IEEE International Conference on Signal and Image Processing, , 2006, Hubli, Karnataka.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2006) SPSA algorithms with measurement reuse. In: 2006 Winter Simulation Conference,, Dec 03-06, 2006, Monterey, CA,, pp. 319-327.

Sharma, Diksha and Bhatnagar, Shalabh and Chakraborty, Shyam (2006) An algorithm for dynamic optimal bandwidth allocation in communication networks. In: Proceedings of Fifth Asia Pacific International Symposium on Information Technology (APIS5), , 2006, Hangzhou, China.

Patro, Rajesh Kumar and Bhatnagar, Shalabh (2006) A four-timescale algorithm for constrained stochastic optimization of RED. In: 45th IEEE Conference on Decision and Control, Dec 13-15, 2006, San Diego, CA, pp. 1930-1935.

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2006) A reinforcement learning based algorithm for finite horizon Markov decision processes. In: 45th IEEE Conference on Decision and Control,, Dec 13-15, 2006, San Diego, CA, pp. 5519-5524.

Dukkipati, Ambedkar and Murty, Narasimha M and Bhatnagar, Shalabh (2005) Information theoretic justification of Boltzmann selection and its generalization to Tsallis case. In: IEEE Congress on Evolutionary Computation, 2-5 Sept, 2005, Monterey, CA, United States, pp. 1667-1674.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2005) Solution of MDPS using simulation-based value iteration. In: 2nd International Conference on Artificial Intelligence Applications and Innovations, SEP 07-09, 2005, Beijing.

Dukkipati, Ambedkar and Murty, Narasimha M and Bhatnagar, Shalabh (2004) Cauchy Annealing Schedule: An Annealing Schedule for Boltzmann Selection Scheme in Evolutionary Algorithms. In: 2004 Congress on Evolutionary Computation. CEC2004, 19-23 June, Portland,Origon, Vol.1, 55-62.

Panigrahi, Jnana Ranjan and Bhatnagar, Shalabh (2004) Hierarchical Decision Making in Semiconductor Fabs Using Multi-Time Scale Markov Decision Processes. In: 43rd IEEE Conference on Decision and Control, 2004. CDC, 14-17 December, Nassau,Bahamas, Vol.4, 4387-4392.

Viswanath, P and Murty, Narasimha M and Bhatnagar, Shalabh (2004) A Pattern Synthesis Technique with an Efficient Nearest Neighbor Classifier for Binary Pattern Recognition. In: 17th International Conference on Pattern Recognition, 2004. ICPR 2004, 23-26 August, Cambridge,UK, Vol.4, 416 -419.

Dukkipati, Ambedkar and Murty, Narasimha M and Bhatnagar, Shalabh (2003) Quotient Evolutionary Space: Abstraction of Evolutionary process w.r.t macroscopic properties. In: The 2003 Congress on Evolutionary Computation – CEC 2003, 8-12 December 2003, Canberra, Australia, pp. 846-853.

Conference Poster

Vemu, Koteswara Rao and Bhatnagar, Shalabh and Hemachandra, N (2007) Link route pricing for enhanced QoS. In: 46th IEEE Conference on Decision and Control, DEC 12-14, 2007, New Orleans, LA.

Journal Article

Bhatnagar, Shalabh and Borkar, Vivek S and Prabuchandran, KJ (2013) Feature Search in the Grassmanian in Online Reinforcement Learning. In: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 7 (5). pp. 746-758.

Vemu, Koteswara Rao and Bhatnagar, Shalabh and Hemachandra, N (2012) Optimal multi-layered congestion based pricing schemes for enhanced QoS. In: Computer Networks, 56 (4). pp. 1249-1262.

Bhatnagar, Shalabh and Lakshmanan, K (2012) An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. In: JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 153 (3). pp. 688-708.

Bhatnagar, Shalabh and Mishra, Vivek Kumar and Hemachandra, Nandyala (2011) Stochastic Algorithms for Discrete Parameter Simulation Optimization. In: IEEE Transactions on Automation Science and Engineering, 8 (4). pp. 780-793.

Bhatnagar, Shalabh (2011) The Borkar-Meyn theorem for asynchronous stochastic approximations. In: Systems & Control Letters, 60 (7). pp. 472-478.

Bhatnagar, Shalabh and Karmeshu, * (2011) Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems. In: Applied Mathematical Modelling, 35 (6). pp. 3063-3079.

Karmeshu, * and Bhatnagar, Shalabh and Mishra, Vivek Kumar (2011) An Optimized SDE Model for Slotted Aloha. In: IEEE Transactions on Communications, 59 (6). pp. 1502-1508.

Prashanth, LA and Bhatnagar, Shalabh (2011) Reinforcement Learning With Function Approximation for Traffic Signal Control. In: IEEE Transactions onIntelligent Transportation Systems, 12 (2, Sp.). pp. 412-421.

Bhatnagar, Shalabh and Hemachandra, N and Mishra, Vivek Kumar (2011) Stochastic Approximation Algorithms for Constrained Optimization via Simulation. In: ACM Transactions on Modeling and Computer Simulation, 21 (3).

Bhatnagar, Shalabh (2010) An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes. In: Systems & Control Letters, 59 (12). pp. 760-766.

Chakraborty, Anshuk and Bhatnagar, Shalabh (2010) Optimized Policies for the Retransmission Probabilities in Slotted Aloha. In: Simulation, 86 (4). pp. 247-261.

Bhatnagar, Shalabh and Sutton, Richard S and Ghavamzadeh, Mohammad and Lee, Mark (2009) Natural actor-critic algorithms. In: Automatica, 45 (11). pp. 2471-2482.

Bhatnagar, Shalabh and Patro, Rajesh Kumar (2009) A Proof of Convergence of the B-RED and P-RED Algorithms for Random Early Detection. In: IEEE Communications letters, 13 (10). pp. 809-811.

Bhatnagar, Shalabh and Karmeshu, M and Mishra, Vivek Kumar (2009) Optimal Parameter Trajectory Estimation in Parameterized SDEs: An Algorithmic Procedure. In: ACM Transactions on Modeling and Computer Simulation, 19 (2). 8-8:26.

Patro, Rajesh Kumar and Bhatnagar, Shalabh (2009) A probabilistic constrained nonlinear optimization framework to optimize RED parameters. In: Performance Evaluation, 66 (2). pp. 81-104.

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2008) Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes. In: Simulation- Transactions of the Society for Modeling and Simulation international, 84 (12). pp. 577-600.

Velusamy, Sudha and Gopal, Lakshmi and Bhatnagar, Shalabh and Varadarajan, Sridhar (2008) An efficient ad recommendation system for TV programs. In: Multimedia Systems, 14 (2). pp. 73-87.

Bhatnagar, Shalabh and Babu, K Mohan (2008) New algorithms of the Q-learning type. In: Automatica, 44 (4). pp. 1111-1119.

Dukkipati, Ambedkar and Bhatnagar, Shalabh and Murty, Narasimha M (2007) On measure-theoretic aspects of nonextensive entropy functionals and corresponding maximum entropy prescriptions. In: Physica A: Statistical Mechanics and its Applications, 384 (2). pp. 758-774.

Abdulla, Mohammed Shahid and Bhatnagar, Shalabh (2007) Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. In: Discrete Event Dynamic Systems - Theory and Applications, 17 (1). pp. 23-52.

Bhatnagar, Shalabh (2007) Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization. In: ACM Transactions on Modeling and Computer Simulation, 18 (1). 2:1-2:35.

Vaidya, Rahul and Bhatnagar, Shalabh (2006) Robust optimization of Random Early Detection. In: Telecommunication Systems, 33 (4). pp. 291-316.

Viswanath, P and Murty, Narasimha M and Bhatnagar, Shalabh (2006) Partition based pattern synthesis technique with efficient algorithms for nearest neighbor classification. In: Pattern recognition Letters, 27 (14). pp. 1714-1724.

Bhatnagar, Shalabh and Borkar, Vivek S and Akarapu, Madhukar (2006) A Simulation-Based Algorithm for Ergodic Control of Markov Chains Conditioned on Rare Events. In: Journal of Machine Learning Research, 7 . pp. 1937-1962.

Bhatnagar, Shalabh and Panigrahi, Ranjan J (2006) Actor-critic algorithms for hierarchical Markov decision processes. In: Automatica, 42 (4). pp. 637-644.

Dukkipati, Ambedkar and Murty, Narasimha M and Bhatnagar, Shalabh (2006) Nonextensive triangle equality and other properties of Tsallis relative-entropy minimization. In: Physica A-Statistical Mechanics And Its Applications, 361 (1). pp. 124-138.

Bhatnagar, Shalabh and Kowshik, Hemant J (2005) A Discrete Parameter Stochastic Approximation Algorithm for Simulation Optimization. In: SIMULATION, 81 (11). pp. 757-772.

Viswanath, P and Murty, Narasimha M and Bhatnagar, Shalabh (2005) Overlap pattern synthesis with an efficient nearest neighbor classifier. In: Pattern Recognition, 38 (8). pp. 1187-1195.

Bhatnagar, Shalabh and Reddy, I Bala Bhaskar (2005) Optimal threshold policies for admission control in communication networks via discrete parameter stochastic approximation. In: Telecommunication Systems, 29 (1). pp. 9-31.

Bhatnagar, Shalabh (2005) Adaptive Multivariate Three-Timescale Stochastic Approximation Algorithms for Simulation Based Optimization. In: ACM Transactions on Modeling and Computer Simulation, 15 (1). pp. 74-107.

Bhatnagar, Shalabh and Kumar, Shishir (2004) A Simultaneous Perturbation Stochastic Approximation-Based Actor-Critic Algorithm for Markov Decision Processes. In: IEEE Transactions on Automatic Control, 49 (4). pp. 592-598.

Bhatnagar, Shalabh and Fu, Michael C and Marcus, Steven I and Wang, I-Jeng (2003) Two-Timescale Simultaneous Perturbation Stochastic Approximation Using Deterministic Perturbation Sequences. In: ACM Transactions on Modeling and Computer Simulation, 13 (2). pp. 180-209.

Bhatnagar, Shalabh and Borkar, Vivek S (1998) A two timescale stochastic approximation scheme for simulation-based parametric optimization. In: Probability in the Engineering and Informational Sciences, 12 (4). pp. 519-531.

Bhatnagar, Shalabh and Borkar, Vivek S (1997) Multiscale stochastic approximation for parametric optimization of hidden Markov models. In: Probability in the Engineering and Informational Sciences, 11 (4). pp. 509-522.

This list was generated on Wed Jul 23 22:18:28 2014 IST.