Trans. Interact. Intell. Syst. 7, 3, Article 11 (September 
2017), 40 pages. https://doi.org/10.1145/2912150. 
Budakova, D., Dakovski, L., 2019. Smart shopping system. 
8th International scientific conference  (TechSys’19). 
Plovdiv, Bulgaria, 16-18 May 2019. 
doi:10.1088/issn.1757- 899X; ISSN: 1757-899X; 
ISSN: 1757-8981.  
Sutton, R. S., Barto, A. G., 2014. Reinforcement Learning: 
An Introduction. MIT Press, Cambridge, London, 
England, [Online]. Available: 
http://incompleteideas.net/book/ebook/the-book.html. 
Gosavi, A., 2009. Reinforcement Learning: A Tutorial 
Survey and Recent Advances. INFORMS Journal on 
Computing. Vol. 21 No.2, pp. 178-192, 2009. 
Torrado, R. R., Bontrager, Ph., Togelius, Liu, J. J. and 
Perez-Liebana, D., 2018. Deep Reinforcement 
Learning for General Video Game AI. IEEE 
Conference on Computatonal Intelligence and Games. 
CIG, 10.1109/CIG.2018.8490422. 
Argall, B. D., 2009. Learning Mobile Robot Motion Control 
from Demonstration and Corrective Feedback. Thesis. 
Robotics Institute Carnegie Mellon University 
Pittsburgh, PA 15213, 172. 
Amor, H. B., Vogt D., Ewerton M., Berger, E., Jung, B., 
Peters, J., 2013. Learning Responsive Robot Behavior 
by Imitation. IEEE/RSJ International Conference on 
Intelligent Robots and Systems (IROS 2013). IEEE, 
Japan, 3257-3264. 
Takahashi, K., Kim, K., Ogata, T., Sugano, S., 2017. Tool-
body assimilation model considering grasping motion 
through deep learning. Robotics and Autonomous 
Systems. Elsevier, Volume 91, 115–127. 
Moffaert, K. V., 2016. Multi-Criteria Reinforcement 
Learning for Sequential Decision Making Problems, 
Dissertation for the degree of Doctor of Science: 
Computer Science, Brussels University Press, ISBN 
978 90 5718 094 1. 
Moffaert, K. V., Nowé, A., 2014. Multi-objective 
reinforcement learning using sets of pareto dominating 
policies.  Journal of Machine Learning Research, 
15:3483–3512. 
Natarajan, S., Tadepalli, P., 2005. Dinamic Preferences in 
Multi-Criteria Reinforcement Learning. 22nd 
International Conference on Machine Learning. Bonn, 
Germany. 
Gunantara, N., 2018. A review of multi-objective 
optimization: Methods and its applications. Cogent 
Engineering, 5(1),  1502242. 
https://doi.org/10.1080/23311916.2018.1502242 
Cho, J., Wang, Y., Chen, I., Chan, K. S., Swami A., 2017, 
"A Survey on Modeling and Optimizing Multi-
Objective Systems," in IEEE Communications Surveys 
& Tutorials, vol. 19, no. 3, pp. 1867-1901, third quarter 
2017, doi: 10.1109/COMST.2017.2698366. 
Vachhani, V. L.,  Dabhi V. K., Prajapati, H. B., 2015. 
"Survey of multi objective evolutionary algorithms," 
International Conference on Circuits, Power and 
Computing Technologies [ICCPCT-2015], Nagercoil, 
2015, pp. 1-9, doi: 10.1109/ICCPCT.2015.7159422. 
Budakova, D., Dakovski L., Petrova-Dimitrova, V., 2019. 
Smart Shopping Cart Learning Agents Development. 
19th IFAC-PapersOnLine, Conference on 
International Stability, Technology and Culture, 
(TECIS 2019). Volume 52, Issue 25, 26-28 September, 
64-69, Sozopol, Bulgaria, Elsevier ISSN 2405-
8963,https://doi.org/10.1016/j.ifacol.2019.12.447 
Budakova, D., Dakovski, L., Petrova-Dimitrova, V., 2019. 
Smart Shopping Cart Learning Agents. International 
journal on Advances in internet technology, IARIA, 
issn: 1942-2652, Vol. 12, nr 3&4. 109 – 121. 
Maslow, A. H., 1998. Motivation and Personality, 
Addison-Wesley Education Publishers, 2nd Edition, 
Paperback, 400 pages, ISBN: 0060442417 (ISBN13: 
9780060442415).