Conference on Knowledge Discovery, Knowledge 
Engineering and Knowledge Management,  Vol.  2, 
Nov. 2020. SCITEPRESS. 313 – 320.  
Hasselt, H., Guez, A., Silver, D., 2015. Deep reinforcement 
learning  with  double  Q-learning. In  AAAI 2015, 29
th
 
AAAI Conference on Artificial Intelligence, Jan. 2015. 
2094–2100. 
Huang,  Z.,  Xu,  X.,  He,  H.,  Tan,  J.,  Sun,  Z.,  2017. 
Parameterized  batch  reinforcement  learning  for 
longitudinal control of autonomous land vehicles. IEEE 
Transactions on Systems, Man, and Cybernetics: 
Systems 49(4), 730 – 741. 
Huynh, T., Zelinka, I., Pham, H., Nguyen, H.D. 2019. Some 
measures  to  Detect  the  Influencer  on  Social  Network 
Based on Information Propagation. In WIMS 2019, 9
th
 
International Conference on Web Intelligence, Mining 
and Semantics, June 2019. ACM.  
Kouris, A., Venieris, S., Rizakis, M., Bouganis, C., 2020. 
Approximate LSTMs for Time-Constrained Inference: 
Enabling  Fast  Reaction  in  Self-Driving  Cars.   IEEE 
Consumer Electronics Magazine 9(4), 11 – 26. 
Lin,  L.J.,  1992.  Self-improving  reactive  agents  based  on 
reinforcement  learning,  planning  and  teaching. 
Machine Learning 8(3 – 4), 293–321. 
Lucarelli,  G.,  Borrotti,  M.,  2020.  A  deep  Q-learning 
portfolio  management  framework  for  the 
cryptocurrency  market. Neural Comput & 
Applic 32, 17229–17244. 
Marina, L., Sandu, A. 2017.  Deep reinforcement learning 
for autonomous vehicles  - State of  the  art, Bulletin of 
the Transilvania University of Braşov  10(59),  195  – 
202. 
Metacar, 2021. https://metacar.scottpletcher.guru/ (Access 
on 08 March 2021) 
Min,  K.,  Kim,  H.,  Huh,  K.,  2019.  Deep  distributional 
reinforcement learning based high level driving policy 
determination.  IEEE Transactions on Intelligent 
Vehicles 4(3), 416 – 424. 
Nguyen, H., Huynh, T., Hoang, S., Pham, V., Zelinka, I., 
2020a. Language-oriented Sentiment Analysis based on 
the  grammar  structure  and  improved  Self-attention 
network.  In  ENASE 2020, 15th International 
Conference on Evaluation of Novel Approaches to 
Software Engineering, May 2020. SCITEPRESS. 339-
346. 
Nguyen, H.D., Tran, D., Do, H., Pham, V., 2020b. Design 
an intelligent system to automatically tutor the method 
for  solving  problems.  International Journal of 
Integrated Engineering (IJIE) 12(7), 211 – 223. 
Nguyen,  H.,  Tran,  V.,  Pham,  V.,  Nguyen,  H.D.,  2021. 
Design  a  learning  model  of  mobile  vision  to  detect 
diabetic  retinopathy  based  on  the  improvement  of 
MobileNetV2.  Int. J. Digital Enterprise Technology 
(IJDET), in publishing. 
Perez, G., Guerrero, J., Olivas, E., Ballester, E., Palomares, 
A.,  Casariego,  N.  2009.  Assigning  discounts  in  a 
marketing  campaign  by  using  reinforcement  learning 
and neural networks. Expert Systems with Applications 
36(4), 8022-8031.  
Peters,  J.,  Schaal,  S.,  2008.  Reinforcement  learning  of 
motor  skills  with  policy  gradients.  Neural Networks 
21(4), 682-697. 
Pham,  X.T,  Tran,  T.V,  Nguyen-Le,  V.T,  Pham,  V.T., 
Nguyen,  H.D.  2020.  Build  a  search  engine  for  the 
knowledge  of  the  course  about  Introduction  to 
Programming based on ontology Rela-model. In KSE, 
12
th
 International Conference on Knowledge and 
Systems Engineering, Nov. 2020. IEEE. 207 – 212. 
Sehnke,  F.,  Osendorfer,  C.,  Rückstiess,  T.,  Graves,  A., 
Peters, J., Schmidhuber, J., 2010. Parameter-exploring 
policy gradients. Neural Networks 23(2), 551 - 559. 
Silver, D., Lever, G., Heess, N., Degris, T., Wierstra, D., 
Riedmiller,  M.,  2014.  Deterministic  policy  gradient 
algorithms.  In  ICML 2014, 31
st
 International 
Conference on Machine Learning, vol.  32,  June 2014. 
387–395. 
Sutton,  R.,  Barto,  A.  2015.  Reinforcement Learning: An 
Introduction.  MIT  Press,  Cambridge,  Massachusetts, 
USA, 2
nd
 edition. 
Talamini, J., Bartoli, A., De Lorenzo, A., Medvet, E. 2020. 
On  the  Impact  of  the  Rules  on  Autonomous  Drive 
Learning. Appl. Sci. 10(7), 2394. 
United  States  Environmental  Protection  Agency  (EPA), 
2018.  Sources  of  Greenhouse  Gas  Emissions 
https://www.epa.gov/ghgemissions/sources-
greenhouse-gas-emissions (Access on 08 March 2021) 
Unity  ML-Agents  Highway,  2021. 
https://github.com/MLJejuCamp2017/DRL_based_Sel
fDrivingCarControl (Access on 08 March 2021). 
Watkins,  C.,  Dayan,  P.,  1992.  Q-learning.  Machine 
Learning 8(3), 279–292. 
WHO.  2020.  https://www.who.int/news-room/fact-
sheets/detail/the-top-10-causes-of-death  (Published on 
09 Dec. 2020).