Survey on combination of Nature Language Processing and Reinforcement Learning algorithms
DOI:
https://doi.org/10.17721/1812-5409.2024/1.25Keywords:
Natural Language Processing, Reinforcement Learning, Robotics, Computer VisionAbstract
The integration of NLP and RL has gained significant attention in recent years, as it holds the potential to enhance the capabilities of various applications, ranging from language understanding and generation to dialogue systems and autonomous agents. The incorporation of RL into NLP algorithms enhances language-related tasks by enabling adaptation and learning from interactions and feedback. This integration proves valuable in scenarios where language understanding and generation require dynamic and context-dependent responses, contributing to improved real-world performance. The survey explores the challenges and opportunities in fusing NLP and RL. Furthermore, it investigates the impact of different RL paradigms applications on NLP algorithms performance and combination of NLP and RL in more complex systems like simulated or real world navigation, which also includes usage of Computer Vision subsystems. In addition to reviewing existing research results, the paper identifies potential avenues for future research and development in the field.
Pages of the article in the issue: 137 - 140
Language of the article: English
References
Sutton, Richard S., and Andrew G. Barto. Reinforcement learning: An introduction. MIT press, 2018. https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf
Uc-Cetina, Victor, et al. "Survey on reinforcement learning for language processing." Artificial Intelligence Review 56.2 (2023): 1543-1575. https://arxiv.org/abs/2104.05565
Paulus, Romain, Caiming Xiong, and Richard Socher. "A deep reinforced model for abstractive summarization." (2017). https://arxiv.org/abs/1705.04304
Xiong, Caiming, Victor Zhong, and Richard Socher. "Dcn+: Mixed objective and deep residual coattention for question answering." https://arxiv.org/abs/1711.00106
Li, Jiwei, et al. "Deep reinforcement learning for dialogue generation.” https://arxiv.org/abs/1606.01541
Mnih, Volodymyr, et al. "Playing atari with deep reinforcement learning." https://arxiv.org/abs/1312.5602
Xiong, Caiming, Victor Zhong, and Richard Socher. "Dynamic coattention networks for question answering." https://arxiv.org/abs/1611.01604
Goodfellow, Ian, et al. "Maxout networks." International conference on machine learning. PMLR, 2013. https://arxiv.org/abs/1302.4389
Srivastava, Rupesh K., Klaus Greff, and Jürgen Schmidhuber. "Training very deep networks." Advances in neural information processing systems 28 (2015). https://arxiv.org/abs/1507.06228
Brown, Tom, et al. "Language models are few-shot learners." (2020) https://arxiv.org/abs/2005.14165
Radford, Alec, et al. "Improving language understanding by generative pre-training." (2018). https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
Anisimov, A.V., Marchenko, O.O. & Zemlianskyi, V.R. Evolutionary Method of Constructing Artificial Intelligence Systems. Cybern Syst Anal 55, 1–9 (2019). https://doi.org/10.1007/s10559-019-00106-x
Anisimov, A.V., Marchenko, A.A. & Zemlianskyi, V.R. Influence of Language on the Lifespan of Populations of Artificial Intelligence. Cybern Syst Anal 57, 669–675 (2021). https://doi.org/10.1007/s10559-021-00392-4
Ammanabrolu, Prithviraj, et al. "How to motivate your dragon: Teaching goal-driven agents to speak and act in fantasy worlds." https://arxiv.org/abs/2010.00685
Colas, Cédric, et al. "Language as a cognitive tool to imagine goals in curiosity driven exploration." Advances in Neural Information Processing Systems 33 (2020): 3761-3774. https://arxiv.org/abs/1711.07280
Hemachandra, Sachithra, et al. "Learning models for following natural language directions in unknown environments." https://arxiv.org/pdf/1503.05079.pdf
Matuszek, Cynthia. "Grounded language learning: Where robotics and nlp meet (invited talk)." https://www.acl web.org/anthology/D18-1355.pdf
Anderson, Peter, et al. "Vision-and-language navigation: Interpreting visually-grounded navigation instructions in real environments." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. https://par.nsf.gov/servlets/purl/10066404
Rennie, Steven J., et al. "Self-critical sequence training for image captioning." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017. https://arxiv.org/abs/1612.00563
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Ruslan Pravosud
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).