EMPATHICED: UM CHATBOT EDUCACIONAL COM RAG E COMPUTAÇÃO AFETIVA EM UM CONTEXTO DE INICIAÇÃO CIENTÍFICA COORDENADA

Nilo Sergio Maziero Petrin; João Carlos Néto

doi:10.56238/arev8n3-129

Authors

Nilo Sergio Maziero Petrin Author
João Carlos Néto Author

DOI:

https://doi.org/10.56238/arev8n3-129

Keywords:

Educational Chatbots, Retrieval-Augmented Generation, Affective Computing, Coordinated Scientific Initiation, Brazilian Higher Education, Generative Artificial Intelligence

Abstract

Educational chatbots built on large language models face a persistent challenge: delivering fluent interaction without sacrificing factual accuracy, traceability, and institutional responsibility. At the same time, Undergraduate Research initiatives in Brazil are often organized around highly individualized models, which may constrain collaborative technological learning. This paper presents two complementary contributions. First, it introduces EmpathicEd, a modular architecture that combines a multi-stage Retrieval-Augmented Generation (RAG) pipeline with a socio-emotional layer based on multilingual BERT-like models, aiming to produce grounded, contextualized, and emotionally appropriate responses. Second, it proposes a replicable coordinated undergraduate research methodology organized around parallel teams, biweekly sprints, continuous documentation, and progressive pedagogical scaffolding over a ten-month period. The initial validation suggests promising results for both retrieval quality and emotional classification, while also making visible relevant methodological limitations such as the small pedagogical sample, the lack of full baselines, and the reliance on a proprietary institutional corpus. The paper argues that the combination of RAG, evidence governance, and socio-emotional adaptation is a relevant path for designing more trustworthy educational conversational agents in Brazilian higher education settings.

Downloads

Download data is not yet available.

References

BENDER, E. M.; GEBRU, T.; McMILLAN-MAJOR, A.; SHMITCHELL, S. On the dangers of stochastic parrots: can language models be too big? In: ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY (FAccT), 2021, Nova York. Proceedings… Nova York: ACM, 2021. p. 610-623.

BOCKLISCH, T.; FAULKNER, J.; PAWLOWSKI, N.; NICHOL, A. Rasa: open source language understanding and dialogue management. arXiv preprint, arXiv:1712.05181, 2017.

BROWN, T. B.; MANN, B.; RYDER, N.; SUBBIAH, M.; KAPLAN, J.; DHARIWAL, P.; NEELAKANTAN, A.; SHYAM, P.; SASTRY, G.; ASKELL, A. Language models are few-shot learners. In: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33 (NeurIPS), 2020. Proceedings… 2020. p. 1877-1901.

CARBONELL, J.; GOLDSTEIN, J. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1998, Melbourne. Proceedings… Melbourne: ACM, 1998. p. 335-336.

CHAVES, A. P.; GEROSA, M. A. How should my chatbot interact? A survey on social characteristics in human-chatbot interaction design. International Journal of Human-Computer Interaction, v. 37, n. 8, p. 729-758, 2021.

COGHLAN, S.; D'ALFONSO, S.; TEAHAN, J.; VINES, K.; HINE, K.; PATERSON, H.; SHARP, G.; CHRISTIE, A. To chat or bot to chat: ethical issues with using chatbots in mental health. Digital Health, v. 9, 2023.

CRESWELL, J. W. Research design: qualitative, quantitative, and mixed methods approaches. 4. ed. Thousand Oaks: SAGE Publications, 2014.

DE GENNARO, M.; KRUMHUBER, E. G.; LUCAS, G. Effectiveness of an empathic chatbot in combating adverse effects of social exclusion on mood. Frontiers in Psychology, v. 10, artigo 3061, 2020.

GHANDEHARIOUN, A.; MCDUFF, D.; CZERWINSKI, M.; ROWAN, K. EMMA: an emotion-aware wellbeing chatbot. In: 2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), Cambridge, United Kingdom, 2019. Proceedings… [S.l.]: IEEE, 2019. p. 1-7. Disponível em: https://arxiv.org/abs/1812.11423. Acesso em: 18 mar. 2026.

GIL, A. C. Como elaborar projetos de pesquisa. 5. ed. São Paulo: Atlas, 2010.

JI, Z.; LEE, N.; FRIESKE, R.; YU, T.; SU, D.; XU, Y.; ISHII, E.; BANG, Y.; MADOTTO, A.; FUNG, P. Survey of hallucination in natural language generation. ACM Computing Surveys, v. 55, n. 12, artigo 248, 2023.

JOHNSON, J.; DOUZE, M.; JÉGOU, H. Billion-scale similarity search with GPUs. IEEE Transactions on Big Data, v. 7, n. 3, p. 535-547, 2021.

KHAN, A.; BROMAN, D.; DURUMERIC, Z. Developing retrieval augmented generation (RAG) based LLM systems from PDFs: an experience report. arXiv preprint, arXiv:2407.15804, 2024.

KHATTAB, O.; ZAHARIA, M. ColBERT: efficient and effective passage search via contextualized late interaction over BERT. In: ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2020. Proceedings… [S.l.]: ACM, 2020. p. 39-48.

LEWIS, P.; PEREZ, E.; PIKTUS, A.; PETRONI, F.; KARPUKHIN, V.; GOYAL, N.; KÜTTLER, H.; LEWIS, M.; YIH, W.; ROCKTÄSCHEL, T.; RIEDEL, S.; KIELA, D. Retrieval-augmented generation for knowledge-intensive NLP tasks. In: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33 (NeurIPS), 2020. Proceedings… 2020.

MA, X.; GONG, Y.; HE, P.; ZHAO, H.; DUAN, N. Query rewriting for retrieval-augmented large language models. In: CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2023, Singapura. Proceedings… Singapura: ACL, 2023. p. 5303-5315.

MONTEIRO, Guilherme Souza. Helena: um chatbot para auxílio dos discentes do DECOM em trâmites universitários. 2021. 59 f. Monografia (Graduação em Ciência da Computação) — Instituto de Ciências Exatas e Biológicas, Universidade Federal de Ouro Preto, Ouro Preto, 2021. Disponível em: http://www.monografias.ufop.br/handle/35400000/3331. Acesso em: 18 mar. 2026.

OPENAI. GPT-4 technical report. arXiv preprint, arXiv:2303.08774, 2023.

PICARD, R. W. Affective computing. Cambridge: MIT Press, 1997.

PROJECT MANAGEMENT INSTITUTE (PMI). A guide to the project management body of knowledge (PMBOK guide). 7. ed. Newtown Square: Project Management Institute, 2021.

REIMERS, N.; GUREVYCH, I. Sentence-BERT: sentence embeddings using siamese BERT-networks. In: CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2019, Hong Kong. Proceedings… Hong Kong: ACL, 2019. p. 3982-3992.

SILVA OLIVEIRA, N.; SOUZA, J. C.; BONIFÁCIO, B.; DAMACENO, R.; MOURÃO, F.; ROCHA, L.; ARAÚJO, A. P. Usability evaluation of a chatbot for academic orientation. In: BRAZILIAN SYMPOSIUM ON HUMAN FACTORS IN COMPUTING SYSTEMS (IHC), 2019, Vitória. Proceedings… Vitória: SBC, 2019. p. 1-10.

TEFFÉ, Chiara Spadaccini de; VIOLA, Mario. Tratamento de dados pessoais na LGPD: estudo sobre as bases legais. Civilistica.com, Rio de Janeiro, v. 9, n. 1, 2020. Disponível em: https://civilistica.emnuvens.com.br/redc/article/view/510. Acesso em: 18 mar. 2026.

THAKUR, N.; REIMERS, N.; RÜCKLÉ, A.; SRIVASTAVA, A.; GUREVYCH, I. BEIR: a heterogeneous benchmark for zero-shot evaluation of information retrieval models. In: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NeurIPS), 2021. Proceedings… 2021. p. 9509-9520.

EMPATHICED: AN EDUCATIONAL CHATBOT WITH RAG AND AFFECTIVE COMPUTING IN A COORDINATED SCIENTIFIC INITIATION CONTEXT

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

How to Cite

Google

Make a Submission

Language

Latest publications

Information

Keywords