Análisis comparativo de la evaluación humana y la evaluación basada en inteligencia artificial generativa de resúmenes científicos

López de Ramos, Aura; Bonnett-Bogallo, Belka; Concepción, Dimas; Quintero-Barreto, Gustavo; Durán, Jarles; Meléndez, Nelly; Esteves, Yuly

dc.contributor.author	López de Ramos, Aura
dc.contributor.author	Bonnett-Bogallo, Belka
dc.contributor.author	Concepción, Dimas
dc.contributor.author	Quintero-Barreto, Gustavo
dc.contributor.author	Durán, Jarles
dc.contributor.author	Meléndez, Nelly
dc.contributor.author	Esteves, Yuly
dc.date.accessioned	2025-07-01T22:46:58Z
dc.date.available	2025-07-01T22:46:58Z
dc.date.issued	2025-07-01
dc.identifier.citation	López de Ramos, A. L., Bonnett-Bogallo, B., Concepción, D., Quintero-Barreto, G., Durán, J., Meléndez, N., & Esteves, Y. (2025). Análisis comparativo de la evaluación humana y la evaluación basada en inteligencia artificial generativa de resúmenes científicos. EDUCA. Revista Internacional Para La Calidad Educativa, 5(2), 1-21. https://doi.org/10.55040/q8sgtr65	en_US
dc.identifier.issn	2792-7660
dc.identifier.uri	https://repositorio.ciedupanama.org/handle/123456789/797
dc.description	This study analyzes the differences in the evaluation of abstracts submitted to the II Educational Research Congress COIE-CIEDU 2024, comparing the assessments made by two subject-matter experts with those generated by a generative artificial intelligence system. A standardized evaluation rubric was used, and mean difference tests were applied to determine the presence of statistically significant discrepancies. The results indicate that, while no significant differences were found between the human experts, statistically significant discrepancies were identified between the human evaluations and those generated by the generative artificial intelligence system (p < 0.05). This finding demonstrates that, although human judgment maintains methodological consistency, generative artificial intelligence is not yet capable of replicating the academic quality standards applied by expert reviewers. It is concluded that, although generative artificial intelligence may serve as a valuable support tool for technical or administrative tasks within the review process, it is not ready to autonomously perform academic peer-review functions. Its implementation is recommended as a complementary resource, under clear supervision protocols and continuous performance validation, in order to ensure fairness, rigor, and integrity in the evaluation of scientific content.	en_US
dc.description.abstract	El presente estudio analiza las diferencias en la evaluación de resúmenes enviados al II Congreso de Investigación Educativa COIE-CIEDU 2024, entre las valoraciones emitidas por dos expertos en el área con las generadas por una inteligencia artificial generativa. Se utilizó una misma rúbrica de evaluación, aplicando pruebas de diferencia de medias a fin de determinar la existencia de discrepancias significativas. Los resultados muestran que, si bien no se hallaron diferencias significativas entre los expertos humanos, sí se identificaron discrepancias estadísticamente significativas entre las evaluaciones humanas y las de la inteligencia artificial generativa (p < 0,05). Este hallazgo evidencia que, aunque el juicio humano mantiene una consistencia metodológica, la inteligencia artificial generativa no logra aún emular los estándares de calidad aplicados por revisores expertos. Se concluye que la inteligencia artificial generativa, aunque útil como herramienta de apoyo en tareas técnicas o administrativas delproceso de revisión, no está aún preparada para desempeñar de forma autónoma funciones de arbitraje académico. Se recomienda su implementación como complemento, bajo protocolos de supervisión humana y con validación continua de su desempeño, a fin de garantizar la equidad, la rigurosidad y la integridad en la evaluación de contenidos científicos.	en_US
dc.format	application/pdf	en_US
dc.language.iso	spa	en_US
dc.publisher	Revista Educa	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.rights	https://creativecommons.org/licenses/by-nd/4.0	en_US
dc.subject	análisis comparativo	en_US
dc.subject	inteligencia artificial	en_US
dc.subject	resumen	en_US
dc.subject	evaluación	en_US
dc.subject	investigación educativa	en_US
dc.title	Análisis comparativo de la evaluación humana y la evaluación basada en inteligencia artificial generativa de resúmenes científicos	en_US
dc.type	info:eu-repo/semantics/article	en_US

Files in this item

Name:: 10.+Análisis+comparativo.pdf
Size:: 446.4Kb
Format:: PDF
Description:: Artículo completo

View/Open

This item appears in the following Collection(s)

Artículos Científicos [213]
Esta colección contiene artículos científicos educativos o en áreas relacionadas a educación. Pueden ser sobre Panamá o sobre otras áreas que puedan ser de utilidad.

Show simple item record