Graduate Programs Portal (UFLA)

SIGAA - Integrated Academic Activities Management System (Sistema Integrado de Gestão de Atividades Acadêmicas)

GRADUATE PROGRAM IN COMPUTER SCIENCE

Phone/Extension: (35) 3829-5195/5195
E-mail: posgrad_si.icet@ufla.br
Old site: http://prpg.ufla.br/alternativo/computacao

News

QUALIFYING EXAM committee: JOAO PAULO PAIVA LIMA

A MASTER'S QUALIFYING EXAM committee was registered by the program.
STUDENT: JOAO PAULO PAIVA LIMA
DATE: 25/06/2025
TIME: 10:00
LOCATION: DCC Seminar Room
TITLE:

How Good Lusophones are Data Science LLM Agents?: Evaluating Agentic Approaches for Data Science in Portuguese Contexts

KEYWORDS:

Large Language Models; Data Science; Machine Learning; Portuguese; Natural Language Processing.

PAGES: 82
BROAD AREA: Exact and Earth Sciences
AREA: Computer Science
SUBAREA: Computing Methodology and Techniques
ABSTRACT:

Data science (DS) and data analysis are complex fields, often requiring highly specialized professionals and time-intensive methods. Analyzing data and creating predictive models commonly involve intricate planning and reasoning capabilities once considered exclusive to humans. However, current advancements in large language models (LLMs), such as complex reasoning and tool use, have challenged this notion. Such advancements are reflected in the numerous recent studies that effectively apply tool-using LLM agents for data analysis and machine learning tasks. However, these developments are not as accessible as one might hope, with most frameworks and evaluations exclusively conducted using English prompts, data, and metadata. Specifically, LLM-automated DS in Portuguese contexts remains largely unassessed in the literature. To address this gap, we aim to evaluate how capable language models are at conducting lusophone analysis. To that end, we will develop a new evaluation set for Portuguese automated data science and hope to employ it to validate LLMs' accuracy and linguistic consistency when performing exploratory data analysis (EDA) and machine learning engineering (MLE). Additionally, we will explore a novel approach to assess agents on automated exploratory analysis, by ranking their analyses based on the improvement they provide to the subsequent task of automated MLE when compared to an EDA-free baseline.
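
The ranking idea in the last sentence of the abstract can be illustrated with a minimal sketch. It assumes each agent's exploratory analysis is passed to the same automated MLE step, which yields one scalar score, and that the same MLE step is also run once without any EDA to obtain the baseline. The names `EdaResult` and `rank_by_eda_gain` are hypothetical and not taken from the dissertation, and the numbers in the usage lines are placeholders, not results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EdaResult:
    agent: str        # hypothetical agent identifier
    mle_score: float  # downstream MLE metric obtained when the MLE step receives this agent's EDA


def rank_by_eda_gain(results, baseline_score, higher_is_better=True):
    """Rank agents by how much their EDA improves the downstream MLE score
    relative to an EDA-free baseline. Returns (agent, gain) pairs, best first."""
    # For error-style metrics (lower is better), flip the sign so that a
    # positive gain always means "better than the baseline".
    sign = 1.0 if higher_is_better else -1.0
    gains = [(r.agent, sign * (r.mle_score - baseline_score)) for r in results]
    return sorted(gains, key=lambda pair: pair[1], reverse=True)


# Placeholder scores, for illustration only.
example = [
    EdaResult("agent_a", 0.80),
    EdaResult("agent_b", 0.85),
    EdaResult("agent_c", 0.70),
]
for agent, gain in rank_by_eda_gain(example, baseline_score=0.75):
    print(f"{agent}: {gain:+.2f}")
```

A negative gain in this sketch would mean the agent's analysis left the MLE step worse off than running it with no EDA at all, which is why the baseline is kept as an explicit argument rather than folded into the scores.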

COMMITTEE MEMBERS:
Internal - DILSON LUCAS PEREIRA (Alternate)
Internal - ELAINE CECILIA GATTO (Member)
Internal - LUIZ HENRIQUE DE CAMPOS MERSCHMANN (Member)
Internal - MARLUCE RODRIGUES PEREIRA (Member)
Chair - DENILSON ALVES PEREIRA (Member)

News item registered on: 25/06/2025 22:05
