Pronóstico de ventas de comestibles de Corporación Favorita

Jaramillo Mira, Alejandro; Gutiérrez Muriel, Juan Pablo

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/10495/25073

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.advisor	Quiza Montealegre, Jhon Jair	-
dc.contributor.author	Jaramillo Mira, Alejandro	-
dc.contributor.author	Gutiérrez Muriel, Juan Pablo	-
dc.date.accessioned	2021-12-14T18:45:22Z	-
dc.date.available	2021-12-14T18:45:22Z	-
dc.date.issued	2021	-
dc.identifier.uri	http://hdl.handle.net/10495/25073	-
dc.description.abstract	RESUMEN : En esta monografía se propone una estrategia para la solución al problema publicado en Kaggle por la empresa Corporación Favorita, el cual consiste en poder predecir la cantidad de unidades vendidas por producto de Corporación Favorita en sus tiendas ubicadas en Ecuador, para este 8 reto la empresa brindó datos de unidades de productos vendidos durante un lapso de aproximadamente 19 meses, que abarcaban meses del año 2016 y parte del 2017. Adicionalmente la empresa puso a disposición otro tipo de datos complementarios, como información de las tiendas, información de la agrupación de sus productos, información histórica del precio del petróleo y de los días festivos en Ecuador, factores que pueden llegar a ser complementarios para predecir el comportamiento de las ventas de los productos. Para comenzar a trabajar los datos primero se hizo una exploración de estos, que facilitara su entendimiento y permitiera tener una familiaridad con estos y con el comportamiento del negocio en general. A su vez se prepararon los datasets de forma que fuese posible trabajarlos y analizarlos con modelos predictivos usando Python como herramienta. Luego de conocer y preparar los datos se comenzó con una primera fase de modelos predictivos, pero únicamente intentando predecir el número total de ventas por periodo de tiempo, sin discriminarlo aún por tipo de productos, para estas primeras iteraciones se construyeron modelos ARIMA que permitieran analizar las ventas como serie de tiempo y un modelo LSTM. Estos modelos permitieron obtener resultados prometedores para el análisis. Posteriormente, en la siguiente etapa de iteraciones se discriminaron las predicciones de ventas por tipo de producto. Para esto se construyeron y se compararon 4 modelos, basados en modelos como CNN, LSTM, LSTM-CNN. Estos modelos mostraron buenos resultados y finalmente se escogió CNN-LSTM como el modelo más acorde; con este se obtuvo un valor de RMSE en el conjunto de datos de 9 entrenamiento de 0.5598231404074386, y un valor de RMSE en el conjunto de datos de validación de 0.5600565469166683.	spa
dc.description.abstract	ABSTRACT : This monograph proposes a strategy for solving the problem published in Kaggle by the Corporación Favorita company, which consists of being able to predict the number of units sold per Corporación Favorita product in its stores located in Ecuador, for this challenge the company 10 provided data of units of products sold during a period of time of approximately 19 months, covering months of the year 2016 and part of 2017. Additionally, the company made available other types of complementary data, such as information on stores, information on the grouping of its products, historical information on the price of oil and public holidays in Ecuador, factors that may become complementary to predict the behavior of product sales. To begin working on the data, an exploration of these was first made, which would help facilitate their understanding and allow them to have a familiarity with them and with the behavior of the business in general. In turn, the datasets were prepared in such a way that it was possible to work on them and analyze them with predictive models using Python as a tool. After knowing and preparing the data, a first phase of predictive models began, but only trying to predict the total number of sales per period of time, without still discriminating by type of products, for these first iterations ARIMA models were built that allowed to analyze sales as a time series and an LSTM model. Which allowed us to obtain promising results for the analysis. Later, in the next stage of iterations, sales were included in the analysis by period of time, it was wanted to discriminate sales by product type, for this, 4 models were built and compared with each other using neural networks, models such as CNN, LSTM, LSTM + CNN, which showed good results. Which showed good results and finally CNN-LSTM was chosen as the model most consistent with metrics: Train rmse: 0.5598231404074386 and Validation rmse: 0.5600565469166683 Keywords: Corporación Favorita, Predictive Models, ARIMA, LSTM, CNN.	spa
dc.format.extent	39	spa
dc.format.mimetype	application/pdf	spa
dc.language.iso	spa	spa
dc.type.hasversion	info:eu-repo/semantics/draft	spa
dc.rights	info:eu-repo/semantics/openAccess	spa
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/2.5/co/	*
dc.title	Pronóstico de ventas de comestibles de Corporación Favorita	spa
dc.type	info:eu-repo/semantics/other	spa
oaire.version	http://purl.org/coar/version/c_b1a7d7d4d402bcce	spa
dc.rights.accessrights	http://purl.org/coar/access_right/c_abf2	spa
thesis.degree.name	Especialista en Analítica y Ciencia de Datos	spa
thesis.degree.level	Especialización	spa
thesis.degree.discipline	Facultad de Ingeniería. Especialización en Analítica y Ciencia de Datos	spa
thesis.degree.grantor	Universidad de Antioquia	spa
dc.rights.creativecommons	https://creativecommons.org/licenses/by-nc-sa/4.0/	spa
dc.publisher.place	Medellín	spa
dc.type.coar	http://purl.org/coar/resource_type/c_46ec	spa
dc.type.redcol	http://purl.org/redcol/resource_type/COther	spa
dc.type.local	Tesis/Trabajo de grado - Monografía - Especialización	spa
dc.subject.lemb	Técnicas de predicción	-
dc.subject.lemb	Forecasting techniques	-
dc.subject.lemb	Redes neurales (computadores)	-
dc.subject.lemb	Neural networks (Computer science)	-
dc.subject.lemb	Inteligencia artificial	-
dc.subject.lemb	Artificial intelligence	-
Aparece en las colecciones:	Especializaciones de la Facultad de Ingeniería

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
JuanPabloGutierrezMuriel_2021_PrediccionVentasUnidades.pdf	Trabajo de grado de especialización	765.61 kB	Adobe PDF	Visualizar/Abrir

Mostrar el registro sencillo del ítem

Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons