Intern Data scientist
03-2018 - 09-2018 Infra/Redes/Telecomunicaciones Data science and machine learning on the data collected from 1250 cars in order to find interests in data for reducing the costs and enhancing the security.
1. March - April: Collecting and scraping structured and semistructured data from different resources.
2. April - June: Exploration, data processing and pipelining for data cleaning, data enhancement, features engineering, ETL, machine learning (regression, classification, association rules, and process mining).
3. June - End: Data analysis and visualizations and report to stockholders. Propose an architecture Big Data for future works on the project
Keyword: python, spark, hadoop, machine learning, algorithm, research