About
My profile is a mixture regarding software/data engineering and data science/analytics. I’m a Data Engineer with a strong scientific background and a Data Scientist able to take ownership of the whole data pipeline required to deploy a model into production.
I hold a Ph.D. in Electrical Engineering from the Federal University of Minas Gerais were I worked mainly on machine learning topics applied to dynamical systems fault diagnosis and prognostics. I’m particularly interested in statistical and AI models that can extract knowledge from time-series recursively in a continuous learning scheme.
I also worked as a software/data engineer building RESTFull APIs for companies in the Silicon Valley and creating real-time distributed data pipelines to feed artificial intelligence algorithms. I’m very interested in ETL processes; I’ve worked with data acquisition using crawlers, APIs and browser automation and with information extraction from different file types.
If you’re looking for a team player who can work cross functionally among engineering, data science and business teams, my profile might be a good fit. Feel free to contact me!
Carrer summary
- Ph.D. degree in Electrical Engineering from UFMG with research related to fault prognostics and development of tools to improve fault detection and diagnostics in the industrial context using probabilistic and AI models;
- core competency in software/data engineering, data science/analytics, real-time predictive algorithms, incremental learning systems, and research.
- received two scholarships to study new learning mechanisms in AI for fault detection and diagnostics in electrical systems, with one of them to study in the USA for a year;
- 11 reviewed publications, including 3 in top-tier journals, 8 in national and international conferences/workshops, 1 under review, Young Author Support awardee at IFAC’20;
- developed object-oriented python web services for Silicon Valley companies, worked under Scrum management to meet tight deadlines;
- Advanced level of English (CEFR C1) certified by the British Council. [PDF Certificate]
- Tech stack: Python, Numpy, Pandas, Seaborn, Scikit-learn, Django, PostgreSQL, Docker, AWS (EC2, ELB, ECS, RDS, S3), Travis, Elasticsearch, MatLab, C++, Node-red, MQTT, GIT.
Employment
Unigal (Usiminas) Minas Gerais, BR
Data Engineer (Production Engineer) Apr 2021 - Current
- Aiding field technicians/engineers to deploy third-party predictive algorithms for processes and machines in continuous galvanizing lines.
- Tech stack: Python, Numpy, Pandas, Matplotlib, Seaborn, Scikit learn, XGBoost.
Fundação Getúlio Vargas (EMAp) Rio de Janeiro, BR
Data Engineer/Data Analyst (Postdoctoral Fellow) Jan 2021 - Apr 2021
- Worked with Universidade de São Paulo (USP) and Imaflora (a Brazilian non-profit organization) on detecting illegal timber operations in the Amazon rainforest using publicly available information.
- In order to create the datasets, I developed crawlers using both HTTP requests and browser automation with Selenium. The information was extracted from HTML and PDF pages using BeautifulSoup and PDF OCR libraries. I was able to identify situations where transported volumes exceed the authorized amount, indicating fraudulent timber transactions.
- I also taught the undergraduate course of Interactive Web Development to the Applied Mathematics and Data Science. The course received great feedback due to its homework tailored to maximize the student learning of HTML, CSS and JavaScript. More details at the course page.
- Tech stack: Python, Numpy, Pandas, Matplotlib, Seaborn, Jupyter, NetworkX, Leaflet, JS, Requests, Selenium, PDF OCR with camelot and pdfplumber, D3.js, Node, HTML, CSS.
Fundação Christiano Ottoni Minas Gerais, BR
Data Engineer/Data Scientist (Graduate Researcher) Aug 2018 - Dec 2020
- Worked in an R&D project fully funded by the Brazilian oil and gas company Petrobras (Petróleo Brasileiro S.A.) to investigate incremental learning AI-systems for fault detection and adaptive diagnosis in industrial processes.
- I was involved with creating SmartData, a full platform written in Python to run incremental learning algorithms for fault detection, diagnosis and identification. SmartData manages all the real-time data pipeline from a multi-source acquisition environment to a distributed network of intelligent algorithms.
- Due to the limitation of having labeled data, the algorithm I implemented (Online Ellipsoidal Clustering) consisted of an online clustering technique that builds its knowledge-base from scratch.
- I also represented the team in technical training and presentations. An example of that was the talk given at the VIII Oil and Gas Production Optimization Workshop, hosted at the Petrobras Research Center. The job also involved writing scientific papers and internal reports to help spread the research products of our group.
- Tech stack: Python, Numpy, Pandas, Matplotlib, MatLab, C++, Node-red, MQTT, OPC, GIT.
Cheesecake Labs Santa Catarina, BR
Software Engineer Jan 2018 - Nov 2018
- I worked as a back-end engineer on two amazing projects: Singularity University and Woovit. My job was to create python RESTFull APIs using Django Rest Framework to interact with our front-end and mobile apps.
- We used Docker to run everything on Amazon and Heroku clouds through the CI/CD process I helped set. All of our data were stored in MySQL and PostgreSQL databases with some asynchronous tasks such as email sending and crawlers being triggered through Redis.
- Thousands of people from more than 127 countries used the products I helped design. The amazing team effort aided the company to be featured as Top #1 Mobile App Development Company by Clutch.
- Tech stack: Python, Django, DRF, MySQL PostgreSQL, Docker, AWS (EC2, ELB, ECS, RDS, S3), Circle CI, Travis, Elasticsearch, Scrum, Agile.
Education
Universidade Federal de Minas Gerais Minas Gerais, BR
PhD in Electrical Engineering Aug 2018 - Dec 2020
Thesis: New Data-Driven Methodologies For Fault Prognostics Using Evolving Fuzzy Models
Supervisor: Prof. Reinaldo Martinez Palhares
Universidade Estadual de Montes Claros Minas Gerais, BR
B.Sc. in Systems Engineering Aug 2012 - Aug 2018
Project: New approaches for abrupt and incipient fault detection in dynamic systems
Supervisor: Prof. Marcos Flávio Silveira Vasconcelos D’Angelo
Honors and Awards
- [2020] Young Author Support granted by the International Federation of Automatic Control (IFAC) to attend to the 21st IFAC World Congress at Berlin. (EUR 250,00)
- [2018] Graduate research scholarship from Brazilian oil and gas company Petrobras. (BRL 80.000,00)
- [2015] Undergraduate study abroad scholarship from the Brazil Scientific Mobility Program (BSMP) to study electrical and computer engineering at the Ohio State University (OSU). (USD 50.000,00)
- [2014] Undergraduate research scholarship from the Minas Gerais State Agency for Research and Development (FAPEMIG). (BRL 4.800,00)
Volunteer Work
- [2020] Part of the UFMG COVID-19 task force to create predictive models for the disease spread in the city of Belo Horizonte, MG. Special Cover
- [2020] Wrote the fuzzy logic chapter in the “Jornada Data-Driven” book, written collaboratively with 50+ people (In progress). Jornada Colaborativa