About Me

Bio

Hey! I’m Pedro Duarte Faria, a Brazilian economist and senior data engineer working mainly with R, SQL, Python, Databricks, and Apache Spark. I do a lot of data engineering these days, but I also love teaching, writing about software development, and building open-source software.


Experience

Senior Data Engineer, DSM-Firmenich, April 2025 - present.

Data Platform Engineer, Blip, March 2024 - April 2025.

Analytics Engineer, Blip, February 2023 - March 2024.

Data Analyst, Blip, May 2021 - February 2023.

Research Engineering Intern, João Pinheiro Foundation, August 2019 - March 2021.

Business Intelligence Intern, Beltech, June 2019 - August 2019.


Projects

I’m the author of the R package {figma}, the Python package spark_map, and the C++ library lefer. I’ve also written technical books about the R language and the Python API of Apache Spark. I have also contributed to the R package {knitr}, a major open-source project in the R community, and to the book “rOpenSci Packages: Development, Maintenance, and Peer Review” by rOpenSci.


Education

Federal University of Ouro Preto - UFOP, Brazil

Economics, B.S., March 2017 - February 2022.

Federal University of Minas Gerais - UFMG, Brazil

Visiting Student in the Economics Department, February 2019 - November 2020.