ABOUT THE ROLE
We are looking for a Senior Data Engineer to build and extend the data pipeline architecture of one of our partners, a company building the top revenue intelligence platform on the market. Their unique approach to data collection, enhancement, verification and growth sets them apart in the market and solidifies their position as the best B2B data partner.
The ideal candidate for this position is an experienced data pipeline builder and data wrangler who enjoys working with big data and building systems from the ground up.
You will collaborate with software engineers, database architects, data analysts and data scientists to ensure the data delivery architecture is consistent throughout the platform. You must be self-directed and comfortable supporting the data needs of multiple teams, systems and products. The right candidate will be excited by the prospect of optimizing or even re-designing the company’s data architecture to support the next generation of products and data initiatives.
SOME MORE INTERESTING PROJECT FACTS
- Reliable B2B data, backed by the most dedicated customer service team;
- Their combination of automation and researchers allows them to reach 95% data accuracy for all their published contact data, while continuing to scale up their number of contacts;
- They have more than 5 million human-verified contacts, more than 70 million machine-processed contacts, and the highest number of direct dial contacts in the industry.
COLLABORATION
- PFA/SRL only, full-time;
- Fully remote in RO;
- 10:30 - 19:30 (+/- 1 hour flexibility on either side).
DUTIES AND RESPONSIBILITIES
- Design and build parts of the data pipeline architecture for extraction, transformation, and loading of data from a wide variety of data sources using the latest Big Data technologies;
- Identify, design, and implement internal process improvements, such as automating manual processes, optimizing data delivery and re-designing infrastructure for greater scalability;
- Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs;
- Work with machine learning, data, and analytics experts to drive innovation, accuracy and greater functionality in the data system.
REQUIREMENTS
- 5+ years of experience in a Data Engineer role;
- 3+ years of experience with Apache Spark and a solid understanding of its fundamentals;
- Deep understanding of Big Data concepts and distributed systems;
- Strong coding skills in Scala, Python, Java and/or other languages, and the ability to switch between them quickly;
- Advanced working knowledge of SQL and experience with a variety of relational databases such as Postgres and/or MySQL;
- Cloud experience with Databricks;
- Experience working with data stored in many formats including Delta Tables, Parquet, CSV and JSON;
- Comfortable working in a Linux shell environment and writing scripts as needed;
- Comfortable working in an Agile environment;
- Machine Learning knowledge is a plus;
- Capable of working independently and delivering stable, efficient and reliable software;
- Excellent written and verbal communication skills in English;
- Experience supporting and working with cross-functional teams in a dynamic environment.