**This is a direct-hire position for one of our clients. The position is fully remote, but candidates must work in the EST or CST time zone. Candidates must be able to work in the US without sponsorship.**
We are seeking a highly skilled and experienced Data Engineer to join our team. In this role, you will be responsible for creating and maintaining a scalable ETL data pipeline, managing a multi-modal data storage system, and collaborating with the data science team to enable ML Ops. Your strong understanding of data and proficiency in SQL will be critical for efficiently querying and obtaining data. As a Data Engineer, you will take ownership of technical and business outcomes, assist the development and data science teams with data analysis, and document processes and methodologies. Excellent communication skills are essential for effectively conveying insights and findings to stakeholders.
Responsibilities
• Create and maintain a scalable ETL data pipeline that ingests multiple large data sets, including structured financial and patent data, as well as unstructured data such as white papers and scraped websites. Enable entity resolution and other transformations for clean data integration and usage.
• Develop and maintain a multi-modal data storage system that supports scalable and real-time processing for production-level data.
• Collaborate with the data science team to enable ML Ops, ensuring the efficient integration and deployment of machine learning models into the platform.
• Bring deep curiosity and a passion for data, backed by a strong, extensive understanding of it. Efficiently query and retrieve data using SQL.
• Take ownership of technical and business outcomes, demonstrating a strong sense of responsibility for the success of data engineering projects.
• Assist the development and data science teams with data processing and with integrating data analysis, enabling them to derive valuable insights from the data.
• Clearly document processes, methodologies, and tools used, ensuring that knowledge is effectively shared within the team.
Qualifications
• Bachelor's degree in a relevant technical field.
• Significant experience (at least 3-5 years) as a data engineer in the AWS ecosystem, with strong familiarity with large structured and unstructured data sets. Proficiency in enabling scalable, distributed compute and ensuring real-time processing at scale.
• Demonstrated expertise (at least 3-5 years) in writing complex SQL queries and conducting data correlation analysis.
• Extensive experience (at least 3-5 years) with the AWS ecosystem, including tools, services, and resources that enable scalable and distributed compute.
• Strong project management skills, with the ability to scope timelines, methodologies, and deliverables for development, testing, and integration into the platform.
• Excellent communication and storytelling skills, both written and verbal, to effectively convey insights and findings to technical and non-technical stakeholders.
Our Vetting Process
At Emergent Software, we work hard to find the software engineers who are the right fit for our clients. Here are the steps of our vetting process for this position:
- Application (5 minutes)
- Online Assessment & Short Algorithm Challenge (40-60 minutes)
- Initial Phone Interview (30-45 minutes)
- 2-3 Interviews with the Client
- Job Offer!
Join our client's dynamic team as a Data Engineer and play a crucial role in developing and maintaining its data infrastructure, ensuring the seamless integration and use of large-scale data sets. Apply your expertise in data engineering and the AWS ecosystem to drive innovation and deliver impactful solutions.