Data Engineer

Mayo Clinic

United States Remote

Full time

$86.6-133.7k (annually)

Jan 23


Be challenged to deliver innovative solutions that will change health care.

Mayo Clinic’s tech culture is rooted in passion for technology, embraces innovative thinking and strives for high performance. Our teams drive change in health care through comprehensive connected health and digital transformation strategies.

Some examples of our major initiatives are:

  • Utilizing artificial intelligence and machine learning principles to develop next generation patient centric care systems
  • Transforming the practice by applying data science techniques to discover new approaches to health care delivery
  • Leveraging Enterprise Architecture to construct integration centricity, promote data liquidity, and provide innovation support

This transformation creates, connects and applies integrated knowledge to deliver the best health care, health guidance and health information to patients, customers, partners, providers, employees anywhere and anytime so the needs of the patient come first.

Job Description








Information Technology


Why Mayo Clinic

Mayo Clinic is the nation's best hospital (U.S. News & World Report, 2022-2023) and ranked #1 in more specialties than any other care provider. We have a vast array of opportunities ranging from Nursing, Clinical, to Finance, IT, Administrative, Research and Support Services to name a few. Across all locations, you’ll find career opportunities that support diversity, equity and inclusion. At Mayo Clinic, we invest in you with opportunities for growth and development and our benefits and compensation package are highly competitive. We invite you to be a part of our team where you’ll discover a culture of teamwork, professionalism, mutual respect, and most importantly, a life-changing career!

Mayo Clinic offers a variety of employee benefits. For additional information please visit Mayo Clinic Benefits. Eligibility may vary.

Position description

This is a full time remote position within the United States.

The Genomics and Bioinformatics Services Section is seeking a motivated Data Engineer to be responsible for creating and maintaining the analytical infrastructure that enables most functions in the data world. Strong SQL and ETL skills are required, Candidate must have advanced Tableau experience. Prefer candidates have experience with Alteryx. Candidate must be able to work collaboratively with stakeholders to create visualizations that represent the needed analysis. You will be responsible for development, testing, and maintenance, of architectures for large-scale medical related databases in GCP using Big Query and other state of the art systems. You will also be responsible for creating data set processes for verification, acquisition, mining and modeling of clinical data through micro-services and API’s.

Create and maintain optimal data pipeline architecture using state of the art systems that access data via API’s. Ability to build and optimize data sets, 'big data' data pipelines and architectures. Ability to perform root cause analysis on external and internal processes and data to identify opportunities for improvement and answer questions. Excellent analytic skills associated with working on unstructured datasets. Ability to build processes that support data transformation, workload management, data structures, dependency and metadata in GCP. Develop and test large, complex data sets that meet functional / non-functional business requirements. Implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability. Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data source formats using on premise and cloud technology. Integrate object storage features within cloud storage to create data at rest driven compute models. Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics. Work with stakeholders including the Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs. Keep Mayo’s data separate and secure across national boundaries through multiple data centers and cloud regions Work with data and analytics experts to strive for greater functionality in our data systems. Continues to build knowledge of the organization, processes and customers. Performs a range of mainly straightforward assignments. Uses prescribed guidelines or policies to analyze and resolve problems. Receives a moderate level of guidance and direction.

Mayo Clinic will not sponsor or transfer visas for this position including F1 OPT STEM.


Bachelor's degree in Computer Science or Engineering from an accredited University or College; OR an Associate’s degree in Computer Science or Engineering from an accredited University or College with 2 years of experience.

Additional qualifications

Have working knowledge and experience in Data Engineering with a minimum of 2 years of experience in data engineering and data science or analytical modeling.

Experience using scripting languages (Python, JavaScript).

A minimum of 2 years of experience leveraging micro-services using a high-level language (C#, C++, Java) for data access and analytics.

Strong interpersonal, time management skill and demonstrated experience working on cross functional teams

A minimum of 1 year of SQL or No-SQL experience.

Experience working in an agile development environment leveraging tools such as Jira.

Experience with scrum, coding from user stories, and performing retros.

Preferred qualifications for this position include:

Experience using advanced data processing solutions/capabilities such as Apache Spark, Hive, Pig and Kafka.

Experience using big data, statistics and knowledge of data related aspects of machine learning.

Experience working with Linux or other Unix based operating systems.

Knowledge of how workflow scheduling solutions such as Apache Airflow and Google Composer related to data systems.

Knowledge of using Infrastructure as code (Kubernetes, Docker) in a cloud environment.

Experience in practicing CI/CD (Jenkins, GitHub Actions, ADO)

Experience with cloud platforms such as GCP, Azure, AWS

Exemption status


Compensation Detail

$86,611 - $133,682 / year

Benefits eligible



Full Time

Hours / Pay period


Schedule details

Monday - Friday 8am - 5pm. Occasional support off hours.

Weekend schedule

May be required to provide 24x7 on-call support or occasional weekend work.



International Assignment


Site description

Mayo Clinic is located in the heart of downtown Rochester, Minnesota, a vibrant, friendly city that provides a highly livable environment for more than 34,000 Mayo staff and students. The city is consistently ranked among the best places to live in the United States because of its affordable cost of living, healthy lifestyle, excellent school systems and exceptionally high quality of life.


Ted Keefe


As an Affirmative Action and Equal Opportunity Employer Mayo Clinic is committed to creating an inclusive environment that values the diversity of its employees and does not discriminate against any employee or candidate. Women, minorities, veterans, people from the LGBTQ communities and people with disabilities are strongly encouraged to apply to join our teams. Reasonable accommodations to access job openings or to apply for a job are available.

Apply for this position Back to job

You must be logged in to to apply to this job.


Your application has been successfully submitted.

Please fix the errors below and resubmit.

Something went wrong. Please try again later or contact us.

Personal Information


View resume



Mayo Clinic

When you need answers, you know where to go. The No. 1 hospital in the nation, for you.