Sr. Data Engineer

Sorcero

Data Science
United States
Posted on May 7, 2025

COMPANY SUMMARY

Sorcero's medically-tuned intelligence platform transforms life sciences decision-making, accelerating patient access to life-saving treatments. By harnessing insights from global medical data, we empower industry leaders to deliver breakthrough therapies. Founded in 2018 by pioneers in AI, public health, and personalized content, Sorcero is a Certified B Corp headquartered in Washington, DC. Our innovation has been recognized with more than a dozen awards, including Fast Company's Most Innovative Companies of 2024, and we hold six foundational medical AI patents.

WHY JOIN SORCERO?

Sorcero transforms decision-making in life sciences by empowering stakeholders with insights to improve patient outcomes. By joining our team, you would play a critical role in our growth and success by collaborating with our network of passionate entrepreneurs to build a scalable, impactful organization. From flexible working arrangements to encouraging meaningful interactions with teammates across the globe, Sorcero provides a supportive community to bring out the best in each of us.

Our employees are driven by these common values:

  • Pursuit of excellence and mastery of their craft
  • Humility in defeat and in success
  • Kindness and empathy towards others
  • Desire to create large-scale, positive change

THE ROLE

We are seeking a highly motivated, autonomous, and experienced Senior Data Engineer to join our growing team. This critical individual contributor role requires a hands-on problem solver who can deliver immediate business value through tactical data solutions while helping shape and champion our long-term data architecture and strategy. You will take ownership of key areas, proactively driving progress in designing, building, and maintaining robust, scalable data infrastructure and pipelines. You'll work with cutting-edge technologies across the full data lifecycle – including data lakes, data warehouses, and data vault systems – enabling data-driven decisions and powering vertical SaaS applications. If you are passionate about building high-quality, secure, and compliant data solutions, proficient in Python, and thrive in a collaborative, fast-paced environment where you can make a significant impact, we want to hear from you!

RESPONSIBILITIES:

  • Develop and Evangelize: Define, develop, and evangelize the Data Architecture, Data, and Analytics Strategy in collaboration with Engineering, AI, and Product teams, incorporating feedback for continuous improvement while maintaining feature velocity.
  • Own and Drive: Demonstrate autonomy and urgency, taking ownership and proactively driving accountability for progress in your designated areas.
  • Design and Build: Design, develop, test, deploy, and maintain scalable, reliable, secure, and high-performance data pipelines (batch and streaming), ETL/ELT processes, and data services.
  • Architect and Implement: Contribute significantly to defining and implementing the architecture for our data lake, data warehouse (e.g., BigQuery, Redshift, Snowflake), data vault systems, data catalogs, and overall data platform infrastructure (Compute, Storage, Networking), often developing technical specifications and implementation plans.
  • Optimize: Build and optimize data processing frameworks using technologies like Spark, Flink, and Kafka. Optimize query performance and data platform components (e.g., Presto, Elasticsearch).
  • Govern and Model: Implement and uphold best practices in data modeling (including dimensional and data vault), data quality, data governance, data cataloging, and data lineage, ensuring adherence to strict security policy compliance where applicable.
  • Develop: Develop software and tools in Python to support data ingestion, processing, storage, enrichment, and analysis.
  • Collaborate: Work closely with the Chapter Leader, Engineering Managers, Product teams, data scientists, analysts, and peer engineers to build out Data and Analytics Architectures and Services, particularly for Life Sciences and Healthcare datasets.
  • Facilitate and Improve: Facilitate efficient processes and workflows within a large-scale production infrastructure. Identify and share potential improvement opportunities with management and peers.
  • Execute Agile: Drive agile methodologies through experimentation, prototyping, and solid execution.
  • Build Applications: Help build out vertical SaaS applications leveraging data assets to drive major business impact.
  • Manage Infrastructure: Implement and manage infrastructure on cloud platforms (GCP/AWS), utilizing containerization (Docker) and orchestration (Kubernetes).
  • Enable DevOps: Contribute to the team's DevOps practices, including CI/CD, monitoring (Splunk, Grafana, CloudWatch), and infrastructure automation.
  • Mentor: Mentor junior engineers and contribute to technical leadership within projects.
  • Track Success: Define and track key success metrics for Data and Analytics Engineering initiatives.
  • Stay Current: Stay current with emerging big data technologies and propose innovative solutions.

REQUIRED QUALIFICATIONS:

  • BS or MS in Computer Science, Computer/Electrical Engineering, or a closely related Software Engineering field.
  • 7+ years of hands-on experience in software development, with a significant focus on building data platforms, data products, and data pipelines.
  • 5+ years of proven experience delivering enterprise-grade Data Services and products (e.g., Core Data Pipelines, Data Lakes, Data Warehouses, Data Catalog, Data Lineage, Data Governance).
  • 3+ years of experience in technical leadership or project lead roles within Data Engineering, demonstrating the ability to guide technical direction and mentor others.
  • Expert-level programming skills in Python for data processing and application development.
  • Solid understanding of data structures, algorithms, data modeling (including dimensional, star schema, and data vault), and software architecture principles.
  • Deep knowledge of mathematical concepts, probability, and statistics relevant to data engineering.
  • Extensive experience with big data technologies such as Apache Spark, Apache Flink, Kafka, Hadoop ecosystem (HDFS, YARN), Presto, and data formats like Avro and Parquet. Experience with Iceberg is a plus.
  • Strong experience with cloud-based data warehousing and analytics platforms like Google BigQuery, Amazon Redshift, or Snowflake.
  • Hands-on experience with cloud platforms (GCP or AWS), including storage, compute, database services, containerization (Docker), and orchestration (Kubernetes).
  • Proven ability to translate complex business requirements into clean, logical, and maintainable technical designs and code.
  • Strong skills in code reviews, testing methodologies, and Agile/Sprint management.
  • Ability to work effectively under pressure and deliver high-quality results.
  • Excellent written and verbal communication skills, with experience creating solid technical specifications that explain complex technical concepts to both technical and non-technical audiences and evangelize architectural decisions.
  • A collaborative mindset, able to work effectively across different teams and operating styles, fostering an open and respectful environment.
  • Ability to support stakeholders and collaborate with team members primarily working in the Eastern time zone (EST).
  • Demonstrated sense of autonomy, ownership, and urgency.

PREFERRED QUALIFICATIONS:

  • Experience with data visualization tools (e.g., Looker, Power BI).
  • Experience with search technologies like Elasticsearch.
  • Familiarity with Machine Learning concepts and technologies (e.g., TensorFlow, PyTorch, Scikit-learn, MLflow, TF Serving).
  • Experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation).
  • Experience working within regulated environments (e.g., healthcare, finance) and understanding compliance requirements (e.g., HIPAA).

COMPENSATION & BENEFITS

Sorcero is a remote and globally distributed team. We offer a rewarding salary along with the following benefits:

  • Three weeks PTO + 1 week of December holiday recess + 1 week Safe & Sick leave
  • 10 company holidays + 2 floating holidays
  • Medical, dental and vision benefits for you and your family
  • Paid parental bonding leave
  • 401(k) retirement plan
  • Company-provided laptop and hardware

Because ∑ (human + AI) > human or AI alone.

Sorcero is an Equal Opportunity Employer and celebrates the diversity of its growing team. We do not discriminate based upon race, religion, color, national origin, sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, education, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

We are unable to sponsor or take over sponsorship of employment visas at this time.