Macrodata Refiner
About Lumon Industries
Lumon Industries is a biotechnology and data science company with over seven decades of operations guided by the vision of our founder, Kier Eagan. We are entering a new era of data-driven discovery, and our Macrodata Refinement division is at the center of that transformation.
The Role
We are hiring a Macrodata Refiner to join our growing Severed Floor team. In this role you will process, analyze, and refine large-scale proprietary datasets using Lumon's internal tooling and modern data engineering practices. You will work closely with department heads and cross-functional partners to ensure data integrity, build automated pipelines, and surface insights that drive critical business decisions.
This is a hands-on technical role that requires strong analytical skills, comfort with ambiguity, and the ability to work independently within a structured operating environment.
Responsibilities
- Process and refine large proprietary datasets using Lumon's internal data platform and standard ETL tooling
- Build and maintain automated data pipelines using Python and SQL
- Perform statistical analysis and anomaly detection across multi-dimensional datasets
- Collaborate with cross-functional teams (Optics & Design, Wellness, Management) to translate data requirements into technical solutions
- Write clear documentation for data models, transformation logic, and pipeline architecture
- Monitor data quality metrics and implement validation checks
- Participate in weekly department syncs and present findings to non-technical stakeholders
Requirements
Must Have
- 4+ years of professional experience in data engineering, data analysis, or a related technical role
- Strong proficiency in Python (pandas, NumPy, or similar data libraries)
- Advanced SQL skills — complex queries, window functions, query optimization
- Experience building and maintaining ETL/ELT pipelines
- Bachelor's degree in Computer Science, Statistics, Mathematics, Data Science, or a related quantitative field
Should Have
- Experience with cloud data platforms (Snowflake, BigQuery, or Redshift)
- Familiarity with workflow orchestration tools (Airflow, Dagster, or Prefect)
- Understanding of statistical methods — hypothesis testing, regression analysis, anomaly detection
- Strong written communication skills — ability to document technical processes clearly and present findings to non-technical audiences
Nice to Have
- Experience with data visualization tools (Tableau, Looker, or Metabase)
- Familiarity with containerization (Docker) and version control best practices (Git)
- Prior experience in a regulated industry (biotech, healthcare, finance)
- Exposure to machine learning workflows or feature engineering
What We Value
- Precision and attention to detail in data work
- Curiosity about the systems you work within
- The ability to stay focused and productive in a structured, distraction-free environment
- Ownership — you see problems through to resolution without waiting to be told
- A collaborative mindset — you work well with colleagues across departments
Compensation & Benefits
- Competitive salary based on experience
- Comprehensive benefits package, including the Lumon Wellness Program
- On-site perks: Music/Dance Experience sessions, Waffle Parties, Finger Traps
- Career development within a growing division
- Melon Bar access during designated break periods