Expert II Data Science/Senior Expert Data Science, Capsid Engineering

2 days ago


San Diego, California, United States Novartis Full time $103,600 - $222,300

Job Description Summary

#LI-onsite
Location: San Diego, CA
Internal Job Title: Expert II Data Science or Senior Expert I, Data Science
**This is a dual posting. The final level & title of the offer role would be determined by the hiring team based on the skills, experience & capabilities required to perform the role at the level the role has been offered.

We are seeking a highly motivated, independent Expert II Data Science or Senior Expert I in Computational Biology to expand our AAV capsid engineering platform. This role will drive innovation in AAV capsid engineering and vector characterization, used in the development of next-generation gene therapies for neuromuscular and neurogenetic diseases. This position will collaborate across disciplines and representing the group internally and externally. The position requires an in-depth understanding of computational biology coupled with machine-learning methods, next generation sequencing, AAV biology and modern structural and sequence-based approaches to protein engineering. This role will be closely associated with laboratory-based biologists, and wet-lab experience and knowledge of sequencing library protocols.

Job Description

Key Responsibilities

  • Contribute to the scientific strategy for AAV capsid engineering, integrating computational and experimental approaches.

  • Analyzing data from directed evolution campaigns and designing AAV capsid libraries using statistical and machine learning models.

  • Custom software development for analysis and machine learning-based engineering of screened capsid sequences and in-silico design of novel capsid variants.

  • Development of new experimental methods for preparing Illumina and long read sequencing libraries in the lab.

  • Preparing summaries of data and present internally to colleagues and management.

  • Drafting SOPs, following protocols, diligently documenting experimental data in electronic lab notebooks, and authorizing scientific reports for internal documentation or regulatory submissions.

  • Represent the team in internal and external scientific forums, driving the development of novel methods and technologies. 

  • Maintaining up-to-date knowledge of emerging technologies and scientific approaches in the field of gene therapy.

  • Mentor colleagues and actively supporting a culture of innovation, learning, respect, and trust

Essential Requirements:

This is a dual posting. The final level & title of the offer role would be determined by the hiring team based on the skills, experience & capabilities required to perform the role at the level the role has been offered (Expert II Data Science or Senior Expert I, Data Science):

For Expert II Data Science:

  • Ph.D. Bioinformatics, Computational Biology, Computer Science (recent Ph.D. welcome to apply) OR MS degree in bioinformatics, computational biology, computer science, structural biology or a related field with 4 years of relevant industry research experience

  • Proficiency in Python and/or R programming languages.

  • Prior experience in analyzing Illumina next-generation sequencing data from run QC, read preprocessing, analysis pipeline, and generation of final reports with interpretation and plots.

  • Experience with workflow management software such as Snakemake or Nextflow.

  • Experience with Amazon Web Services usage for cloud computing workflows and associated programming APIs.

  • Machine learning and statistical model experience with biological sequences.

  • Independent and critical thinker, multi-tasker, and problem solver with strong verbal and written communication skills, and a team-first mindset.

For Senior Expert I, Data Science

  • PhD in Bioinformatics, Computational Biology, Computer Science, or a related field, with a minimum of 5 years of relevant industry experience.

  • Demonstrated scientific leadership and a record of high-impact publications, patents, and external presentations.

  • Advanced proficiency in Python and/or R, with professional software engineering experience (version control, testing, code reviews).

  • Extensive experience in analyzing Illumina and long-read sequencing data, including QC, preprocessing, and pipeline development.

  • Proven ability to design and implement machine learning and statistical models for biological sequence analysis.

  • Knowhow in utilizing protein structure analysis to developing protein design strategies.

  • Prior wet-lab experience in molecular biology and sequencing library preparation.

  • Experience with workflow management tools (e.g., Snakemake, Nextflow), cloud computing (AWS) and HPC.

  • Strong independent and critical thinking, problem-solving skills, and a collaborative, team-first mindset.

  • Excellent verbal and written communication skills, with the ability to engage effectively across scientific and technical disciplines.

Desirable Requirements:

  • Professional software engineering experience with version control, testing, and code reviews is strongly preferred.
  • Familiarity with long read sequencing is a plus.

The salary for this position is expected to range between:

  • Expert II Data Science the pay range for this position at commencement of employment is expected to be between: $103,600 and $192,400/ per year
  • Senior Expert I Data Science pay range for this position at commencement of employment is expected to be between: $119,700 and $222,300/ per year

The final salary offered is determined based on factors like, but not limited to, relevant skills and experience, and upon joining Novartis will be reviewed periodically. Novartis may change the published salary range based on company and market factors.

Your compensation will include a performance-based cash incentive and, depending on the level of the role, eligibility to be considered for annual equity awards.

US-based eligible employees will receive a comprehensive benefits package that includes health, life and disability benefits, a 401(k) with company contribution and match, and a variety of other benefits. In addition, employees are eligible for a generous time off package including vacation, personal days, holidays and other leaves.

To learn more about the culture, rewards and benefits we offer our people click here.

EEO Statement:

The Novartis Group of Companies are Equal Opportunity Employers. We do not discriminate in recruitment, hiring, training, promotion or other employment practices for reasons of race, color, religion, sex, national origin, age, sexual orientation, gender identity or expression, marital or veteran status, disability, or any other legally protected status. 

Accessibility and reasonable accommodations

The Novartis Group of Companies are committed to working with and providing reasonable accommodation to individuals with disabilities. If, because of a medical condition or disability, you need a reasonable accommodation for any part of the application process, or to perform the essential functions of a position, please send an e-mail to or call and let us know the nature of your request and your contact information. Please include the job requisition number in your message.

Salary Range

$103, $192,400.00

Skills Desired

Apache Hadoop, Applied Mathematics, Big Data, Curiosity, Data Governance, Data Literacy, Data Management, Data Quality, Data Science, Data Strategy, Data Visualization, Deep Learning, Machine Learning (Ml), Machine Learning Algorithms, Master Data Management, Proteomics, Python (Programming Language), R (Programming Language), Statistical Modeling

  • San Francisco, California, United States Tessera Data Full time $254,000 - $299,000 per year

    About CheckrCheckr is building the data platform to power safe and fair decisions. Established in 2014, Checkr's innovative technology and robust data platform help customers assess risk and ensure safety and compliance to build trusted workplaces and communities. Checkr has over 100,000 customers including DoorDash, Coinbase, Lyft, Instacart, and...


  • San Diego, California, United States Johnson & Johnson Innovative Medicine Full time $193,000 - $333,500

    At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to...


  • San Jose, California, United States Capital One Full time $225,400 - $280,600

    Senior Manager, Data Science - GenAI Digital AssistantData is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer using statistical modeling and the relational database, cutting edge technology in 1988 Fast-forward a few years, and this little innovation and our passion...


  • San Diego, California, United States Apple Full time

    Do you have a passion for invention and self-challenge? Do you thrive with pushing the limits of what's considered feasible? As part of a best-in-class modem team, you'll craft sophisticated, innovative embedded firmware that delivers more performance in our products than ever before. You'll work across teams to transform improved hardware elements into a...


  • San Diego, California, United States Fleet Science Center Full time $28,000 - $42,000 per year

    Job Details Fleet Science Center - San Diego, CA Seasonal/Temporary $ $20.50 Hourly Description General Statement:The Science Communicator supports the Fleet Mission Statement and Visitor Experience Philosophy by providing educational programming designed for school groups and public audiences both at the Fleet and out in the community. This...

  • Data Science Intern

    3 days ago


    San Francisco, California, United States Mercor Full time $100,000 - $1,500,000 per year

    About MercorMercor is at the intersection of labor markets and AI research. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone....


  • San Diego, California, United States PlayStation Global Full time $186,000 - $279,000 per year

    Why PlayStation?PlayStation isn't just the Best Place to Play — it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed PlayStation software titles from PlayStation Studios, and...


  • San Francisco, California, United States MathCo Full time $150,000 - $250,000 per year

    About UsTheMathCompany (MathCo) is a global Enterprise AI and Analytics company trusted by leading Fortune 500 and Global 2000 enterprises for data-driven decision-making. Founded in 2016, MathCo builds custom AI and advanced analytics solutions that solve complex business challenges through our innovative hybrid model. NucliOS, MathCo's proprietary...


  • San Jose, California, United States PayPal Full time $120,000 - $250,000 per year

    Lead the execution of high-impact data science projects that drive business transformation and innovation. Collaborate with senior executives to integrate data science into strategic planning and decision-making. Oversee the development of advanced data science capabilities, including machine learning and AI. Ensure the highest standards of data governance,...


  • San Francisco, California, United States OpenAI Full time $200,000 - $240,000 per year

    About the TeamThe Strategic Finance team at OpenAI plays a critical role in shaping the company's long-term trajectory. We partner closely with Product, Engineering, and Go-to-Market teams to inform high-stakes decisions through rigorous data science and economic modeling.As part of our expanding Data Science function, we're building a best-in-class...