2023-12-05T12:52:16-05:00 X, The Moonshot Factory
Scientific Machine Learning SWE - Early Stage Project, X
Software Engineering
Mountain View, CA

X is Alphabet’s moonshot factory. We are a diverse group of inventors and entrepreneurs who build and launch technologies that aim to improve the lives of millions, even billions, of people. Our goal: 10x impact on the world’s most intractable problems, not just 10% improvement. We approach projects that have the aspiration and riskiness of research with the speed and ambition of a startup.

About the team

We are multidisciplinary scientists and engineers dedicated to creating transformative computational tools for Synthetic Biology. Our moonshot is to accelerate discovery and bio-innovation for healthier people and a healthier planet. 

About the role

We’re looking for a machine learning scientist/engineer to work with our computational biologists and experimentalists on data-driven methods for the design and optimization of proteins. Our goal is to radically improve the state of the art in applied protein engineering by developing new computational methods and applying recent advances such as AlphaFold and protein LLMs. This role combines modern engineering, foundational machine learning work, and biologically-motivated design, and is central to the team’s impact.  We are looking for someone that’s passionate about applying ML on real world problems, not SOTA chasing on public benchmarks.

The ideal team member will demonstrate scrappiness, creativity, and drive to “make it happen” by iterating rapidly and adapting state-of-the-art methods to team and customer needs. This teammate will help us keep our finger on the pulse of the real-world by ensuring we are continually adding value to our customers. They will thrive in an organization that rapidly iterates, evolves, and leans into ambiguity.

How you will make 10x impact

  • Identify, track, and summarize developments in the field of machine learning applied to protein function discovery and enzyme design, through literature reviews, evaluations, and adaptations of known and new methods to the team's needs and goals
  • Develop novel methods for applying machine learning at scale to custom data sets, to model and predict protein structure, function, and properties. Propose, design, and implement models for optimizing biomolecular sequences for enzyme functions such as folding and secretion
  • Achieve the best results in domains with scarce data: finetuning foundation models, leveraging active learning for optimal experiment designs, or figure out how to combine literature data with new experiments
  • Understand customer needs, data, and metrics, and translate them into joint work with scientists, engineers, and experimentalists on the team. Play a central role in informing and prioritizing new product features and team emphasis
  • Make business recommendations with effective presentations of findings/insights to stakeholders at various levels, as well as to external partners

What you should have

  • PhD degree in a relevant field (Computer Science, Bioinformatics, Computational Biology) or equivalent practical experience
  • Deep expertise in modern neural network architectures such as transformers, VAE, energy based models
  • Experience with state-of-the-art machine learning approaches applied to biological sequence data (RNA, DNA, proteins) in the molecular and cell biology domains
  • Experience with general-purpose programming languages (preferably Python) in a production environment. Comfort with modern software engineering practices such as version control, code reviews, unit testing, and continuous integration
  • Familiarity with modern deep learning frameworks such as JAX, Pytorch and Tensorflow
  • Excellent communication skills and aptitude for working with cross-functional teams

 It’d be great if you also had these

  • Experience working with or evaluating deep neural networks applied to protein prediction tasks.  For example: using LORA for foundation model fine-tuning, or, leveraging domain adaptation for handling batch effects from noisy heterogeneous data
  • Experience with additional data-driven tasks in biochemistry, such as metabolic engineering or genomics
  • Industry experience in small teams building solutions in the molecular biology domain
  • Experience with Bayesian search or active learning approaches for optimal experiment design
  • Excellent communication and project management skills, with a track record of handling multiple customers and streams of work

The US base salary range for this full-time position is $157,000 - $243,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits.

At X, we don't just accept difference - we celebrate it, we support it, and we thrive on it for the benefit of our employees, our products and our community. We are proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

If you have a disability or special need that requires accommodation, please contact us at: x-accommodation-request@x.team.

Apply Now

Fields marked with * are required

Cover Letter
This position is located in Mountain View, CA. Are you able to commute to the office and/or are you willing to relocate?*
Are you currently or were you previously an Alphabet employee, contractor, or intern?*
Application consent for X

By clicking the “I Accept” button you expressly give your consent for the collection and use of your information as described at https://www.google.com/about/careers/privacy/.  I consent to the processing of my information as described in that policy including that, in limited circumstances, my information may be shared with trusted third parties to assist in certain phases of the hiring process (such as conducting background checks).

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

Voluntary Self-Identification of Disability

Form CC-305 Page 1 of 1 OMB Control Number 1250-0005 Expires 04/30/2026
Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Disability Status

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Veteran Status

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in X, the moonshot factory’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Something went wrong. Please try again later.

Thank you for applying to X, the moonshot factory!

Your application has been received. We appreciate your interest in the open roles at X. Our team will review your materials and if there is a fit, a member of our team will contact you with more information.