Senior Data Scientist - NLP
Employment Type
: Full-Time Industry
: Miscellaneous
Job Description
Senior Data Scientist - NLP
The Gilead Data Science team within Pharmaceutical Development and Manufacturing (PDM) is
seeking a senior data scientist with experience in the development, implementation, evaluation,
and operationalization of Natural Language Processing (NLP) technology in the areas of
semantic search, topic modeling, text classification, and other machine learning applications. In
addition to experience with NLP, understanding and competency with common Python machine
learning frameworks, query skills with SQL, and understanding of the AWS platform and big
data technologies is highly desirable. Advanced knowledge of Python ML design patterns and
algorithms will be necessary.
The senior data scientist will interact with various technical roles in the data science team
including architects, data engineers, data analysts, and product managers to scope, develop,
and operationalize AI-driven applications by providing expertise in model development and
operationalization. The senior data scientist will also interact directly with internal business
customers in an advisory role.
Responsibilities
Research and develop NLP algorithms to address a variety of text analytics needs as
part of an analysis
Query data from a variety of systems including file storage (S3) and SQL-based systems
to develop operational NLP pipelines
Operationalize NLP algorithms in both search and classification context within custom
applications and frameworks
Customize and develop NLP solutions using off-the-shelf systems from platform vendors
Interact with other members of the data science team in a cross-functional way to realize
operational solutions
Interact directly with internal business stakeholders to gather requirements and develop
solutions
Education Requirements
Minimum Bachelor's degree with 5 years of experience or Masters/PhD with 2 years of
experience in data science at large biotech or technology companies.
Degree in engineering or technology areas including but not limited to data science,
software engineering, biomedical engineering, chemical engineering, mechanical
engineering, or similar.
Technical Skill Requirements
Operationalization of natural language processing (NLP) algorithms at scale
Understanding of NLP model evaluation and implementation
Python, SQL
Sklearn, PyTorch, NLTK, Gensim or other NLP packages
Cloud DevOps on AWS related to data science operations (preferred)
Project Skill Requirements
Translating customer needs into technical requirements
Scoping project requirements
Code management using Git
Project management