Senior Data Scientist - NLP

Employment Type: Full-Time


: Miscellaneous

Job Description

The Gilead Data Science team within Pharmaceutical Development and Manufacturing (PDM) is seeking a senior data scientist with experience in the development, implementation, evaluation, and operationalization of Natural Language Processing (NLP) technology in the areas of semantic search, topic modeling, text classification, and other machine learning applications. In addition to NLP experience, competency with common Python machine learning frameworks, SQL query skills, and an understanding of the AWS platform and big data technologies are highly desirable. Advanced knowledge of Python ML design patterns and algorithms is required.

The senior data scientist will work with various technical roles on the data science team, including architects, data engineers, data analysts, and product managers, to scope, develop, and operationalize AI-driven applications by providing expertise in model development and operationalization. The senior data scientist will also interact directly with internal business customers in an advisory role.

Responsibilities

- Research and develop NLP algorithms to address a variety of text analytics needs as part of an analysis
- Query data from a variety of systems, including file storage (S3) and SQL-based systems, to develop operational NLP pipelines
- Operationalize NLP algorithms in both search and classification contexts within custom applications and frameworks
- Customize and develop NLP solutions using off-the-shelf systems from platform vendors
- Interact with other members of the data science team in a cross-functional way to realize operational solutions
- Interact directly with internal business stakeholders to gather requirements and develop solutions

Education Requirements

- Minimum Bachelor's degree with 5 years of experience, or Master's/PhD with 2 years of experience, in data science at large biotech or technology companies
- Degree in engineering or technology areas including but not limited to data science, software engineering, biomedical engineering, chemical engineering, mechanical engineering, or similar

Technical Skill Requirements

- Operationalization of natural language processing (NLP) algorithms at scale
- Understanding of NLP model evaluation and implementation
- Python, SQL
- Sklearn, PyTorch, NLTK, Gensim, or other NLP packages
- Cloud DevOps on AWS related to data science operations (preferred)

Project Skill Requirements

- Translating customer needs into technical requirements
- Scoping project requirements
- Code management using Git
- Project management

