Hi,
Hope you are doing well.
This is Priya from My3 Tech.
Review the job description below and let me know your interest by replying to this email with an updated resume and a convenient time to discuss also you can reach me (or) through email at Priya@My3Tech.com
NLP Data Scientist
Length: Contract to Hire
Onsite twice monthly in Pittsburgh, PA (So candidates need to currently reside within a 2–3-hour radius)
C2H opportunity so candidates must be able to convert.
Interview process is 3 rounds, and first rounds will likely take place on Friday, 9/27.
Internal Notes:
The need for the role:
- Core focus: to look at EMR data and make sense of it
- Expanding/building new platform to facilitate identification of data of UPMC data that can then be given to researchers at Pitt, CMU, etc, and do research promoting the innovation
- Critical part of this pipeline process is to de-identification - need to leverage NLP tools
- New position within an expanding team
Education Requirements:
- PhD or MD (Medical Doctor) degree is HIGHLY preferred. Bachelor’s degree is NOT sufficient.
- Could consider Master’s Degree candidates if they have an additional 5 years of real-world experience
- Computer Science or Linguistics majors are highly preferred
Must have qualifications:
- De-identification background
- Data scientist who is very familiar with ML and NLP and to some degree specialized in NLP modeling
- Healthcare, Hospital or Clinical Data industry experience not just someone coming off of academia
Technology:
- Python and SQL – bare minimum
JOB DESCRIPTION:
NLP Data Scientist, Analytics & Informatics
Executive Summary
In support of the Data Platform as a Service, we are looking to hire a (NLP) Data Scientist within our Analytics & Informatics team. This role has been pre-approved by HR and is crucial for advancing our mission to operationalize DPaaS.
Role Overview
The Data Scientist will play a vital role in providing insights for the cohort identification and manage the use of two vendor solutions, John Snow Labs and QuantPi. They will develop and implement advanced analytics techniques, and predictive modeling methodologies as part of the DPaaS initiative.
Key Activities
- Text Data Collection and Preprocessing: Gathering, cleaning, and preprocessing text data from unstructured notes as part of DPaaS. This includes tasks such as tokenization, stemming, and handling text-specific challenges like dealing with different languages, slang, and abbreviations.
- Model Development and Training: Designing, building, and training NLP models for de-identification of EMR data using techniques such as text classification, sentiment analysis, named entity recognition, machine translation, and language generation. This involves selecting appropriate algorithms, feature engineering, and fine-tuning model parameters.
- Data Analysis and Insights: Analyzing data to extract meaningful insights and patterns that can inform business decisions. This includes performing statistical analysis, visualizing data, and communicating findings to stakeholders. Staying updated with the latest research and integrating state-of-the-art methods into projects.
- Evaluation and Optimization: Evaluating NLP models using appropriate metrics (e.g., precision, recall, F1 score) and techniques (e.g., cross-validation, confusion matrix). Continuously optimizing models to improve accuracy, efficiency, and scalability.
Qualifications: PhD/MD or equivalent in data science, statistics, or computer science; 5+ years of experience in NLP model development, validation and deployment, with managerial experience.
Priya
Sr Technical Recruiter
1601 N Harrison Ave, STE # 2B, Pierre, SD 57501
| priya@my3tech.com
W: www.my3tech.com
Certified Minority Business Enterprise (MBE)
An E-Verify Company
DISCLAIMER: The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer or if you want to be REMOVED please reply with REMOVE in the Subject line of this email.