Data Science Intern

Stanford Health Care, Redwood City, California
If you're ready to be part of our legacy of hope and innovation, we encourage you to take the first step and explore our current job openings. Your best is waiting to be discovered.

Job Summary

The Research IT team in Stanford Medicine Technology and Digital Solutions (TDS) is seeking a data science intern. Our group works on cutting-edge cloud technology to create and maintain clinical datasets that support the work of thousands of researchers across the School of Medicine.

We are a diverse, interdisciplinary team of software engineers and application developers with a passion for science, technology, education, and innovation. We believe that diverse teams are foundational to building better products and services. The intern will be placed with our biomedical informatics R&D team, which works closely with researchers and software professionals. Among other things, this team works on the STARR-OMOP research clinical data warehouse ( https://med.stanford.edu/researchit/news/CDW-reimagined.html ). This database includes more than 100 million de-identified clinical notes and billions of concepts extracted from them using a scalable text mining pipeline. Our work with cloud technologies has enabled us to scale our operations to process this large amount of data cost-effectively.

Our text de-identification pipeline, TiDE, is the fastest one reported in the literature and uses state-of-the-art privacy-preserving methods such as "hiding in plain sight" ( https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3638183/ ). You will join us in our efforts to improve TiDE. The project will be tailored to your background and experience. Sample projects include improving named entity recognition by incorporating state-of-the-art deep learning methods, generating synthetic notes ( https://arxiv.org/abs/1803.02728 ), or generating note summaries ( https://arxiv.org/abs/2003.11474 ).

We expect you to have a background in programming a deep learning pipeline for clinical text (e.g., https://www.coursera.org/learn/sequence-models-in-nlp#syllabus ) using Python (e.g., https://www.coursera.org/learn/python-data-analysis?specialization=data-science-python#syllabus ). You will join the team in its day-to-day work, gaining experience in how we develop, deploy, and run industrial-grade data pipelines. While your experience is important, it is even more important that you have an enthusiasm for learning ( https://www.coursera.org/learn/learning-how-to-learn#syllabus ). You will learn about state-of-the-art infrastructure ( https://cloud.google.com/bigquery ) and tools (e.g., https://jupyter.org/ ), including modern code management tools ( https://github.com/ohdsi-studies ), project management tools (e.g., https://www.atlassian.com/software/jira ), and Agile methodology ( https://www.scrumguides.org/ ). Our team has worked effectively in a remote setting since before COVID-19. We use modern enterprise collaboration tools such as Google Workspace, Slack, and Zoom.

You can learn more about Research IT at our group’s website ( https://med.stanford.edu/researchit.html ). You can learn more about TDS on our organization's website ( https://med.stanford.edu/tds.html ). You can learn more about the latest at Stanford Medicine by visiting the news site ( https://med.stanford.edu/news.html ).

Essential Functions

Learn how to program a deep learning pipeline for clinical text using Python.

Learn how to work with clinical data using best practices to ensure the protection of patient privacy.

Draft project related documents including specifications and results summaries.

Draft a scientific article describing the work developed during the internship.

Track and ensure proper configuration of issues and tasks using Jira project management software.

Minimum Qualifications

Education:

Bachelor’s degree from an accredited College or University

Entering or enrolled in a graduate program at an accredited College or University

Have enough time to commit to a minimum of 8 consecutive weeks

Preferred Experience:

Prior coding experience with Python or R.

Excellent critical thinking and problem-solving skills

Required Knowledge, Skills and Abilities

Ability to learn a new topic with enthusiasm and discipline

Ability to work on a team and meet expectations where others depend on you

Ability to maintain confidentiality of sensitive material

Ability to communicate effectively and ask questions with confidence.

Research and/or publication experience

Equal Opportunity Employer

Stanford Health Care (SHC) strongly values diversity and is committed to equal opportunity and non-discrimination in all of its policies and practices, including the area of employment. Accordingly, SHC does not discriminate against any person on the basis of race, color, sex, sexual orientation or gender identity and/or expression, religion, age, national or ethnic origin, political beliefs, marital status, medical condition, genetic information, veteran status, or disability, or the perception of any of the above. People of all genders, members of all racial and ethnic groups, people with disabilities, and veterans are encouraged to apply. Qualified applicants with criminal convictions will be considered after an individualized assessment of the conviction and the job requirements.

At Stanford Health Care, we seek to provide patients with the very best in diagnosis and treatment, with outstanding quality, compassion and coordination. With an unmatched track record of scientific discovery, technological innovation and translational medicine, Stanford Medicine physicians are pioneering leading edge therapies today that will change the way health care is delivered tomorrow.

As part of our spirit of discovery, we also leverage our deep relationships with luminary Silicon Valley companies to develop new ways to deliver preeminent patient care.

Learn about our awards ( https://stanfordhealthcare.org/about-us/awards.html ) and significant events ( https://stanfordhealthcare.org/about-us/our-history.html ).
