Key Responsibilities:
- As a Principal Data Scientist and R Developer within the Data Science team your responsibility will be to help solve business challenges within Biostatistics and our partner organizations and use a variety of Data Science tools and methods to deliver Proof of Value initiatives.
- Innovation/Technical Knowledge.
- The diverse Data Science team consists of subject matter experts from a variety of technical backgrounds and this is reflected in the broad range of initiatives that we support.
- Our deliverables can take the form of a training course/webinar, a statistical model, an R package, an application, or even a computing environment.
- This technical role will support in the creation of these deliverables and therefore requires a broad basis of technical knowledge, encompassing aspects of application development, DevOps, statistics and machine learning.
- Leadership and Teamwork.
- You will provide technical input and direction on various initiatives from pilots through to larger production deliveries.
- This includes close collaboration with our business partners to identify possible solutions and deliver the expected business value within the agreed timelines.
- You will receive mentorship from senior team members, and work with partners from the wider Biostatistics community as applicable.
- Capability Development. In order to deliver SDS-IH’s capability development objective, it is expected that you are proactive in promoting Data Science tools and methods both internally and externally.
- This includes active participation and engagement with colleagues on internal social media channels.
- You will therefore be expected to keep up to date with the latest tools and methods coming from the Data Science community and work with SDS-IH technical experts to ensure clear messaging for the wider Biostatistics and Data Science communities.
Basic Requirements:
We are looking for professionals with these required skills to achieve our goals:
- MSc (preferred) or BSc degree (or equivalent) in STEM subjects (e.g. Computer Science, Machine Learning, Artificial Intelligence, Statistics, Bioinformatics, Engineering, Mathematics, Physics, Chemistry)
- Practical experience in delivering Data Science products/solutions to business stakeholders
- Familiarity with the Programming Language R
- Developing analytic scripts and software in R
- Familiarity with the R package landscape, in particular tidyverse, table and plotting packages
- Familiarity with common Data Science tools/practices like the RStudio and git/GitHub
- Experience of Pharma R&D, including drug development (preferred) or related healthcare industry, including clinical trial operations and regulatory compliance
- Excellent written, data visualization and verbal communication skills and demonstrated ability to effectively communicate complex technical concepts to different audiences
- An unbiased approach to solving problems with Data Science
- The ability to solve complex problems both analytically and programmatically
Preferred Skills:
- The following characteristics are desirable:
- Knowledge or experience with R package development, Shiny App development, R data structures and object systems (eg S3, S4, R6), CI/CD, and data storage formats and technologies including relational database
- An awareness of other programming languages such as Python, SQL, and SAS
- Experience presenting complex concepts to non-experts
- The "forever-student" mentality, continuously looking for opportunities to learn, build skills and share learning both internally and externally.