Data Scientist – LLM

Hudson Yards, New York

  Machine Learning



Our client is currently seeking a seasoned individual contributor with a strong background in Large Language Models (LLM) and Generative AI to join our recently established Data Science Lab. In this role, you will be instrumental in crafting advanced data science solutions that leverage the capabilities of machine learning and artificial intelligence, driving innovation across diverse business lines and products at the enterprise level. Collaborating closely with Data Science Tech Leads, you will actively contribute to impactful and highly visible projects, delivering AI/ML solutions that undergo rigorous market testing and deployment, thereby influencing risk management and overall financial performance. Successful candidates will bring industry-specific expertise, a genuine enthusiasm for applying state-of-the-art ML and AI insights, and the ability to design and implement data science capabilities that promote growth, competitive advantage, and customer satisfaction.

Key Responsibilities:
Develop capabilities in Deep Learning, Large Language Models (LLM), and Generative AI:

  • Design and create high-quality prompts and templates to guide LLM behavior and responses. Craft prompts to extract specific information or control the model's output, ensuring accuracy, relevance, and language fluency. Optimize prompts to enhance user interactions and system performance.
  • Map and mine unstructured data from sources such as insurance contracts, medical records, sales notes, and customer servicing logs.
  • Implement AI/ML solutions, including but not limited to improving underwriting risk assessment, claims auto adjudication, and customer servicing.
  • Conduct large-scale experiments, ranging from unsupervised pre-training to fine-tuning, retrieval augmentation, and prompt engineering.
  • Evaluate LLM models through statistical tests, business metrics, and assessments of bias and other regulatory considerations.

Support and contribute to building the Data Science Lab (DSL):

  • Support the development of use cases, including initial data exploration, project/sample design, data reception and processing, analysis, modeling, and the creation of final reports/presentations.
  • Perform data wrangling, data matching, and ETL processes to explore diverse data sources, gain data expertise, conduct summary analyses, and prepare modeling datasets.
  • Utilize advanced statistical and AI/ML techniques to develop high-performing predictive models and conduct creative analyses to address business objectives and partner needs.
  • Identify source data and conduct data quality checks, both in model/solution development and during production.
  • Collaborate with Data Engineers and MLOps for the packaging and deployment of models/solutions.

Contribute to the overall Data Science organization:

  • Collaborate with cross-functional teams comprising Data Science, Data Engineering, and Business groups.
  • Contribute to the standardization of Data Science tools, processes, and best practices.

Who You Are:
You have a strong passion for staying at the forefront of technology and are enthusiastic about applying the latest AI/ML algorithms and methodologies. You are characterized by analytical rigor, intellectual curiosity, and a proven track record of leading the creation and execution of data and analytic solutions to address complex business challenges. Your satisfaction comes from collaborating with fellow data scientists to tackle challenging problems using AI/ML, and witnessing the successful deployment of solutions in the market, delivering tangible value to the company. You thrive in working within a multi-disciplinary team, engaging with data engineers, business analysts, software developers, and functional business experts, as well as collaborating with business leaders.
What you will have:

  • Hold a PhD or Master's degree in Computer Science, Data Science, Statistics, Mathematics, or a related field.
  • Possess foundational experience in data analysis and statistical modeling.
  • Demonstrate a robust theoretical understanding of probability and statistics.
  • Exhibit expertise in deep learning models, encompassing Large Language Models (LLM), Prompt Engineering, and Natural Language Processing (NLP).
  • Have hands-on proficiency in utilizing GPUs, distributed computing, and implementing parallelism in Machine Learning solutions.
  • Showcase advanced programming skills in Python, with a focus on PyTorch and/or Tensorflow.
  • Possess a solid foundation in algorithms and a diverse range of Machine Learning models.
  • Display excellent communication skills and the ability to collaborate cross-functionally with Product, Engineering, and other teams, both at a leadership and hands-on level.
  • Demonstrate exceptional analytical and problem-solving abilities with meticulous attention to detail.
  • Exhibit proven leadership by providing technical guidance and mentorship to data scientists, coupled with strong management skills for monitoring and tracking performance, contributing to enterprise success.


  • 2-3 Days a week at their NYC location


  • Up to $146,000 per year plus bonus