Job Title: Lead Data Scientist
Location: Roanoke, VA (Hybrid/Remote Considered)
Job Type: Full-Time
Salary: Competitive, Based on Experience
Job Summary
We are seeking a Lead Data Scientist to design, develop, and optimize AI-driven data models that power our AI-based retrieval and decision-support systems. This role requires deep expertise in machine learning, NLP, and data science methodologies, along with leadership capabilities to guide data teams in building scalable, high-performance AI solutions.
The Lead Data Scientist will collaborate closely with AI engineers, software developers, and technical leaders to develop innovative AI models that deliver insightful and actionable intelligence for enterprise and healthcare applications.
Key Responsibilities
- Develop and refine machine learning models, NLP pipelines, and LLM-based AI solutions to support document retrieval and decision-making.
- Lead research efforts in RAG, embeddings, vector search, and hybrid AI architectures.
- Implement data preprocessing, feature engineering, and model optimization strategies.
- Design and optimize knowledge graphs and vector databases for structured and unstructured data retrieval.
- Collaborate with AI engineers and software developers to integrate models into production environments.
- Conduct A/B testing, model evaluation, and performance tuning to improve AI accuracy and efficiency.
- Drive best practices for data governance, model explainability, and bias mitigation.
- Lead data-driven research to enhance the capabilities of AI-powered retrieval and document processing.
- Ensure compliance with data privacy and security regulations (e.g., HIPAA, SOC 2).
- Mentor data scientists and provide guidance on ML model development and deployment strategies.
Required Qualifications
- Bachelor’s degree in Data Science, Computer Science, Mathematics, or a related field from an accredited institution.
- 5+ years of experience in data science, machine learning, or AI model development.
- Strong expertise in ML frameworks (TensorFlow, PyTorch, Scikit-learn) and NLP techniques (BERT, GPT, Transformer models).
- Proficiency in Python, R, or other data science programming languages.
- Experience with vector databases (Pinecone, FAISS, Weaviate, or similar) and knowledge graph applications.
- Expertise in statistical modeling, data visualization, and machine learning algorithms.
- Familiarity with big data processing frameworks (Spark, Dask, Hadoop) and cloud-based AI deployments (AWS, Azure, GCP).
- Strong understanding of data pipelines, ETL processes, and MLOps best practices.
- Experience working in an Agile development environment with cross-functional teams.
Preferred Qualifications
- Master’s or PhD in Data Science, AI, Machine Learning, or a related field.
- Experience working with healthcare data (Medicaid, Medicare, EHR systems).
- Knowledge of regulatory compliance for AI-driven decision support in healthcare.
- Familiarity with cybersecurity best practices for AI systems.
- Experience with reinforcement learning, unsupervised learning, and generative AI models.
Why Join Ginkgo-AI?
- Lead AI innovation in enterprise and healthcare data science.
- Work with a highly skilled, collaborative team in a growing company.
- Competitive salary and benefits package with professional development opportunities.
- Be part of a company shaping the future of AI-driven enterprise and healthcare solutions.
Ready to build the next generation of AI-driven insights? Apply today!