Description:
Recruiters please do not reach out to me, we want to hire this role directly. DataFrontier Innovations pvt ltd
Data Scientist (Early Career) - Healthcare AI & Agentic SystemsLocation: Bangalore (Hybrid)Experience: 2-4 yearsCompany: DataFrontier Innovations Pvt. Ltd.Email:
About DataFrontier
DataFrontier builds advanced AI, data engineering, and analytics systems that solve real-world, high-impact problems - especially in healthcare and enterprise environments.
We are currently working on: • Falls prediction systems for elderly care using clinical and behavioral data • Agentic AI systems that automate workflows and decision-making • Production-grade data platforms handling sensitive, regulated datasets
If you want to build systems that actually get used - not just models that sit in notebooks - this role is for you.
Role OverviewWe are looking for a hands-on Data Scientist who can work across the full lifecycle - from data understanding to model deployment - and is excited about applying AI in healthcare and automation.
This is not a pure research role. You will be building, shipping, and improving real systems.
What You'll Work On • Build and improve predictive models for falls risk using clinical, behavioral, and time-series data • Design features from messy real-world healthcare datasets (EHR, sensor data, logs) • Work on Agentic AI pipelines (LLMs + tools + workflows) to automate decision systems • Develop and test models like XGBoost, Random Forest, time-series models, and hybrid approaches • Collaborate with data engineers to build robust pipelines and feature stores • Implement model explainability (SHAP, feature importance) for clinical usability • Evaluate data quality and completeness for new customers (critical for healthcare deployments) • Work closely with product and clients to translate business problems into ML solutions
Required Skills • Strong fundamentals in Python (pandas, numpy, scikit-learn) • Experience with ML models (classification, regression, tree-based models) • Solid understanding of feature engineering and data preprocessing • Familiarity with SQL and working with structured datasets • Basic understanding of model evaluation metrics and validation techniques • Exposure to real-world datasets (not just Kaggle-level clean data)
Good to Have (High Impact) • Experience with healthcare data / EHR / clinical datasets • Exposure to LLMs / LangChain / agentic frameworks • Knowledge of time-series modeling • Experience with cloud platforms (AWS / GCP) • Understanding of data pipelines / ETL workflows • Familiarity with model deployment or APIs
What We're Looking For • Someone who can think, not just code • Comfortable working with unclean, incomplete, real-world data • Willing to own problems end-to-end, not just tasks • Curious about AI beyond standard ML - especially agentic systems • Strong communication skills - ability to explain models to non-technical stakeholders
What This Role Is NOT • Not a pure research role • Not a "train model and forget" role • Not limited to notebooks - you will work on production systems
Why Join Us • Work on real healthcare impact problems • Exposure to international clients and deployments • Build next-gen systems in Agentic AI + predictive analytics • Small team high ownership and fast growth
How to ApplySend your resume + 2-3 projects (GitHub / case studies) to:
Subject: Data Scientist - Early Career Application
If you're looking for comfort, this role is not for you.If you're looking to build something meaningful, let's talk.