Skip to main content

Data Scientist

Technology
8byte
Bengaluru, India1 months agoUntil 21/5/2026
Full timeOn-site

Job description

About 8byte

8byte orchestrates data, AI models, and compute pipelines to automate Financial Services processes at scale. We feature intelligent LLM routing, real-time data streamlining, and a unified framework for integrating search, classification, and personalization tasks, enabling banks, VCs, and Private Equity firms to deploy AI across decision workflows with ease.

Backed by experienced founders and powered by a team of AI/ML experts, we deliver the infrastructure needed to stay ahead in a rapidly evolving, data-driven landscape. Our clients span the BFSI sector across India and GCC markets.

Role Overview

We are looking for a sharp, ownership-driven Data Scientist to join our engineering team as a core full-time hire. This is not a support role, it is a seat at the table. You will independently own data querying, model development, and pipeline integration across real production environments, working directly alongside our senior AI/ML engineers and founders.

If you are someone who thrives on getting into messy, real-world data and making sense of it fast, we want to talk.

Data Extraction and Querying

  • Own and independently run complex SQL queries across multiple production databases.
  • Design, write, and optimize queries for accuracy, performance, and scalability.
  • Build reusable query templates and maintain a clean library of data extraction scripts.

Data Exploration and Analysis

  • Lead end-to-end data cleaning, preprocessing, and EDA on large financial datasets.
  • Surface insights and anomalies proactively, not just when asked.
  • Produce MIS reports and data summaries for internal and client-facing use.

Model Development

  • Build, fine-tune, and evaluate ML models including LLMs and NLP-based classification tasks.
  • Manage the full model lifecycle: experimentation, validation, versioning, and production readiness.
  • Apply supervised, unsupervised, and deep learning approaches to BFSI-specific problem statements.

Pipeline Integration

  • Collaborate with engineering to integrate models into real-time data pipelines.
  • Ensure data flows are reliable, well-monitored, and production-grade.

Research and Applied Innovation

  • Stay current on relevant AI/ML research and evaluate applicability to 8byte's core framework.
  • Propose and prototype new techniques where the evidence supports it.

Documentation and Knowledge Management

  • Maintain rigorous documentation of experiments, model versions, data schemas, and query logic.
  • Contribute to internal knowledge bases so the team is never dependent on one person's memory.

Technical Skills Required

  • Strong Python proficiency across data libraries: Pandas, NumPy, Scikit-learn, and related tooling.
  • Professional-level SQL: joins, window functions, subqueries, query optimization, and cross-database work.
  • Solid grounding in supervised and unsupervised learning, neural networks, and NLP concepts.
  • Comfort with Jupyter Notebooks and Git-based version control workflows.
  • Strong grasp of statistics, probability, and linear algebra as they apply to model development.
  • Clear written and verbal communication in English, including the ability to explain technical findings to non-technical stakeholders.

Preferred Qualifications

  • Degree in Computer Science, Data Science, Statistics, or a closely related field.
  • 2+ years of hands-on experience in a data science or data analytics role, preferably in fintech or BFSI.
  • Demonstrated portfolio of projects or competition experience (Kaggle or equivalent) showing real data problem-solving.
  • Familiarity with financial data structures, credit workflows, or lending processes is a meaningful advantage.
  • Prior exposure to LLM-based pipelines, RAG architectures, or production ML systems is a strong plus.

What You Will Get

  • Ownership from day one across production-grade AI pipelines used by live financial institutions.
  • Direct mentorship from experienced AI founders, not filtered through layers of management.
  • Competitive compensation benchmarked to industry standards, reviewed based on impact.
  • Fast career growth in an early-stage startup where your contributions are visible and counted.
  • A high-ownership, no-fluff culture in the heart of Bengaluru where results matter more than optics.

WORK MODE

This is a full-time, in-office position based at our Bengaluru site. We believe in-person collaboration is the fastest, most effective way to build in an early-stage environment. Remote or hybrid arrangements are not available for this role.

Skills: models,it,ml,model development,pipelines,data,data extraction

Keywords
monthsOfExperience: 24OCamlScikit-learnUnified frameworkScalabilityNumPyPythonSqlDeep learningToolchainKaggleGit

¿Te interesa este puesto?