Data Scientist
Publiée le : 25/06/2026
Kuala Lumpur Federal Territories Malaisie
CDI
Energie / Environnement
Data Scientist
Role Summary
We are seeking a Senior Data Scientist to design, develop, deploy, and monitor AI solutions across Generative AI, computer vision/OCR, and predictive analytics use cases. The successful candidate will work closely with business stakeholders to translate operational challenges into scalable AI products and data-driven solutions.
Key Responsibilities
Generative AI & Knowledge Solutions
-
Design and implement Retrieval-Augmented Generation (RAG) systems using open-source Large Language Models.
-
Develop prompt engineering strategies and evaluation frameworks to improve response quality, faithfulness, and citation accuracy.
-
Deploy and optimize GenAI solutions on cloud platforms such as Azure Machine Learning.
-
Establish governance, monitoring, and cost-control mechanisms for AI applications.
AI-Powered OCR & Intelligent Automation
-
Build OCR and document intelligence solutions for extracting structured information from images and scanned documents.
-
Develop validation frameworks using confidence scoring, schema validation, and automated quality checks.
-
Design and maintain production-grade APIs and microservices using FastAPI.
-
Deploy and monitor AI inference services on Vertex AI or equivalent platforms.
Machine Learning & Analytics
-
Perform feature engineering, model development, and model evaluation for predictive analytics use cases.
-
Build scalable data processing pipelines using Databricks and PySpark.
-
Implement data quality controls and model monitoring processes.
-
Translate model outputs into actionable business insights and decision-support tools.
Platform & Engineering
-
Develop and maintain MLOps workflows for deployment, monitoring, and lifecycle management.
-
Collaborate with data engineers, business analysts, and domain experts.
-
Ensure security, scalability, reliability, and compliance of AI systems.
-
Document technical designs, experiments, and deployment processes.
Required Qualifications
-
Bachelor's degree in Computer Science, Data Science, Engineering, Mathematics, or related discipline.
-
4+ years of experience in AI/ML engineering, data science, or related fields.
-
Strong proficiency in Python and SQL.
-
Experience with machine learning frameworks such as Scikit-learn.
-
Experience with Generative AI technologies, RAG architectures, and prompt engineering.
-
Hands-on experience with Azure ML, Vertex AI, Databricks, or similar cloud platforms.
-
Experience building APIs using FastAPI or equivalent frameworks.
-
Strong understanding of model evaluation, monitoring, and MLOps practices.
Preferred Qualifications
-
Experience with LangChain, LlamaIndex, vector databases, and embeddings.
-
Experience with OCR, document AI, or computer vision solutions.
-
Experience with PySpark and large-scale data processing.
-
Knowledge of CI/CD, Docker, and cloud-native deployment patterns.
-
Experience working in regulated industries such as utilities, government, or financial services.
Key Technologies
Python, SQL, Azure ML, Vertex AI, Databricks, PySpark, FastAPI, Scikit-learn, Open-source LLMs, RAG, LangChain, LlamaIndex, Docker, REST APIs, MLOps.