RDU / PORTFOLIO
● AVAILABLE FOR CONSULTING · BASED IN ABU DHABI, UAE · GMT+4 · EST. 2016 / 10 YEARS / 9 ROLES

REZA DWI UTOMO

I am a Senior Data Scientist specializing in GenAI, NLP, and ML systems.
10+
Years shipping ML
9
Roles & industries
2
IEEE publications
5+
Cloud / ML certs
§ 01
ABOUT

Engineer first.
Researcher second.


I'm a senior full-stack AI/ML engineer and data scientist with 10 years of experience across railway research, NLP, recommender systems, and GenAI. I've shipped everything from MATLAB simulators in academia to LLMs running in production.

Currently I'm a Data Scientist at Avrioc Technologies in Abu Dhabi, where I'm building an AI-driven interactive sport coach and a GenAI chat platform, and managing LLM deployments with vLLM, Ray Serve, and Kubernetes.

Before that I led ML engineering at Mastersystem (Indonesia's largest ICT integrator), built recommendation systems at Tokopedia, and worked on NLP for law at Telkom Indonesia. I started in railway reliability at PT INKA after a postgrad at the Birmingham Centre for Railway Research, where my ATO control paper was published on IEEE Xplore.

Railway R&D · Manufacturing · Telecom · Legal Tech · E-commerce · IT Consulting · Software · Sport Tech
§ 02
WORK

Nine roles.
One throughline.

LinkedIn ↗
JUL 2024 — PRESENT

Avrioc Technologies · Abu Dhabi, UAE

Data Scientist  ·  SOFTWARE / SPORT TECH
  • Building an AI-based interactive sport coach to support and generate athletes' workouts.
  • Developing a GenAI chat application for a sport platform.
  • Managing and scaling LLM deployments; PoC for LLMs on AWS Inferentia.
  • Cross-team collaboration on new AI features.
PYTHON · PYTORCH · FASTAPI · HUGGING FACE · CHAINLIT · OPENAI · vLLM · BASH · DOCKER · KUBERNETES · RAY SERVE · ELASTICSEARCH · GITLAB · AWS (EC2 · INFERENTIA)
JAN 2023 — JUL 2024

Mastersystem Infotama · Indonesia's largest ICT integrator

Lead Staff / Senior Engineer Specialist  ·  IT CONSULTING
  • Led AWS-based ML solutions for clients: Telkomsel (telecom), BTPN Syariah (banking), Gudang Garam (manufacturing), PT Tirta Raharja (water distribution).
  • Built an end-to-end ML system: pipeline, explainable AI (SHAP), Streamlit dashboard, CI/CD via CodePipeline, IaC via CloudFormation.
  • PoC: GenAI chatbot with Claude on Amazon Bedrock + RAG for Telkomsel.
  • Internal GenAI QnA: RAG over company documents using GPT-3.5 + LangChain + Chroma.
  • Data warehouse migration at BTPN Syariah: ETL framework (Glue, Step Functions), Spark SQL, PySpark.
  • Allocated human resources across projects; oversaw cigarette-pack image detection PoC (Amazon Rekognition).
AWS (SAGEMAKER · BEDROCK · GLUE · STEP FUNCTIONS · LAMBDA · REDSHIFT · ATHENA · CODE* · CLOUDFORMATION · KENDRA · pgvector) · CLAUDE · LANGCHAIN · OPENAI · STREAMLIT · SHAP · XGBOOST · PYSPARK · MYSQL · PYTHON
OCT 2022 — DEC 2022

Tokopedia · SE Asia's largest e-commerce platform

Senior Data Scientist  ·  E-COMMERCE
  • PIC for Data Science on shop recommendations across merchant campaign tools.
  • Built recommendation systems over hundreds of millions of products and shops.
  • Analysed large-scale data to surface insights for merchant features.
  • Collaborated with cross-functional teams on new features.
PYTHON · PANDAS · NUMPY · SCIKIT-LEARN · GCP · BIGQUERY · PLOTLY · BASH · FLASK · DOCKER · GITHUB
JAN 2022 — SEP 2022

Legal Analytics · powered by Telkom Indonesia

Head of Legal Data Analytics  ·  LEGAL TECH
  • Helped government and corporate organizations implement big data and AI solutions for legal and social use cases.
  • Translated business requirements into ML/AI solutions.
  • Managed a team of data scientists from concept to delivery.
OCT 2020 — SEP 2022

PT Telkom Indonesia (Persero), Tbk

Data Scientist  ·  TELECOM
  • NLP for Indonesian law: named entity recognition (NER) with spaCy, Keras, and the pre-trained IndoBERT model.
  • BERT-based text similarity and text summarisation for Constitutional Court (Mahkamah Konstitusi) documents.
  • WhatsApp Bot and Telegram Bot development.
  • Elasticsearch text analytics; social media and news analytics; affective text generation.
  • Public transport ticketing data analytics.
  • Developed methodology to categorise potential risks in social media and news texts.
PYTHON · spaCy · NLTK · GENSIM · FASTTEXT · KERAS · TENSORFLOW · PYTORCH · TRANSFORMERS · ELASTICSEARCH · KIBANA · MLflow · DVC · FASTAPI · DOCKER · GITLAB · GEPHI · AWS
APR 2020 — SEP 2020

CODEX · powered by Telkom Indonesia

Data Scientist  ·  INNOVATION LAB
  • NLP text similarity for legal documents.
  • Anomaly detection on big-data analytics for PT Pupuk Indonesia.
  • COVID-19 PeduliLindungi: scraped pandemic data from Indonesian regional government websites.
PYTHON · PANDAS · NUMPY · SCIKIT-LEARN · BEAUTIFUL SOUP · SELENIUM · MARIADB · spaCy · NLTK · FLASK
FEB 2019 — DEC 2019

PT Industri Kereta Api (Persero) · Indonesia's national rolling stock manufacturer

Engineer in Reliability Analytics  ·  MANUFACTURING
  • Statistics-based reliability prediction analysis (descriptive, inferential, regression) on passenger coaches (MATLAB).
  • Lead author of INKA's dedicated RAMS analysis methodology under EN 50126.
  • Main contributor to the development of the company's RAMS management process.
MATLAB · REGRESSION · MAXIMUM LIKELIHOOD ESTIMATION · MONTE CARLO SIMULATION · EN 50126
MAY 2018 — JAN 2019

BPPT — Centre for Transportation System & Infrastructure Technology · Agency for Assessment & Application of Technology

Junior Expert in Reliability Analytics  ·  RAILWAY R&D
  • Statistics-based reliability prediction analysis and FTA (Fault Tree Analysis) in R/RStudio.
  • LRT Greater Jakarta project: RAMS analyses (FTA, FMEA, MTTR, MTBF, EN 50126) for doors, HVAC, bogies, wiring, and control panels.
  • Collaborated with PT INKA in joint engineering teams.
R · MATLAB · REGRESSION · MAXIMUM LIKELIHOOD ESTIMATION · EN 50126
AUG 2016 — APR 2018

Birmingham Centre for Railway Research & Education · University of Birmingham, UK

Postgraduate Researcher  ·  ACADEMIA
  • Three-aspect signalling and timetable simulation for Wirksworth–Duffield via Shottle using BRaVE (Java-based microscopic simulator).
  • Developed ATO (Automatic Train Operation) control system for the Docklands Light Railway in London (MATLAB/Simulink).
  • Paper presented at IEEE ICIRT 2018, Singapore; published on IEEE Xplore.
MATLAB · SIMULINK · BRaVE · KALMAN FILTER · ADAPTIVE CONTROL · PID · FUZZY GAIN SCHEDULING
§ 03
EDUCATION

Two degrees.
Two countries.

AUG 2016 — SEP 2018

University of Birmingham · United Kingdom

Master of Research — Railway Systems Integration
  • Core modules: Mathematics as an Engineering Tool, Railway Operations & Control Systems, Railway Traction Systems Design, Railway Control Systems Engineering, Research Skills.
  • Thesis: ATO (Automatic Train Operation) Control Systems with Kalman filter in MATLAB — case study: Docklands Light Railway in London.
  • Research presented at IEEE ICIRT 2018, Singapore and published on IEEE Xplore.
MATLAB · SIMULINK · KALMAN FILTER · ADAPTIVE CONTROL · FUZZY GAIN SCHEDULING · BRaVE SIMULATOR
SEP 2010 — AUG 2015

Diponegoro University · Indonesia

Bachelor of Engineering — Computer Engineering
  • Core modules: artificial intelligence, fuzzy logic, neural networks, real-time operating systems (RTOS), microprocessor design, embedded systems, distributed embedded systems.
  • Final project: fuzzy logic controller (MATLAB/Simulink) for train transfer function control — published in IEEE Conference Proceedings.
MATLAB · SIMULINK · FUZZY LOGIC · C · MICROPROCESSOR · EMBEDDED SYSTEMS
§ 04
PROJECTS

Public side projects.

All on GitHub ↗
§ 05
CERTS

Verified credentials.

View All →
§ 06
WRITING

Notes from the field.

All on Medium ↗
§ 07
STACK

What I reach
for, by default.

12 CATEGORIES

Languages

01
  • Python
  • SQL
  • R
  • MATLAB / Simulink
  • Bash

ML / DL

02
  • PyTorch
  • TensorFlow / Keras
  • scikit-learn
  • XGBoost
  • SHAP

NLP

03
  • Hugging Face
  • spaCy
  • NLTK
  • Gensim · FastText
  • Transformers

GenAI

04
  • OpenAI
  • LangChain
  • vLLM
  • Chainlit
  • Claude / Bedrock

AWS

05
  • SageMaker · Bedrock
  • Inferentia · EC2
  • Glue · Step Functions
  • Lambda · Athena · Redshift
  • CodePipeline · CFN · SNS

GCP

06
  • BigQuery
  • Vertex AI
  • Cloud Storage
  • Compute Engine

Data

07
  • PySpark
  • Pandas · NumPy
  • Plotly · Dash
  • Streamlit
  • Gephi (graph analytics)

Storage

08
  • MySQL · MariaDB · SQLite
  • Elasticsearch
  • Chroma · FAISS
  • pgvector

Serving

09
  • FastAPI · Flask
  • Docker
  • Kubernetes
  • Ray Serve

MLOps

10
  • MLflow
  • DVC
  • Pytest · Pre-commit
  • Pylint · Black · Isort
  • Jenkins

Azure

11
  • Azure Machine Learning
  • Azure AI Fundamentals
  • Azure Fundamentals

Other

12
  • BeautifulSoup · Selenium
  • Git · GitHub · GitLab
  • Linux Bash
  • RAMS / EN 50126
  • Monte Carlo · MLE
§ 09
CONTACT

Let's build
something.

REPLY < 24H

Best way to reach me — pick one.

Email is fastest for new projects. LinkedIn for recruiting. The form on the right goes straight to my inbox via Formspree.