close
Skip to content
View dvskr's full-sized avatar

Block or report dvskr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dvskr/README.md

BERJAYA

Typing SVG

LinkedIn Twitter Product Hunt IndieHackers PMHNP Hiring


👨‍💻 About Me

Data Engineer @ Propper International | Product Builder by Night

🔧 Day Job: Building 100+ data pipelines for TB-scale retail ops
🚀 Side Hustle: Shipping SaaS with Cursor + Claude AI
📍 Location: St. Louis, Missouri 🇺🇸
🎓 Education: MS Computer Science

Impact: 40% reduced data latency • 50% automation increase • 20% cost reduction

"The best engineers don't just move data — they build things that matter."

BERJAYA

🚀 What I'm Building

BERJAYA

PMHNP Hiring

BERJAYA

Niche job board for Psychiatric NPs

Metric Value
📊 Jobs 5,600+
🏢 Companies 1,349
🎯 Dedup Rate 85%
BERJAYA BERJAYA
BERJAYA

Gym Tracker

BERJAYA

AI-powered workout tracking app

Feature Detail
🏋️ Exercises 423
🤖 AI Coach GPT-4
❤️ Health Apple
BERJAYA BERJAYA
BERJAYA

FreelancerShield

BERJAYA

Business OS for freelancers

Feature
👥 Clients
📄 Contracts
💰 Invoicing
BERJAYA BERJAYA

💼 Professional Experience

Propper Data Engineer • May 2023 - Present • St. Louis, MO

Building data infrastructure for a multi-terabyte retail operation

Metric Impact
📊 ETL Pipelines Built 100+ using Airflow & AWS Glue
⚡ Real-time Streaming Kafka + Spark Structured Streaming
📈 Data Latency Reduction 40% improvement
🤖 Process Automation 15+ manual processes automated
💰 Cost Optimization 20% reduction via Parquet/ORC
📍 Previous: Globus Medical (2021-2022)

Data Engineer • Healthcare data engineering with HIPAA/GDPR compliance

  • Built 50+ batch and streaming pipelines for clinical/financial data
  • Ingested 1TB+ healthcare data daily via Spark
  • 25% reduction in claims processing SLA
  • Zero compliance violations during HIPAA audits
  • Implemented bronze-silver-gold lakehouse methodology

🛠️ Tech Stack

Python
Python
AWS
AWS
Azure
Azure
Spark
Spark
Snowflake
Snowflake
Kafka
Kafka
Databricks
Databricks
Docker
Docker
TypeScript
TypeScript
React
React
Next.js
Next.js
Tailwind
Tailwind
Supabase
Supabase
Prisma
Prisma
GitHub
GitHub
Vercel
Vercel
📊 Full Skills Breakdown

Data Engineering (Day Job)

Languages       → Python (Pandas, NumPy, PySpark) • SQL • Bash
Big Data        → Apache Spark • Hadoop • Hive • Kafka • Delta Lake
Warehousing     → Snowflake • Redshift • PostgreSQL • BigQuery • MongoDB  
ETL             → Airflow • dbt • AWS Glue • Azure Data Factory • NiFi • Databricks
Cloud           → AWS (S3, Glue, Lambda, Redshift) • Azure (ADF, Synapse) • GCP
DevOps          → Terraform • Jenkins • GitHub Actions • Docker • Kubernetes
BI              → Power BI • Tableau • Looker

Product Building (Side Projects)

Frontend        → Next.js • React Native • TypeScript • Tailwind CSS
Backend         → Supabase • Prisma • PostgreSQL • Stripe
AI Tools        → Cursor • Claude • OpenAI API
Deployment      → Vercel • Expo • App Store • Play Store

📊 GitHub Stats

GitHub Stats GitHub Streak

Activity Graph

github-snake


📊 Featured Data Projects

🏎️ Formula 1 Racing Analytics

Enterprise Lakehouse on Azure for F1 telemetry

Tech: Azure Databricks • Delta Lake • PySpark • ADF

  • 📊 Raw → Bronze → Silver → Gold architecture
  • 🔐 Unity Catalog governance
  • ⚡ Real-time race KPIs
  • 📈 Power BI dashboards

🚕 NYC Taxi Analytics

Big data pipeline on Azure Synapse

Tech: Azure Synapse • PySpark • Cosmos DB • Power BI

  • 🗺️ Geospatial visualization
  • 🔄 Batch and streaming pipelines
  • 🔗 HTAP with Cosmos DB
  • 📊 100M+ trip records processed

🎓 Education & Certifications

🎓 M.S. Computer Science Southeast Missouri State University GPA: 3.7
🎓 B.Tech Computer Science Karunya Institute of Technology
📜 Neural Networks & Deep Learning DeepLearning.AI
📜 Python for Everybody University of Michigan

🌱 Currently

+ 🚢 Shipping Gym Tracker to iOS and Android
+ 🔨 Building FreelancerShield features  
+ 📝 Documenting journey on Twitter #buildinpublic
+ 💼 Open to Data Engineering roles with product impact

📬 Let's Connect

Email

I'm always happy to chat about Data Engineering, AI-assisted development, or the indie hacker journey.


BERJAYA

"The best engineers don't just move data — they build things that matter."

Popular repositories Loading

  1. Python__Projects Python__Projects Public

    Python 1

  2. PMHNP-Job-Board PMHNP-Job-Board Public

    TypeScript 1

  3. gym-tracker gym-tracker Public

    TypeScript 1

  4. Freelancer-Shield Freelancer-Shield Public

    TypeScript 1

  5. Databricks_Project_On_Formula1 Databricks_Project_On_Formula1 Public

    Jupyter Notebook

  6. Synapse_Analytics_Project_On_NYC_TAXI Synapse_Analytics_Project_On_NYC_TAXI Public

    Jupyter Notebook