Apache Spark, Apache Kafka, Airflow, PySpark, Flink-style streaming, ETL/ELT Pipelines, Real-time & Batch Processing
Python, SQL, TypeScript, PostgreSQL, MongoDB, Neo4j, Snowflake, ElasticSearch, Redis
AWS (EC2, ECS, Lambda, S3, Route 53, VPC), GCP (BigQuery, Dataproc, Cloud Storage), Azure (Databricks, Data Lake Storage)
Terraform, Docker, Kubernetes, Jenkins, GitLab CI/CD, Infrastructure as Code (IaC)
PyTorch, OpenCV, ML Pipeline Development, Time-Series Forecasting, Computer Vision, Power BI, Tableau, Streamlit
Data Lake Design, Microservices, Distributed Systems, Data Modeling, Stream & Batch Architecture