Data Engineers design, build, and maintain systems that enable organizations to collect, store, and analyze large volumes of data. They ensure data pipelines are efficient, scalable, and reliable, empowering data-driven decision-making.
Nguyen
Building the future, one pipeline at a time 🚀
A modern data lakehouse platform with a conversational AI interface for querying data in natural language. Users explore data and run complex queries through a chat interface, with no SQL knowledge required.
Real-time data pipeline for IoT using Apache Flink, Kafka, Delta Lake, and Apache Iceberg. Handles CDC streams from MongoDB for analytics and downstream consumption.
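As a rough illustration of what consuming a CDC stream involves, the sketch below replays change events into an in-memory table. It is plain Python, not the Flink/Kafka code from the project. The event fields (`operationType`, `documentKey`, `fullDocument`, `updateDescription`) follow the shape of MongoDB change events in simplified form, and the `apply_change` helper and the sensor documents are hypothetical.

```python
def apply_change(state, event):
    """Apply one simplified MongoDB-style change event to an in-memory table."""
    op = event["operationType"]
    key = event["documentKey"]["_id"]
    if op in ("insert", "replace"):
        # Inserts and replaces carry the whole document.
        state[key] = dict(event["fullDocument"])
    elif op == "update":
        # Updates carry only the changed and removed fields.
        doc = state.setdefault(key, {"_id": key})
        desc = event["updateDescription"]
        doc.update(desc.get("updatedFields", {}))
        for field in desc.get("removedFields", []):
            doc.pop(field, None)
    elif op == "delete":
        state.pop(key, None)
    return state


# Hypothetical IoT sensor events, in arrival order.
events = [
    {"operationType": "insert", "documentKey": {"_id": "s1"},
     "fullDocument": {"_id": "s1", "temp": 20.0}},
    {"operationType": "update", "documentKey": {"_id": "s1"},
     "updateDescription": {"updatedFields": {"temp": 21.5}, "removedFields": []}},
    {"operationType": "insert", "documentKey": {"_id": "s2"},
     "fullDocument": {"_id": "s2", "temp": 18.0}},
    {"operationType": "delete", "documentKey": {"_id": "s2"}},
]

table = {}
for event in events:
    apply_change(table, event)
```

Replaying the events in order leaves only sensor `s1` in the table, with its latest temperature. A real pipeline would also have to handle ordering, retries, and schema changes, which this sketch ignores.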
Automated ETL pipeline with Apache Spark, Airflow, and HDFS. Data is processed, transformed, and stored for large-scale analytics with orchestrated workflows.
Modern data warehouse built on the Medallion architecture (Bronze, Silver, Gold layers). Focus on ETL, data modeling, normalization, and analytics reporting with SQL Server.
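The three Medallion layers can be sketched in a few lines of plain Python. This is a minimal illustration, not the SQL Server implementation from the project: the order rows, the column names, and the `to_silver` and `to_gold` helpers are all assumptions made up for the example.

```python
from collections import defaultdict


def to_silver(bronze_rows):
    """Bronze -> Silver: drop invalid rows, deduplicate by id, normalize types."""
    seen = set()
    silver = []
    for row in bronze_rows:
        order_id = row.get("order_id")
        if order_id is None or order_id in seen:
            continue  # missing key or duplicate record
        try:
            amount = float(row["amount"])
        except (KeyError, TypeError, ValueError):
            continue  # unparseable amount
        seen.add(order_id)
        region = str(row.get("region", "")).strip().upper()
        silver.append({"order_id": order_id, "region": region, "amount": amount})
    return silver


def to_gold(silver_rows):
    """Silver -> Gold: aggregate cleaned rows into a reporting-ready summary."""
    totals = defaultdict(float)
    for row in silver_rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


# Bronze: raw rows as ingested, including a duplicate and two bad records.
bronze = [
    {"order_id": 1, "region": " eu ", "amount": "10.5"},
    {"order_id": 1, "region": "eu", "amount": "10.5"},
    {"order_id": 2, "region": "us", "amount": "abc"},
    {"order_id": 3, "region": "US", "amount": 4},
    {"order_id": None, "region": "eu", "amount": "1"},
]

silver = to_silver(bronze)
gold = to_gold(silver)
```

Bronze keeps the data exactly as it arrived, Silver holds the validated and deduplicated rows, and Gold holds the per-region totals that a report would read.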
Feel free to reach out for collaboration or any questions!