Recall Modernization & Data Mart Development
S&P Global
Legacy system hosted on VMware needed modernization for better scalability. Migrated the end-to-end system to a managed AWS environment and architected a centralized "Recall Data Mart".
Scalable data pipelines. Engineered for impact.
Designing & implementing data pipelines, optimizing existing architectures, and integrating AI/LLM technologies โ for batch and streaming workloads.
End-to-end data engineering solutions tailored to your business needs
Architect and build end-to-end data pipelines for both batch and streaming workloads. From ingestion to transformation to delivery โ designed for reliability and scale.
Revise, refactor, and optimize existing data pipelines for better performance and lower cost. Migrate legacy systems to modern cloud-native architectures.
Design and implement scalable data warehouses and lakehouse architectures using Medallion patterns for reliable, queryable, and maintainable data.
Leverage the latest AI, LLM, and GenAI technologies within your data platform. From embedding models in pipelines to building intelligent data products.
I'm Arnob Kumar Dey, an independent freelance Senior Data Engineering Consultant with over 10 years of experience helping organizations transform their data infrastructure.
I hold an M.Tech from BITS Pilani and specialize in cloud-native data architectures, particularly the Medallion Architecture pattern. I work with Fortune 500 companies and fast-growing startups alike โ designing pipelines that are reliable, scalable, and cost-effective.
Battle-tested tools and platforms I use to deliver production-grade solutions
Enterprise-scale data solutions that drive business value
S&P Global
Legacy system hosted on VMware needed modernization for better scalability. Migrated the end-to-end system to a managed AWS environment and architected a centralized "Recall Data Mart".
Delta Airlines
Manual pilot scheduling needed automation to ensure strict FAA regulation compliance. Led the design of a cloud-native scheduling optimizer using AWS CDK, Lambda for orchestration, and SageMaker for model deployment.
Internal Project
Designed a custom framework using Great Expectations to enforce data quality standards across pipelines handling FHIR healthcare data, ensuring compliance and reliability.
A proven approach to delivering data engineering solutions
Understand your data landscape, existing infrastructure, and business objectives.
Design scalable, cost-effective solutions aligned with your tech stack and goals.
Build, test, and deploy with CI/CD best practices and thorough documentation.
Monitor performance, tune for efficiency, and iterate for continuous improvement.
Verified expertise in cloud and data engineering technologies
Developer Associate
Expertise in AWS services, deployment, and security best practices
Data Engineer Associate
Proficiency in Databricks, Apache Spark, and lakehouse architecture
Have a data engineering challenge or project in mind? Let's discuss how I can help.