Pinned · Published in Level Up Coding · Apache Iceberg: 4 Methods To Create A Warehouse With PySpark · A Hands-On Tutorial For Data Engineers · Mar 1, 2024
Pinned · Published in Level Up Coding · Python Generators: How To Efficiently Fetch Data From Databases · Two practical use cases for Data Engineers. · Dec 5, 2023
Pinned · Published in TDS Archive · 3 Data Engineering Courses To Advance Your Career In 2023 · Join the data industry, change role or simply learn cutting-edge technologies by enrolling in Data Engineering Nanodegree In 2023. · May 13, 2021
Pinned · Published in TDS Archive · 10 Algorithms To Solve Before your Python Coding Interview · In this article I present and share the solution for a number of basic algorithms that recurrently appear in MAANG interviews in 2023. · Jul 30, 2020
Published in Level Up Coding · Querying Kafka Topics Using Trino and SQLPad · A Step-By-Step Tutorial With Docker · Nov 11, 2024
Published in Level Up Coding · Delete Thousands of S3 Objects Safely with Boto3 And Paginators · Jun 17, 2024
Published in TDS Archive · How to Automate PySpark Pipelines on AWS EMR With Airflow · Optimising big data workflows orchestration. · Aug 23, 2023
Published in TDS Archive · Boto3 vs AWS Wrangler: Simplifying S3 Operations with Python · A comparative analysis for AWS S3 development · Jun 20, 2023
Published in TDS Archive · 4 Ways to Write Data To Parquet With Python: A Comparison · Learn How To Efficiently Write Data To Parquet Format Using Pandas, FastParquet, PyArrow or PySpark. · Mar 13, 2023
Published in TDS Archive · Dockerizing Apache Zeppelin and Apache Spark for Easy Deployment · Learn How To Build a Portable and Scalable Data Analysis Environment with Docker-Compose And Volumes. · Jan 24, 2023