Pinned | Published in Level Up Coding | Apache Iceberg: 4 Methods To Create A Warehouse With PySpark | A Hands-On Tutorial For Data Engineers | Mar 11
Pinned | Published in Level Up Coding | Python Generators: How To Efficiently Fetch Data From Databases | Two practical use cases for Data Engineers. | Dec 5, 2023
Pinned | Published in Towards Data Science | 3 Data Engineering Courses To Advance Your Career In 2023 | Join the data industry, change role or simply learn cutting-edge technologies by enrolling in Data Engineering Nanodegree In 2023. | May 13, 2021
Pinned | Published in Towards Data Science | 10 Algorithms To Solve Before your Python Coding Interview | In this article I present and share the solution for a number of basic algorithms that recurrently appear in MAANG interviews in 2023. | Jul 30, 2020
Published in Level Up Coding | Delete Thousands of S3 Objects Safely with Boto3 And Paginators | Jun 17
Published in Towards Data Science | How to Automate PySpark Pipelines on AWS EMR With Airflow | Optimising big data workflows orchestration. | Aug 23, 2023
Published in Towards Data Science | Boto3 vs AWS Wrangler: Simplifying S3 Operations with Python | A comparative analysis for AWS S3 development | Jun 20, 2023
Published in Towards Data Science | 4 Ways to Write Data To Parquet With Python: A Comparison | Learn How To Efficiently Write Data To Parquet Format Using Pandas, FastParquet, PyArrow or PySpark. | Mar 13, 2023
Published in Towards Data Science | Dockerizing Apache Zeppelin and Apache Spark for Easy Deployment | Learn How To Build a Portable and Scalable Data Analysis Environment with Docker-Compose And Volumes. | Jan 24, 2023
Published in Towards Data Science | 3 Ways To Aggregate Data In PySpark | PySpark Basic Aggregations Explained With Coding Examples. | Dec 13, 2022