PinnedAntonello BenedettoinLevel Up CodingApache Iceberg: 4 Methods To Create A Warehouse With PySparkA Hands-On Tutorial For Data EngineersMar 11Mar 11
PinnedAntonello BenedettoinLevel Up CodingPython Generators: How To Efficiently Fetch Data From DatabasesTwo practical use cases for Data Engineers.Dec 5, 202312Dec 5, 202312
PinnedAntonello BenedettoinTowards Data Science3 Data Engineering Courses To Advance Your Career In 2023Join the data industry, change role or simply learn cutting-edge technologies by enrolling in Data Engineering Nanodegree In 2023.May 13, 202110May 13, 202110
PinnedAntonello BenedettoinTowards Data Science10 Algorithms To Solve Before your Python Coding InterviewIn this article I present and share the solution for a number of basic algorithms that recurrently appear in MAANG interviews in 2023.Jul 30, 202026Jul 30, 202026
Antonello BenedettoinLevel Up CodingDelete Thousands of S3 Objects Safely with Boto3 And PaginatorsIntroductionJun 17Jun 17
Antonello BenedettoinTowards Data ScienceHow to Automate PySpark Pipelines on AWS EMR With AirflowOptimising big data workflows orchestration.Aug 23, 20239Aug 23, 20239
Antonello BenedettoinTowards Data ScienceBoto3 vs AWS Wrangler: Simplifying S3 Operations with PythonA comparative analysis for AWS S3 developmentJun 20, 20232Jun 20, 20232
Antonello BenedettoinTowards Data Science4 Ways to Write Data To Parquet With Python: A ComparisonLearn How To Efficiently Write Data To Parquet Format Using Pandas, FastParquet, PyArrow or PySpark.Mar 13, 20233Mar 13, 20233
Antonello BenedettoinTowards Data ScienceDockerizing Apache Zeppelin and Apache Spark for Easy DeploymentLearn How To Build a Portable and Scalable Data Analysis Environment with Docker-Compose And Volumes.Jan 24, 20235Jan 24, 20235
Antonello BenedettoinTowards Data Science3 Ways To Aggregate Data In PySparkPySpark Basic Aggregations Explained With Coding Examples.Dec 13, 20221Dec 13, 20221