Join the data industry, change role or simply learn cutting-edge technologies by enrolling in a Data Engineering Nano-Degree.

Photo by Marvin Meyer on Unsplash

A note for my readers: This post includes affiliate links for which I may earn a small commission at no extra cost to you, should you make a purchase.

**USE CODE UPSKILL21 FOR A 50% DISCOUNT ON UDACITY COURSES**

You Are Not What You Studied At Uni

As controversial as it may sound, you are not what you studied at university and as you progress with your career, companies will tend to focus less on the classes you attended in your early twenties and more on the skills you acquired along the way and that your will bring to the table once hired.

You are not what you…


Programming | Interviewing | Office Hours

In this article I present and share the solution for a number of basic algorithms that recurrently appear in FAANG interviews.

Photo by Headway on Unsplash

Update: Many of you contacted me asking for valuable resources to nail Python coding interviews. Below I share 4 courses/platforms that I strongly recommend to keep exercising after practicing the algorithms in this post:


FORECASTING | PYTHON | OFFICE HOURS

Learn how to add value to your business by forecasting future performance with Prophet

Photo by Mitchell Luo on Unsplash

Update: Many of you contacted me asking for valuable resources to learn more about time series forecasting with Python. Below I share 2 courses that I personally took and that would strongly recommend to expand your knowledge on the topic:

Hope you’ll find them useful too! Now enjoy the article :D

Introduction

In the first part of this tutorial (Forecasting Business KPIs With Python Using Prophet-Part I), we built a simple predictive model using Facebook’s Prophet library


Forecasting | Python | Office Hours

Learn how to add value to your business by forecasting future performance with Prophet.

Photo by Frank Busch on Unsplash

Update: Many of you contacted me asking for valuable resources to learn more about time series forecasting with Python. Below I share 2 courses that I personally took and that would strongly recommend to expand your knowledge on the topic:

**USE CODE JULY75 FOR A 75% DISCOUNT ON UDACITY COURSES**

Hope you’ll find them useful too! Now enjoy the article :D

Introduction

Picture this: you are a data analyst or business intelligence analyst with experience building KPIs…


Learn how to query a PostgreSQL DB with Psycopg2 or SQLAlchemy, when a SSH encryption is required.

Photo by John Schnobrich on Unsplash

Do you wish to become a Data Engineer and advance your career in 2021? Have a look at Udacity’s on-demand Data Engineering Nanodegree and take advantage of the following special discount code for August 2021:

*USE CODE UPSKILL21 FOR A 50% DISCOUNT ON UDACITY COURSES**

Introduction

Python offers at least two straightforward ways to interact with a PostgreSQL database:

  • creating a connection using pycopg2 package
  • generating and engine through the sqlalchemy package.

As long as you have valid credentials, you can establish a connection with few lines of code. …


Let me share with you 200+ realistic, high quality questions to become a certified Spark 3.0 Developer

Photo by Dzenina Lukac from Pexels

Introduction

Preparing for a professional certification can be stressful. Because of your full time job and your other commitments, you probably have little time left to focus and need to decide what’s the material that is worth studying.

The Databricks Associate Apache Spark Developer Certification is no exception, as if you are planning to seat the exam, you probably noticed that on their website Databricks:

  • recommends at least 2 official books;
  • offers a number of (expensive) on-demand remote or F2F courses;
  • avoids to share mock MCQs test to assess the difficulty of the exam.

So you may be wondering:

“What should…


In this tutorial you will learn how to generate a live data stream in JSON format, store files in an AWS S3 bucket and aggregate data on the fly with Python (Spark)

Photo by Evgeny Tchebotarev on Pexels

The Problem: Generating Streaming Datasets

If you are familiar with PySpark and with its Structured Streaming API you know how easy it is to express your streaming job as standard batch job, with the difference that a data stream can be treated as a table that is being continuously appended.

Despite writing a stream processing model in straightforward, finding streaming data sources, could be a challenging task, particularly while testing your application before deploying it or setting your very first streaming job while learning PySpark.

To make your life at work easier and your learning process quicker, in this tutorial I will show you how…


Why You Don’t Need To Know Java Or Scala To Learn Data Stream Processing right now

Photo by Sigmund on Unsplash

*USE CODE UPSKILL21 FOR A 50% DISCOUNT ON UDACITY COURSES**

A Modern Approach To Data Engineering

A stream of data can be defined as an uninterrupted flow of data generated by multiple type of sources. As modern companies increasingly rely on applications that produce and process data in real-time, data streaming is an increasingly in-demand skill for data engineers.

By using a stream processing framework ( like Kafka Streaming, Apache Streaming, Apache Faust, AWS Kinesis), stream of events can be processed, stored, and analyzed as they are generated in real-time.

A natural question that arises is:

“Have you and your Data Engineering team already shifted toward…


In this article I share real MCQs, as well as my 3 top tips to get ready for the exam.

Certification Badge By credential.net

A note for my readers: This post includes affiliate links for which I may earn a small commission at no extra cost to you, should you make a purchase.

A Lack Of Resources To Practice?

If you are in the process of studying for the Databricks Associate Developer for Apache Spark 3.0 certification you are probably facing the same problem I faced a few weeks ago: a lack of mock tests to assess your readiness.

By now, you should know that the exam consists of 60 MCQs and that you will be given 120 mins to answer correctly to at least 42 of them (70%).

Another…


That SQL interview is approaching and you are looking for some challenging exercises to test your readiness? Your are in the right place

Photo by Tristan Gassert on Unsplash

Has Your Interview Countdown Started?

Congratulation! If you are reading this article you have probably already passed the first screening interview and have been invited to the next step: a SQL coding round.

Whether your next interview is due in 24 hours or in 2 weeks time, one thing is for sure: while you are doing your best to revise the most common SQL topics by solving realistic exercises, the clock is ticking.

In order to help you testing your knowledge and conscious of the popularity of SQL Window Functions, below I shared a few challenging exercises on ranking by re-creating the sort of interaction…

AnBento

Snr BI Engineer @Wise | 🏆 Among Top Writers In Data Engineering 💻 Follow & Contact Me 🤝 https://www.linkedin.com/in/anbento4

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store