Welcome to our blog

We are a group of consultants who are passionate about data technologies and coding. Our team's expertise in Big Data ranges from Ab Initio and open-source data engineering technology to machine learning and other data science tools. Sharing knowledge is a core practice for our team, so we use this blog to post technical findings and practical use cases from our past and current projects. We will also keep you updated about events, insights and conferences.

04/10/2019

Making Life Easier with Databricks

As a consultant using PySpark on a daily basis, I always ask Data Scientists and Data Engineers whether they have ever tried using it. The usual response: […]
02/10/2019

Car Classification with Deep Convolutional Neural Networks on Databricks

Convolutional Neural Networks (CNNs) are state-of-the-art neural network architectures that are primarily used for computer vision tasks. CNNs can be applied to a number of different […]
26/04/2019

Set Up PySpark on Windows

1. Install Anaconda: You should begin by installing Anaconda, which can be found here (select your OS from the top): https://www.anaconda.com/distribution/#download-section For this how-to, Anaconda 2019.03 […]
03/04/2019

Cloudera bootcamp in Amsterdam

It all started when our boss Pasquale introduced us to the Cloudera Services Enablement Bootcamp in December of last year: a 1 […]
14/02/2018

Types of data and their importance

Introduction: Nowadays, when you start talking about data, you will frequently come across two main terms: structured and unstructured. Many times, also the term semi-structured, and […]
12/01/2018

Chicken & Egg – data or question?

Introduction: Let me start with some basic questions in order to find a good entry point into this topic. Q: Why do we start talking about data? […]