data-engineering

Quickstart: Apache Spark on Kubernetes

Using Apache Spark Operator in Kubernetes to streamline your Big Data workflows with a cloud-native approach without relying on a Hadoop cluster.

ReclameAQUI Data Lake

Containerized Data Lake running on GCP, using Kubernetes (GKE) to orchestrate Apache ecosystem components, with GCS for data storage and BigQuery as the analytical interface. Governance and security fully implemented using existing Google Suite groups and users through LDAP, giving stakeholders full autonomy to consume data from the Lake (with auditing).

Dotz Data Labs

Serverless and cloud-managed Big Data architecture using Google's Cloud Platform (GCP) to support a 360-degree view of customers and partners of Dotz, one of the largest companies in the field of loyalty program in Brazil

Easynvest Data Platform

Hybrid-cloud Data Lake with most of its capabilities running in AWS. Among the main objectives we had the automation of credit analysis, targeted campaigns to investors according to profile and intelligent detection of money laundry