Latest
Mar
28
Books for next level skills
Algorithms and Structures Massive Datasets
Author: Dzejla Medjedovic , Emin Tahirovic
beautiful pictures and inspiring reading!
Designing Cloud Data Platforms
Author:
Dec
22
Minio S3 small setup 40 Gb/sec read using PCI 5.0.
Small S3 minio setup on prem gives nearly 40Gigabyte/Sec sustained read. Roughly 20Gigabyte/Sec left on the databus . Earlier
1 min read
May
24
50 shades of Iceberg CDC
There is a plethora of ways to transfer data from common relational SQL databases (source) to Apache Iceberg (destination). This
3 min read
May
08
Apache Knox First encounter
This post setups up an standalone Apache Knox 1.3. I do this to better understand and improve customer'
4 min read
May
07
Minio S3 small setup -22.8 Gb/sec read.
Small S3 minio setup for benchmarking Iceberg/Delta on prem with S3.
This is the non clustred minio setup gives
2 min read
May
07
Linux Software raid - 3x 980 pro
I Combined 3 x nvme Samsung 980 pro on my Workstation based on ASUS Pro WS WRX80E-SAGE SE. The result
2 min read
Oct
06
Rclone
Robust software to distribute files between A and B. Distribute operations are move/copy/sync and A,B is roughly
2 min read
Sep
24
Artificial Neural Networks: The biological model
The first computational model of an artificial neuron was proposed by McCulloch & Pitts all the way back in 1943,
3 min read
Sep
04
Fixed column to parquet
The original solution parsing fixed column size files to parquet using spark v2.3.2/jdk8 was suspiciously slow. Trying
1 min read
Aug
29
Kafka Delta Ingest
A first look at an Delta sink meant to transfer data from Kafka to Delta. This software is implemented in
1 min read