Today, privacy campaigners have been circulating tweets about Brittany Kaiser’s document dump proving that Cambridge Analytics and AIG were working as a single entity. This would seem somewhat contrary to what the police and the ICO have been able to find. There’s more to find here and when I find it, I’ll post the stories…
Read moreBig Data
I am once again trying to write my blog on solutions architecture and the GDPR. I looked up “Data Lake” again and came across some very good resources in a You Tube channel from Intricity. This summarises the design bifurcation between distributed data sources and unified query logic. It’s five years old. He or his…
Read moreApache Flume
Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that…
Read more