Search Results
apache/arrow-datafusion: Apache Arrow DataFusion and Ballista query engines
Running Awk in parallel to process 256M records
[https://ketancmaheshwari.github.io/posts/2020/05/24/SMC18-Data-Challenge-4.html] - - public:aguynamedryan
TXR Language
What's new in Kiba ETL v3 (visually explained)
thbar/kiba: Data processing & ETL framework for Ruby
The Rise and Fall of the OLAP Cube
Starting out with data puddles, then we’ll think about data lakes
[https://medium.com/comic-relief/starting-out-with-data-puddles-then-well-think-about-data-lakes-f103111946db] - - public:aguynamedryan
Doing data right is time-consuming and hard! There you go, the secret is out. But can we make it easier? Surely that is just part of engineering 101 and we should just accept it, right? The issue for…
GNU Recutils
Building Serverless Data Pipelines on Amazon Redshift By Writing SQL with Datacoral | Amazon Web Services
[https://aws.amazon.com/blogs/apn/building-serverless-data-pipelines-on-amazon-redshift-by-writing-sql-with-datacoral/] - - public:aguynamedryan
Pentaho, Talend, and Jitterbit Comparison
[http://it.toolbox.com/blogs/open-etl-tools/pentaho-talend-and-jitterbit-comparison-31925] - - public:Yabalicious