Activity
Mon
Wed
Fri
Sun
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
What is this?
Less
More

Memberships

Data Innovators Exchange

Public • 193 • Free

14 contributions to Data Innovators Exchange
Data Innovator Community Meetups
To all Data Innovators, Where do you want to see the next Data innovator events? Due to the huge success of the community meetups in London and Munich we want to keep them coming. In these events we got very good feedback on the focus of the workshops, so we are going to keep the focus on them and stay with the 1 day format with: - Success story - 2 Workshops - Panel discussion The team currently is brainstorming locations and would love to hear your thoughts on this 🙂 So please let us know in the comments where you want a Data Innovators Event to take place and we will give our best to make it happen 🙂 Thank you for your input!
4
9
New comment 4d ago
1 like • 10d
@Martin Šafránek If that was in November, this would be a wine and a goose🤷‍♂️
1 like • 6d
@Paul Barlow Yes!
Data Modeling in the Modern Data Stack
Data modeling is crucial due to the increased number of data sources, diverse data consumers, and the need for optimization (both speed and cost). Mike Kahan briefly showed different modeling approaches, e.g., Star Schema and Data Vault 2.0. At the end, he delves into hybrid approaches. Have you used hybrid approaches before? If so, which modeling approaches did you mix together and how?
2
1
New comment 24d ago
Data Modeling in the Modern Data Stack
2 likes • 24d
Interesting one. Many years ago I was, as a data engineer, on a project where the intention was to create EDW with core in 3NF model and data marts in Star - dimensional model. Basically an Inmon's idea of a data warehouse. We ended up with: 1. having biggest transactional table in core as a Fact and a Dimension, rest in 3NF'ish 2. Data marts with Star'ish model, but some dimension referenced each other directly (no via fact table) This was far from perfect modeling (and not only modeling), but business was extremely happy having EDW delivery value, so in general it was a great success. :)
Recommendations Needed: Best Resources for Learning Data Governance Strategies
Hi everyone! I'm looking for recommendations on where to learn more about Data Governance, especially from a strategic and requirements perspective, rather than just technical-focused content. Any good courses, training programs, or resources that dive deeper into governance frameworks and best practices? I’d appreciate your insights!
3
6
New comment 26d ago
0 likes • 27d
For all data management aspects, where data governance is part of, I would think of www.dama.org
0 likes • 26d
@Lorenz Kindling Not personally, but I know that they are quite active as an organization. I own the book DAMA-DMBOK. It is really set of principles for data management (like TOGAF for architecture, or ITIL for IT Management) - so quite boring read :) AI Assistant would give you probably something shorter and equally good - to start with. 🤫
Is hashing good enough for anonymising data?
https://www.rnz.co.nz/news/business/527419/inland-revenue-giving-thousands-of-taxpayers-details-to-social-media-platforms-for-ad-campaigns?fbclid=IwY2xjawFLSwJleHRuA2FlbQIxMAABHfiQoZd2lKNuLPKWDo5IrGSrtYtTKNwWBrS0kfJBtccVTTWP9FPKrjY3zg_aem_jp9YxiznNYVeVO5Oo9wqfA
5
7
New comment 26d ago
Is hashing good enough for anonymising data?
1 like • Sep 9
Seems right. What are alternatives?
0 likes • 26d
@Shane Gibson this is an operational solution. I what would be the technical better solution, something like “salted hashing”
Tools Data Engineers need to know in 2024
There are so many tools out there, but I found this video on YouTube. It did a good job of breaking down the essentials. Here's a quick list of the tools mentioned: - Basics: SQL, Python, Linux, bash scripting, network understanding - Technical Basics: Git, SFTP, PGP - Databases: PostgreSQL, MySQL, MongoDB - Data Platforms: Snowflake, Databricks, BigQuery, Redshift, Azure Synapse Analytics - Orchestration, ETL & Data Pipelines: Airflow, Dagster, SSAS, Azure Data Factory - Cloud: AWS, Azure, GCP - Others: Docker, Kubernetes, Terraform If you had to give advice on where to start, what would that be? What are your favored tools?
6
5
New comment 27d ago
Tools Data Engineers need to know in 2024
3 likes • 29d
I would add to this “data storage formats”: table, json, parquet, delta, iceberg
2 likes • 27d
@Lorenz Kindling Correct. I think that for each person that works with data, on each level of data stream, it is important to be at least familiar with basic modeling terminology and concepts.
1-10 of 14
Jaroslaw Syrokosz
3
9points to level up
@jaroslaw-syrokosz-8262
⭐️Analytics & Modeling Engineer | Value-Driven Data Consultant ⭐️Certified in: DataVault 2.0, Snowflake, DBT, Azure, Airflow, Teradata, MicroStrategy

Active 16h ago
Joined Jul 28, 2024
EU
powered by