Activity
Mon
Wed
Fri
Sun
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
What is this?
Less
More

Memberships

Modern Data Community

Public ā€¢ 573 ā€¢ Free

4 contributions to Modern Data Community
What is the value of extracting data into a cheap file store?
I've often seen it recommended to extract data out of a data source into a parquet file into a cheap file storage like Azure Blob or s3 buckets... what is the value of adding this step when all I'm doing is copying it on into a SQL Database?
2
6
New comment Feb 19
1 like ā€¢ Feb 13
@Emile Van Der Heyde thanks for your reply. It sounds like the main reason is to create more resilient pipelines. In fact, a problem I've often experienced is when the whole pipeline fails on an overnight schedule, we often had to ingest from the source again, which often takes time, and longer, because the operational systems are by them in use. It makes sense then that we can reload from the files in the bucket/Blob instead, without requiring a fresh copy which could be subject to data change. We probably should also consider how much of it to keep and will need to look into how to move it around in the bucket etc. I've also thought around whether having some validation rules and quarantine would be useful in this area rather than in SQL.
What do we call ourselves
Often, in small teams, we find ourselves handling a wide spectrum of tasks related to data, including Data Analytics, Engineering, and Machine Learning. I believe many of us are in that versatile position in our careers. I joined as a Product Analyst and am currently the entire Data team at my company šŸ˜„. This situation has led me to wonder about what title makes more sense when introducing myself. Is anyone else facing a similar dilemma? I've come across titles like 'Data Specialist,' 'Data Analytics Engineer,' and 'Data Generalist.' Are there any other titles that convey a more nuanced or meaningful description of roles similar to these?
4
9
New comment Feb 7
2 likes ā€¢ Feb 7
I've had various data job titles so far; Data Analyst, Insight Analyst, Business Analytics Engineer, Business Intelligence Developer and now a Business Intelligence Engineer. Pretty much all of those have been the same tasks of work, with maybe slightly different focus given the tech and demand.
[Start Here] Welcome to The Modern Data Community!
Hello! Welcome to The Modern Data Community. The goal of this community is to help Data Engineers on small (or solo) teams confidently build modern architectures by simplifying key concepts, clarifying common strategies & learning from others. Pumped to have you here! ==================== HOW IT WORKS ==================== By joining, you get instant access to tons of free content (see Classroom). Dive right in. But even more can be unlocked by contributing to the Community, which I encourage you to do. It works like this: Contribute (post/comment) >> Get points (likes) >> Reach new levels >> Unlock content ==================== 6 SIMPLE GUIDELINES ==================== āŒ Do not post error messages looking for others to debug your code. That's why Stack Overflow and other tool-specific Slack channels exist. āŒ Do not use this community for self-promotion (unless Admin approved). We all know it when we see it. āŒ Do not create low-quality posts with poor grammar/spelling or that provide little (or no) value. These will be deleted. You can do better! āœ… Ask questions, share your experiences & overall be a good person. This not only helps everyone get better, but can help you unlock bonus content faster. Win-Win. āœ… Speaking of wins, share yours! Whether it's finally solving a complex problem, hitting a team milestone or starting a new gig - post about it. You'll get the props you deserve and it just might inspire somebody else. āœ… Take the time to craft thoughtful posts & always proof-read before hitting submit. We're all about quality here. High quality posts --> more engagement (aka you'll climb the leaderboard & unlock content) --> ensures the community stays enjoyable for everyone. ==================== QUICK LINKS ==================== Here are a few links to help you get going: - Classroom - What's Your Data Stack? - Leaderboard - Work with me (Kahan Data Solutions)
31
123
New comment 20d ago
1 like ā€¢ Feb 6
Hello, I'm Adam. I've been working in data roles for 5-6 years, and a 10 year IT background before that. I currently work as a divisional tech lead where I am focused on planning out a roadmap for the data architecture and how to get there, as well as implementing that, growing the team and supporting the business as usual.
0 likes ā€¢ Feb 7
@Michael Kahan thank you!
What's your Data Stack?
It's one thing to read articles or watch videos about perfectly crafted data architectures and think you're way behind. But back here in reality, things get messy & nothing is ever perfect or 100% done. Most of us are usually working on architectures that are: - Old & outdated - Hacked together - Mid-migration to new tools - Non-existent Or perhaps you're one of the lucky ones that recently started from scratch and things are running smoothly. Regardless, the best way to learn what's working (and not working) is from others. I believe this could be one of the best insights this community can collectively offer each other. So let's hear it. What does your data stack look like for the following components? 1. Database/Storage 2. Ingestion 3. Transformation 4. Version Control 5. Automation Feel free to add other items as well outside of these 5, but we can focus on these to keep it organized.
8
65
New comment 17d ago
2 likes ā€¢ Feb 6
I work predominantly with a Microsoft Stack: Orchestration: Azure Data Factory Data Storage: Azure Blob & Azure SQL Database Reporting: Excel and Power BI
1-4 of 4
Adam Smith
2
12points to level up
@adam-smith-1422
Business Intelligence Engineer

Active 84d ago
Joined Feb 6, 2024
powered by