/r/bigdata
For all big data gurus everywhere, from hedge funds (quant finance) to biotech (drug discovery) to social media (Twitter), to discuss the latest trends, topics, career opportunities and tricks of the trade!
Rules: No advertising, don't blatantly link to your own product(s). Posts must be relevant to big data technologies or discussions.
Hey, does anyone know of resources for the big data course, or anyone who explains the course in detail (especially the Cambridge slides)? I’m lost.
I thought this would be interesting to the audience here.
Uber is well known for its scale in the industry.
Here are the latest numbers I compiled from a plethora of official sources:
They use a Lambda Architecture that splits their data infrastructure into two stacks: a real-time infrastructure and a batch infrastructure.
Presto is then used to bridge the gap between both, allowing users to write SQL to query and join data across all stores, as well as even create and deploy jobs to production!
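To make the Lambda split concrete, here's a minimal Python sketch of the general pattern, not Uber's actual implementation. The events, the cutoff, and every function name are made up for illustration. A batch layer recomputes a view from history, a speed layer covers whatever arrived since the last batch run, and a serving step merges the two at query time (roughly the role the post attributes to Presto, though here it is just a plain function).

```python
from collections import Counter

# Hypothetical events as (timestamp, city). Purely illustrative, not Uber data.
EVENTS = [
    (1, "sf"), (2, "nyc"), (3, "sf"),   # already covered by the last batch run
    (4, "sf"), (5, "nyc"),              # arrived after the last batch run
]

# Assumption: the last batch job processed everything with timestamp <= 3.
BATCH_CUTOFF = 3


def build_batch_view(events, cutoff):
    """Batch layer: recompute per-city counts from history up to the cutoff."""
    return Counter(city for ts, city in events if ts <= cutoff)


def build_realtime_view(events, cutoff):
    """Speed layer: count only the events newer than the batch cutoff."""
    return Counter(city for ts, city in events if ts > cutoff)


def query(city, batch_view, realtime_view):
    """Serving step: merge both views so the answer includes recent events."""
    return batch_view[city] + realtime_view[city]


batch_view = build_batch_view(EVENTS, BATCH_CUTOFF)
realtime_view = build_realtime_view(EVENTS, BATCH_CUTOFF)

print(query("sf", batch_view, realtime_view))   # 3
print(query("nyc", batch_view, realtime_view))  # 2
```

The trade-off this shows: the batch view is complete but stale, the real-time view is fresh but partial, and a query only gets the full picture by reading both.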
A lot of thought has gone into this data infrastructure, driven in particular by complex requirements that pull in opposite directions:
I have covered more about Uber's infra, including use cases for each technology, in my 2-minute-read newsletter where I concisely write interesting Big Data content.
I don’t think so. Instead, it’s here to free data scientists and ML engineers from tedious, repetitive tasks, so you can focus on higher-value work like building better models, uncovering insights from unstructured data faster, and driving more impact for your org and customers.
Check out this Medium article on how Google, Teradata, and Gemini are transforming enterprise data workflows and insights with Generative AI:
Would love to hear your thoughts: how do you see GenAI shaping the future of data science and ML? 👇
The data science domain is huge and if you want to make a career in data science, then you need to be aware of the various components that make up this widely used technology including data, programming languages, machine learning, and more.
As I've described, I'm looking for the best certification for entry into the big data field. I'm currently working as an IT auditor and hope to use that as a stepping stone.
Take your organization from data exploring to #data transformed with this comprehensive guide to data maturity. Discover the four key elements that determine data maturity and how to develop a data-driven culture within your organization. Start your journey to #datatransformation with this insightful guide. Become USDSI® Certified to lead your team in creating a data-driven culture.
Hi everyone,
I graduated in 2022 and currently have 2.5 years of experience in the big data domain. Most of my work involves developing complex Spark-Scala-based procedures and functions tailored to client requirements. I also have some experience with Bash scripting to create reconciliation scripts, as we primarily store data in Hive databases.
The tools and technologies I am proficient in include:
Apache Spark, Kafka, Hadoop, Hive, HBase, Scala programming, MS SQL, Bitbucket, IntelliJ, Git, Python
Although my team also works on Power BI report generation, I haven't had direct exposure to it yet.
I enjoy working in this domain and am eager to expand my knowledge for better career opportunities and growth. Which additional tools or technologies should I learn, or in which of my current skills should I deepen my expertise, to advance my career in big data?
This week, RWA Inc. dropped some incredible updates! The platform, which makes investment opportunities more accessible by tokenizing real-world assets, is bridging the gap between traditional finance and decentralized technology. And the Launchpad platform is at the heart of it all. Launchpad simplifies the process of launching new projects, raising capital, and tokenization, making it way easier for both entrepreneurs and investors.
RWAI, short for Research, Reporting, and Launch AI Agent, is an AI tool developed by RWA Inc. Its main goal? To make the research, reporting, and launch processes for projects faster and easier. In short, it’s a helpful companion for both project creators and investors. Here's what RWAI brings to the table:
RWAI’s roadmap includes some standout features:
RWAI truly aims to provide a practical and seamless experience for its users.
Staking $RWA tokens on the RWA Inc. platform offers users a range of perks that go beyond just earning rewards. Here’s what you get:
Staking is more than just passive income—it’s your gateway to investment opportunities and active participation in the ecosystem.
DAO Labs hosted its first-ever ILO (Initial Labor Offering) for RWA Inc. on its platform, and it was a massive success! As social miners, we had a front-row seat to witness this milestone. This launch clearly showcased DAO Labs' community-focused vision.
Through this process, we saw just how impactful community-driven projects can be. DAO Labs has set a strong example for future project launches and has become a solid reference point for the community.
Hey all!
We’ve launched a Substack called Big Data Performance, where we’re publishing weekly posts on all things big data and performance.
The idea is to share practical tips, and not just fluff.
If you work with Spark or other big data tools, this might be right up your alley.
So far, we’ve covered:
This is a community-driven effort by a few of us passionate about big data. If that sounds interesting, check it out and consider subscribing:
👉 Big Data Performance Substack
We’d love to hear your feedback or ideas for topics to cover next.
Cheers!
The Rise of the AI Data Scientist! AI Data Scientists are leading the way in transforming raw data into powerful insights. Their expertise in both AI and data science is creating groundbreaking solutions across industries. Ready to become part of this exciting evolution?
Hi data nerds,
Here are the latest updates from Rollstack—a platform designed to connect your favorite BI tools (Power BI, Tableau, Looker, Metabase, and Google Sheets) to your presentation software for automatic report generation. If you’re juggling QBRs, client reports, or departmental updates, you might find something here that simplifies your routine.
January 2025 Updates
Power BI Integration Has Arrived
By rolling out this feature, PBI teams now gain access to the same AI-driven reporting that Tableau, Looker, and Metabase have offered since our early days—cutting tens of thousands of hours from report generation. Want to explore further or book a demo?
Learn more and schedule a demo: Power BI integration!
AI Insights Are Now Open to All
Rollstack AI is now open to everyone, giving business professionals an advanced way to generate customized, relevant insights within their presentations and documents. Teams can edit slide commentaries, titles, and more—while preserving the deck’s structure—so stakeholders can reach decisions faster and with greater clarity.
Learn more: AI insights and native charts
Native Charts for PowerPoint and Google Slides
If you’d rather use PowerPoint or Google Slides charts instead of those in your BI tool, you can now convert them into fully editable versions in your presentation software. They include an accompanying spreadsheet, letting you take a closer look at the source data whenever you need.
Check it out: AI insights and native charts article
Thanks for reading, and we hope you’ll explore these new Rollstack features. If you have any questions, let us know in the comments. Your feedback genuinely helps us shape what comes next!
—Team Rollstack
Hey everyone!
I’m working with a 22GB PostgreSQL database (Bitnami/PostgreSQL:16.2.0) and need to generate quick reports, such as linking patients to specific types of consultations.
I’m looking for an open-source tool, preferably Docker-ready, that allows me to:
I need something easy to use, especially for someone comfortable writing SQL queries in PostgreSQL. What’s new in the market that’s simple yet powerful?
Thanks a lot! 🙌