/r/visualization
For topics related to information visualization and the design of graphs, charts, maps, etc.
For topics related to information visualization and the design of graphs, charts, maps, etc.
Post guides, tutorials, and discussion threads about information visualization.
We also welcome posts including visualization works-in-progress and requests for critiques.
Be polite and constructive when posting in this subreddit. Posts and comments that are rude, harassing, sexist, racist, etc. will be removed and may result in a ban.
While posts linking to finished information visualizations are allowed, we encourage sharing visualizations only when they will lead to discussion about the design and construction of the visualization.
See the Related Subreddits section below for more appropriate places to share finished work.
Do NOT post sales, memes, cute pictures, jokes, etc. Repeated offenses of this rule will result in a ban.
Please report any submissions or comments violating these rules using the report button.
If you want to post something related to information visualization but it doesn't fit the criteria above, consider posting to one of the following subreddits.
DataIsBeautiful: Share data visualizations
MapPorn: Share maps, map visualizations, etc.
Infographics: Share infographics and other unautomated diagrams
WordCloud: Specifically for sharing word clouds
DataVizRequests: Request a visualization to be made
Tableau: Share and discuss visualizations made with Tableau software
DataSets: Request and share data sets
SampleSize: Conduct and share surveys
DataIsUgly: Share poorly designed information visualizations
FunnyCharts: Share funny graphs and charts
MathPics: Share pictures and visualizations of mathematical concepts
RedactedCharts: Try to guess what a chart is about without the labels
Statistics: For all questions and articles related to statistics
/r/visualization
Hi there,
I would like to visualize actual business process of large blue chips and build a software that allow everyone to become an insider of on-going routines of a big corporation.
I would like to squizze the syrop from the oranges (daily meetings, chats across team members, on-going activities, emails correspondence, documentation, , blue print, decision making in reference to architecture , software , strategy, team , clients, etc ) , remove any identification, add extra visualization and offer this interactive software for educational institutions like colleges and universities. Students can benefit from a real time environment that allows them to get a grasp of what happens within a blue chip, why this and that decision is accepted or rejected, how the tools , people and processes are interrelated and integrated, what kind of issues pop up and how they are resolved , etc. This provides practical knowledge that can be useful for faster employment .
Example :
01. daily activities of FICO reporting and consolidation department of a big pharma (the issues experienced, how they are resolved , teams dynamics over years, what kind of proposals they raise up and acceptance/rejection rate )
02. daily activities of supply chain architect of a big pharma (the issues experienced, how they are resolved , teams dynamics over years, what kind of proposals they raise up and acceptance/rejection rate )
The second step is to make a software that visualize on-going activities across different departments in blue chips. This product can be offered to big corporations upon import of their source file of different origin.
I appreciate your critics and any other feedback.
Thanks
Hi, I'm 26 year old and have 2 year gap as I quit my job in December 2022- BPO job started as customer service representative and got promoted to senior quality analyst by the time I quit with 3.6 years of experience. Now I've wasted 2 years of my life with nothing to show for the gap except 1 or 2 certificate courses. I'm learning data analytics to become a data analyst. How should I justify the gap and what should I do to get the job asap?
In the age of digital transformation, voice assistants have become ubiquitous in daily life. However, the way we visualize their usage and interaction patterns can significantly influence the interpretation and decision-making process. This paper aims to explore the challenges in visualizing voice assistant usage data effectively. Using an example from UNESCO’s report, we critically analyze a flawed visualization that misrepresents the frequency of voice assistant use. By identifying these flaws, we propose better approaches for displaying such data to ensure clarity and accuracy. Ultimately, the goal is to enhance the understanding of user engagement with voice assistants, providing actionable insights for both developers and policymakers.
Every story begins with a problem, and this one is no different.The visualization we selected comes from UNESCO’s report titled "I'd blush if I could: closing gender divides in digital skills through education". It displays the frequency of use for voice assistant features based on data from Voicebot.ai’s 2018 Smart Speaker Use Case Survey. At first glance, the chart seems informative, but upon closer inspection, it reveals critical flaws that undermine its credibility.
Figure 1.1 - Frequency of Use for Voice Assistant from UNESCO
The most obvious flaw lies in the stacked bar chart design, where categories like Use daily, Use monthly, and Tried at least once overlap, rather than being mutually exclusive. This results in some rows exceeding 100% when summed up—completely violating the basic principle of stacked bar charts. This leaves room for misinterpretation, leading readers to false conclusions. To make matters worse, the legend labels for “Use daily” and “Tried at least once” are swapped, a blunder that’s hard to overlook, especially in such a high-profile report. When such errors appear in an authoritative document, it shatters trust and highlights the need for rigorous quality control.
Figure 1.2 - 2018 Smart Speaker Use Case Frequency from Voicebot.ai
This visualization is important because it addresses the growing role of voice assistants in everyday life, shaping public perceptions and policy decisions. Flawed design and inaccurate labeling not only mislead readers but also undermine the very message the report seeks to convey. By fixing these issues, we can restore the chart’s integrity and ensure it delivers its critical insights effectively.
This chart was intended for a UNESCO report, a context where clarity and accessibility are paramount. The audience likely includes educators, policymakers, and researchers with varying levels of expertise in data interpretation. A visualization with design flaws not only risks misinforming readers but also diminishes the credibility of the report's conclusions.
This chart provides a detailed display of the frequency distribution of smart speakers in different usage scenarios as of January 2018. These usage scenarios cover a wide range from basic functions such as "ask a question" to advanced functions such as "control smart home devices". The data is presented in three levels of classification, namely "Use daily" (dark purple), "Use monthly" (orange), and "Tried at least once" (light purple), clearly showing the proportion of each usage scenario in different frequency categories. Through this classification method, the chart not only reveals users' overall usage preferences for smart speaker functions, but also enables in-depth analysis of the proportion of active users and potential usage trends for different functions. However, the story is obscured by several design flaws that detract from the data's clarity and accessibility.
To understand the chart, the reader must:
However, the lack of labeling for the x-axis (percentage) leaves the reader guessing whether the values represent frequency rates or proportions of users. Moreover, the reliance on a legend forces the audience to repeatedly cross-reference colors with categories, further complicating the reading process.
Procedural Knowledge: Procedural knowledge refers to the "how-to" aspect of cognition, where tasks are performed based on learned rules or processes. In the context of visualizations, readers rely on procedural knowledge to interpret common chart types correctly. For example, stacked bar charts are expected to display mutually exclusive categories that add up to 100%.
Data-Ink Ratio (Edward Tufte, 1984)
Tufte emphasizes minimizing non-essential visual elements (non-data ink) in charts. A high data-ink ratio ensures that visualizations focus the audience’s attention on the data itself rather than decorative or redundant elements.
This visualization violates several cognitive principles, which undermines its usability and interpretability:
Germane cognitive load refers to the mental effort required to integrate new information into existing knowledge structures.
Extraneous cognitive load arises from poorly designed visualizations that demand unnecessary mental effort to process irrelevant details.
Cognitive tunneling occurs when a user focuses narrowly on one aspect of a visualization, neglecting other relevant information.
Replication was the first step in our journey. Using matplotlib, we recreated the original graph as faithfully as possible.
Figure 3.1 - Replicate the graph using matplotlib
The original graph we selected for improvement was marred by complexity and ambiguity, impeding its ability to convey the intended message. Our primary objective was to transform this visualization into a clear, insightful narrative that adheres to the principles of effective data representation.
The first step in this transformation involved selecting the most appropriate chart type. We then tackled the issue of inadequate labeling and annotations to ensure clarity. Increasing the data-ink ratio was another crucial change, enhancing focus on the core data. To address cognitive tunneling, we implemented an interactive chart using the Pyecharts library. Finally, the iterative process of revision and feedback played a vital role in refining our visualization.
In our journey to transform a complex visualization into a clear and insightful narrative, the first pivotal step was selecting the most suitable chart type. The original visualization, a non-standard stacked bar chart, obscured the data's true story by merging disparate elements into a single, confusing image. This not only hindered comprehension but also risked misinterpretation by the audience.
Breaking Down the Complexity:
Choosing Grouped Bar Charts:
Clarity is paramount. A graph must serve as a transparent window into the data it represents, allowing viewers to grasp insights effortlessly. However, the original visualization we encountered was shrouded in ambiguity, primarily due to inadequate labeling and annotations.
"Frequency" or "Proportion"?
Our Approach:
Edward Tufte's concept of the data-ink ratio (as we just introduced in Section 2.3) serves as a cornerstone for creating clear and impactful graphics.
Maximize the data-ink ratio
In our original visualization, Figure 1, the reliance on legends to distinguish categories such as "daily use," "monthly use," and "at least once tried" presented a significant challenge. This setup forced readers into a repetitive back-and-forth between the bar chart and the legend, disrupting their cognitive process and inadvertently increasing cognitive load. Such interruptions can lead to cognitive tunneling, where the audience becomes overly focused on deciphering colors rather than grasping the overall data narrative.
Interactive chart implementation:
Pyecharts
library. This innovation allows users to effortlessly display the values of each feature by simply hovering their mouse over the chart. By eliminating the need to constantly reference the legend, readers can maintain their focus on the data itself, facilitating immediate comprehension and minimizing cognitive distractions.Figure 4.1 - Implementing dynamic interactive charts using pyechart
(This image is our demonstration GIF. To preview it, please check in the markdown report)
*Note: For those using Jupyter Notebook, please refer to the link provided in the appendix to experience this interactive visualization chart, as image previews may not be available.*</br> Preview interactive charts online: https://www.tarikvon.cn/files/pyechart.html
In the realm of information visualization, reducing cognitive load (as we just introduced in Section 2.4) is paramount to ensuring that audiences, irrespective of their expertise, can effectively interpret and engage with the data presented. In this section, we propose changes aimed at minimizing both germane and extraneous cognitive load in the original visualization, thereby enhancing its clarity and accessibility.
Germane Cognitive Load (Audience):
Extraneous Cognitive Load (Visualization):
In the journey of transforming our visualization, the significance of revision and soliciting feedback became our guiding compass.
Attempts:
Feedback and Iterative Refinement:
After implementing the proposed changes, we arrived at a redesigned graph that tells a clearer and more engaging story.
Figure 4.2 - Improvements to the Graph
We have shared our project's work and records on the Reddit forum's visualization
section, a vibrant community dedicated to the discussion and enhancement of information visualization. This platform is frequented by both enthusiasts and professionals who are passionate about the art and science of visualizing data. By posting our project here, we aim to contribute to the collective knowledge and encourage discourse on effective visualization practices.
Our decision to publish the project on this forum underscores the importance of community engagement in advancing information literacy. We hope to inspire others to prioritize quality in their visualizations and to recognize the transformative power of well-crafted data representations.
This project underscored the importance of storytelling in data visualization. A good graph is not just about presenting data; it’s about crafting a narrative that connects with the audience. Through this process, we learned:
Poster I created during the pandemic while staying at home. The genes are color-coded and placed in a simple array with no specific scientific ordering. Designed more as a conversation starter.
Based on the genome information from GenBank: MN908947.3 - www.ncbi.nlm.nih.gov/nuccore/MN908947
Using R and R Shiny for effective data visualization and risk assessment - Real world demo and presentation showing Raiffeisenlandesbank Oberösterreich’s (Austria) advanced risk management practices
Free R in Finance webinar - This week, Thurs, Dec 12, 2024 - Full recording provided to all registrants after webinar is completed
https://r-consortium.org/webinars/quantification-of-participation-risk-using-r-and-rshiny.html
Hi everyone,
I’m conducting a quick survey to gather feedback on graph visualization libraries and the features that matter most to users. Whether you’re a student, developer, data scientist, product manager etc. your insights would be incredibly valuable in helping improve tools for exploring and analyzing complex datasets.
The survey is short (just 3-5 minutes) and focuses on understanding what you look for in a graph visualization library.
Here’s the link to the survey: [Link]
Thank you so much!
A chart with the usual palette (left) and something more natural (right).
Lately, I've been wondering about why data visualizations customarily use highly-saturated color palettes. I understand the conventional wisdom is that vivid colors are supposedly more legible, distinct, and easier to visually map from chart to legend, but are these assumptions necessarily correct? Are there studies?
The human eye is incredibly sophisticated, and has evolved being able to discern a camouflaged predator from the grass it is hiding in. We can distinguish colors in all qualities of light, even across shadows. So, why not make visualizations that better respect what we can see?
My thinking is that not only are posterized palettes sometimes annoying to look at, but they could be more off-putting, too. Are more natural colors easier to look at? And would this tend to make more people look at things if they were easier on the eyes?
I recently did a chart (attached here) where I tried to see if I could add more visual nuance with colors and shading. I basically overlayed some texture (in Photoshop) and tinkered with some filters (the ribbing is a pattern I made from a photo of venetian blinds). I may have gone over the top (yes, probably so with the tungsten yellow), but I'm genuinely curious if anyone sees any benefit in exploring such things.
People who do data visualizations wrestle with legibility and color palettes all. the. time. Such a crowd must have an opinion.
I’ve tried a bunch of tools for work and wanted to do a review of why I think Visme stands out and if you find it worth considering for your infographics, charts, data analytics etc. ← Not paid or sponsored lol.
1. User-Friendly Interface
Visme uses a drag-and-drop editor that makes it easy to start without a steep learning curve. AKA you don’t have to be a designer or data specialist to make something look really good and really fast.
2. Editable Infographic Templates
Their templates are a lifesaver. You can customize every element to match your company colors and look.
3. Charts and Graphs
Visme allows you to upload your data spreadsheets, then generates totally customizable charts, maps, or graphs. Bar/Axis/Radial graphs, Pie or donut charts, scatter plots, histograms, pictograms.
You name it, they got it.
4. Live Editing with Coworkers
Working on a team or remotely? Me and my coworkers can edit together from our own PC’s and make comments in real time - sort of like a google doc. It helps us be aligned if we’re presenting on a last minute deadline.
5. Integration-Heavy
Almost every single platform you already use can be integrated. Think: Hubspot, Salesforce, Monday.com, Dropbox, the whole 9 yards. Makes it super easy to move across platforms in a project.
Why You Should Try Visme
Basically give it a try. I think it’s def worth it.
I'm building this AI infographics generator product.
But I'm a developer not a designer. Want to understand more deeply from a designer's point of view about infographics design:
1, would you use an AI infographics product?
2, what's the biggest pain points?
3, what do you think is the best trade off between control and flexibility? what details do you want to control, what do you want to leave all to AI?
Thank you so much!
Hi, I’m on a elf growth journey and really would like to have a good visualisation meditation I can follow when I wake up in the morning. There are sooo many out there I’m unsure which to choose. Does anyone have any suggestions of what I can try?
Thanks :-)
hi r/visualization!
i'm comparing two groups to see if the treatments are significantly different, and originally, i had plotted bar charts with error bars (ggplot2 geom_bar and geom_errorbar), but when eyeballing my data, i noticed that the variance in the data is huge, regardless of treatment (means were not significantly different between treatments anyway).
i have four main metrics that i tested, so i had made four bar charts, but when i noticed the variability, i wondered if there's a better way to plot this. i calculated coefficients of variance both for metrics overall, and per treatment. certain metrics have higher CVs than others, and i want to figure out how to communicate this, while still displaying that no metrics had significant differences between treatments.
my thought process is, i change my four bar charts to be box plots and just put the p-value above (to indicate non-significance), then i create a grouped bar chart of the CVs (four groups of 3: treatment 1, treatment 2, overall- then times four).
is there a better way to do this? i don't want to have five bar charts on my research poster but i'm not sure what else to do. thanks!
🚀 I’m thrilled to present the Rose Pine Theme for R, a custom aesthetic for ggplot2 that blends modern minimalism with the serene elegance of the Rose Pine design philosophy. Drawing inspiration from its Python counterpart, this R-specific implementation has been refined to align with ggplot2’s unique functionality.
Sophisticated Aesthetic: Offers a calm, polished appearance, perfect for professional and academic visualizations.
Rich Color Palette: Integrates the signature Rose Pine color scheme, complemented by customizable accent tones to suit diverse datasets.Seamless Integration: Designed to effortlessly fit into standard ggplot2 workflows, saving you time while enhancing your plots.
Installing and applying the Rose Pine Theme is straightforward, and the result is a visually striking output tailored for high-impact presentations or publications. Here’s a glimpse of the theme in action, showcasing its versatility across different types of data.
Comprehensive documentation, including implementation guidelines, is available in the GitHub Repository. Check it out to explore usage tips and visual examples.
Feedback and contributions are highly encouraged! Whether you have suggestions for enhancements or want to contribute directly to the project, feel free to submit a pull request or open an issue on GitHub. Collaboration is the key to growth!
📩 Get in TouchFor any questions, feedback, or collaborative ideas, reach out via my portfolio website or connect with me on GitHub.
🔗 GitHub Repository: Explore the Rose Pine Theme for R
📌 ᴀʀᴇ ʏᴏᴜ ɪɴᴛᴏ ᴅᴀᴛᴀ ꜱᴄɪᴇɴᴄᴇ, ᴀɴᴀʟʏᴛɪᴄꜱ, ᴏʀ ᴠɪꜱᴜᴀʟɪᴢᴀᴛɪᴏɴ? ʟᴇᴛ’ꜱ ᴄᴏɴɴᴇᴄᴛ—ɪ’ᴅ ʟᴏᴠᴇ ᴛᴏ ᴄʜᴀᴛ ᴍᴏʀᴇ ᴀʙᴏᴜᴛ ɪᴛ!
Thank you
I have a large number of words, and I want to visualize their frequency of use in some data. This is exactly what a word cloud does. But i just don't like how.... floofy? they seem. Like something I'd see on etsy.
Beyond a bar plot with every word, is there another good way to visualize this data? Or ways to make the word cloud seem more scientific? I appreciate any advice
I’m playing WWE 2k24’s Universe Mode and keeping track of certain stats (who won what championships for how long, who’s friends/enemies with who, and how many matches a specific wrestler has won and lost). Initially, I used a coordinate graph on Desmos, but I wasn’t able to save it since it exceeded the file size limit. Other iOS apps wouldn’t let me add images instead of points (labelling each point would be messy since there are 96 wrestlers and there’s a lot of overlap), so while I’m looking for another app, I’m wondering if there’s a better way to accomplish my goal.
Also, if I’m able to store data for each wrestler (click on a name/point on the graph and get information about their championship stats, for example), I’d love it.
I never thought I’d be someone who could stick with a habit for this long, but here I am—116 days of meditation in a row. It started small, just 2 minutes a day, but tracking it in Mainspring habit tracker app kept me motivated to keep going.
At first, it felt like a chore, but now it’s something I actually look forward to. It’s helped me feel calmer, more focused, and way less stressed. Honestly, I’m just proud of myself for showing up every day.
Anyone else crushing their habit goals? Let’s celebrate some wins!
Hello guys, I am a masters student in data analytics and I have a data visualization team project.
I am a chess enthusiast who used to play a lot, I have a rating of around 2100-2200 on chess.com.
My team and I were really interested to find how psychological factors plays a role in chess.
We conducted our entire project based on a lichess data set of 200k games.
We studied how chess can teach us about psychology. We are presenting in front of a non chess audience with the hope to attract new players and show the beauty of the game. Here are the visuals as well as a blog post for some further explanation.
This is the link for the visuals:
https://public.tableau.com/app/profile/lynn.hajj/viz/Project_Chess41/Dashboard11?publish=yes
This is the link for the blog:
https://sites.aub.edu.lb/datavisualization/2024/11/25/the-psychology-behind-chess/
We would be grateful for any feedback! If that project had any impact on you please let us know!!