<![CDATA[Data Integrity]]>https://di.samizdat.co/2021/https://di.samizdat.co/2021/favicon.pngData Integrityhttps://di.samizdat.co/2021/Ghost 3.40Thu, 17 Aug 2023 16:41:49 GMT60<![CDATA[Week 14]]>
  • Free Form
    • Individual meetings to discuss your prototypes

Assignment

  • Free Form
    • Complete your visualization and prepare to present it at our final class meeting in two weeks.
]]>
https://di.samizdat.co/2021/week-14/60882e600b95a66bc16266a9Tue, 27 Apr 2021 15:32:01 GMT
  • Free Form
    • Individual meetings to discuss your prototypes

Assignment

  • Free Form
    • Complete your visualization and prepare to present it at our final class meeting in two weeks.
]]>
<![CDATA[Final Proposal]]>(I'm having trouble logging in my Github account so I'll post my work here for now:)

]]>
https://di.samizdat.co/2021/final-proposal/607f299e0b95a66bc1626684Tue, 20 Apr 2021 19:27:00 GMT(I'm having trouble logging in my Github account so I'll post my work here for now:)

]]>
<![CDATA[Week 13]]>
  • Presentation
    • Kevin on Amanda Cox & The Upshot
  • Free Form:
    • Review proposals as a group
    • Meet individually to look over progress

Assignment

  • Free Form
    • Develop an initial prototype of your visualization
    • If your prototype is static, commit your mock-ups as a (potentially multipage) PDF called process/prototype.pdf
    • Whether you're
]]>
https://di.samizdat.co/2021/week-13/607f1b2a0b95a66bc162667bTue, 20 Apr 2021 18:19:33 GMT
  • Presentation
    • Kevin on Amanda Cox & The Upshot
  • Free Form:
    • Review proposals as a group
    • Meet individually to look over progress

Assignment

  • Free Form
    • Develop an initial prototype of your visualization
    • If your prototype is static, commit your mock-ups as a (potentially multipage) PDF called process/prototype.pdf
    • Whether you're building something screen based or simply using Excel & Illustrator to generate your visuals, place your most up-to-date data files in project/data and name your primary script project/sketch.js
    • Whatever form your project takes, the prototype must use real data; this is a first draft, not a pencil sketch! If your dataset is too large to visualize fully in one week, pull out a handful of representative cases and visualize those to show the range of outcomes.
]]>
<![CDATA[Week 12]]>
  • Presentation
    • Olivia on Forensic Architecture
  • A Thousand Suns
    • Final crit
  • Free Form
    • Lightning round: report on topics & data sources
    • Individual meetings & work in small groups

Assignment

  • Free Form
    • Select one idea to develop further for your final project and create:
      1. a one-page proposal,
      2. three concepts with two sketches
]]>
https://di.samizdat.co/2021/week-12/6075d2900b95a66bc1626671Tue, 13 Apr 2021 17:19:49 GMT
  • Presentation
    • Olivia on Forensic Architecture
  • A Thousand Suns
    • Final crit
  • Free Form
    • Lightning round: report on topics & data sources
    • Individual meetings & work in small groups

Assignment

  • Free Form
    • Select one idea to develop further for your final project and create:
      1. a one-page proposal,
      2. three concepts with two sketches apiece,
      3. a spreadsheet with one or more tabs arranging your collected data into a form you can begin generating charts from.
    • See the “Sketches and Data” section of the assignment for details…
]]>
<![CDATA[Week 11]]>
  • Presentation
    • Judy on Hans Rosling
  • A Thousand Suns: initial critique
  • Free Form discussion: postponed to next week

Assignment

  • A Thousand Suns
    • Complete your final version and commit your code and documentation in a folder called 3.mapping-quantities/final. If you have designed a static visualization, upload a PDF at the
]]>
https://di.samizdat.co/2021/week-11/606c98b90b95a66bc1626660Tue, 06 Apr 2021 17:24:04 GMT
  • Presentation
    • Judy on Hans Rosling
  • A Thousand Suns: initial critique
  • Free Form discussion: postponed to next week

Assignment

  • A Thousand Suns
    • Complete your final version and commit your code and documentation in a folder called 3.mapping-quantities/final. If you have designed a static visualization, upload a PDF at the proper scale and trim its art-board to be full-bleed. Your description of the project’s data and visualization approach should be in a file called 3.mapping-quantities/final/README.md.
  • Free Form
    • Refine your lists of 10 ideas and 5 data sources and be prepared to present them to the class next week.
]]>
<![CDATA[The Subtleties of Color]]>It is known that humans are unable to process all colors of the world. With our limited perceptions of color, the tools we use to transmit color data become crucial in our daily life. Color profiles like RGB, HSB and CIE L*C*h served different purposes depending on the

]]>
https://di.samizdat.co/2021/the-subtleties-of-color/606b420c0b95a66bc1626654Mon, 05 Apr 2021 17:00:52 GMTIt is known that humans are unable to process all colors of the world. With our limited perceptions of color, the tools we use to transmit color data become crucial in our daily life. Color profiles like RGB, HSB and CIE L*C*h served different purposes depending on the ultimate medium. The fact that humans are less able to perceive saturated blue than red and green has to be taken into when designing data visualizations.

The method of increasing lightness while shifting hue is a wonderful solution for visualizing the scale of data. However it is only appropriate for sequential datasets.  It is also interesting to see the colors used in divergent datas and the importance of catering towards color blind individuals.  Categorical data on the other hand is tricky to manage as a very controlled color palette is needed.

]]>
<![CDATA[Subtleties of Color]]>In this reading, Robert Simmon details the subtleties of color, and notes three types of datasets, those being sequential, divergent, and qualitative.  

Forgetting the specific definition of these datasets for a second, we first explore the fascinating ideas behind color and the power it has on perception and audiences. Different

]]>
https://di.samizdat.co/2021/subtleties-of-color/606373910b95a66bc1626620Tue, 30 Mar 2021 19:00:00 GMTIn this reading, Robert Simmon details the subtleties of color, and notes three types of datasets, those being sequential, divergent, and qualitative.  

Forgetting the specific definition of these datasets for a second, we first explore the fascinating ideas behind color and the power it has on perception and audiences. Different colors can enhance storytelling, graphics and work to make the viewer more interested and more engaged with the content in front of them. Simmon works to explain and analyze how color palettes should be shifted situationally, playing with ideas of hierarchy, depth and data visualization.

Regarding the three types of datasets, the first is sequential datasets, which are  represented with shifts in hue, saturation, and lightness simultaneously. This type of representation is most apt in order to depict data that has variations continuously from high to low.

Divergent datasets are useful in order to focus on differences. The color palettes for this type of dataset rely on variations of hue, lightness and saturation, and work the best when two contrasting hues and a neutral middle color are used.

Qualitative datasets are used to categorize items or data. Similar to how the data is separated into different categories, the use of various colors for qualitative datasets mirror the differentiation of categories and data.

I must note that while I understand how color can affect mood, tone etc, and have aptly used it in other design and artistic endeavors in order to convey the feelings I want, I had never consciously considered how utilizing varieties of color schemes and palettes would be more apt depending on the data. Rather, I believe I was more concerned with color in order to simply provide clarity and ease of understanding, and thinking about color palettes subconsciously.


]]>
<![CDATA[Subtleties of Color Response]]>Good use of different color palettes can enhance the storytelling, clarity, and attention aspects of different datasets. I think the blog series introduced the ideas of applying color scales to data nicely. It discussed about three types of datasets: sequential, divergent, and qualitative. These theories are interesting to me. I

]]>
https://di.samizdat.co/2021/subtleties-of-color-response-2/606284c90b95a66bc1626485Tue, 30 Mar 2021 18:19:53 GMTGood use of different color palettes can enhance the storytelling, clarity, and attention aspects of different datasets. I think the blog series introduced the ideas of applying color scales to data nicely. It discussed about three types of datasets: sequential, divergent, and qualitative. These theories are interesting to me. I do know color is a significant variable in datasets, but never thought too much about how different color schemes are suitable for different datasets.

Sequential datasets are suitable for colors scales that are smooth and even. The lightness and saturation of the palette should be uniform. Since this type of datasets includes continuously varying data (ex: from low to high), the color scheme should also reflect the continuously changing quality.

Divergent datasets focus on measuring differences. The concept of divergent palettes is introduced, which is composed of two sequential palettes merging with a neutral central color. This color is usually white and gray.

Qualitative datasets categorize things. As data is separated into different categories and classes, the colors for qualitative datasets should be as diverse and distinct from each other as possible. Because people all perceive things differently, and also because of the simultaneous contrast effect, the maximum number of categories that can be displayed and recognized is probably no more than 12.

I also think it is a big challenge to design a graph with color that can be perceived by normal people and colorblind people at the same time. What will a qualitative dataset look like under this idea?

]]>
<![CDATA[Week 10]]>
  • Presentation
    • Summer on the Washington Post graphics dept.
  • Reading #2: Subtleties of Color
    • To actually use your newfound understanding of color, start looking into using chroma.js in your sketches
      • Note the use of the .hex() method to convert from chroma’s color representation to p5’s on line 23
]]>
https://di.samizdat.co/2021/week-10/6063684a0b95a66bc16265c6Tue, 30 Mar 2021 18:08:49 GMT
  • Presentation
    • Summer on the Washington Post graphics dept.
  • Reading #2: Subtleties of Color
    • To actually use your newfound understanding of color, start looking into using chroma.js in your sketches
      • Note the use of the .hex() method to convert from chroma’s color representation to p5’s on line 23 of this example
    • If the chroma.js library is too heavyweight for your needs, take a look at my brewer palette generator and see if you find it easier to use. Consult this catalog to find the name of the palette you want.
  • A Thousand Suns
    • Review your sketches merging an external data set with the testing timeline
    • Pick one direction to develop for next week

Assignment

  • A Thousand Suns
    • Develop your initial visualization and commit your code and documentation in the folder called 3.mapping-quantities/project. If you have designed a static visualization, upload a PDF at the proper scale and trim its art-board to be full-bleed.
    • Fine tune the text and typography surrounding your diagram in order two provide ‘three reads’ in terms of information from headline to body text to legends & labels (similar to the three visual reads we explored in the first assignment).
    • Include a brief (just a few sentences) explanation of your project’s data and how it is presented in 3.mapping-quantities/project/README.md
  • Free Form
    • Spend an hour brainstorming ten ideas for your final project. Focus on data that seems interesting to you for reasons you might not be able to articulate, then start posing questions that you could potentially answer with that data (either alone or in combination with other information).
    • Describe each idea in a sentence or three (ideally ending with a concrete, testable question) in 4.final-project/process/ideas.md (you’ll need to run make update to pull down the template for this file first)
    • From your ten ideas, find five data sources on the web and document them in the file 4.final-project/process/datasources.md using the format demonstrated with the USGS example at the top of the file (and be sure to delete this once you've completed your list).
]]>
<![CDATA[Free Form]]>

Final Project

In this final project you will be bringing the conceptual dimension of the class together with the visualization techniques we’ve learned. You will develop and implement a final project following a complete, iterative design process. The first step in this is the creation of a set of

]]>
https://di.samizdat.co/2021/free-form/606368d10b95a66bc16265dfTue, 30 Mar 2021 18:08:01 GMT

Final Project

Free Form

In this final project you will be bringing the conceptual dimension of the class together with the visualization techniques we’ve learned. You will develop and implement a final project following a complete, iterative design process. The first step in this is the creation of a set of proposals.

Process

You should aim to generate about 10 ideas during the next week that are intellectually distinct, address a diverse range of points of view, and explore different subject domains and levels of complexity. Work rapidly and don't get hung up on polishing anything digitally — limit yourself to handwritten notes and sketches during this phase while you are doing research (recommended: 10 ideas in a 1 hour session).

During the research and sketching process, address the following questions:

  • What are some observations you find noteworthy?
  • What insights could this offer?
  • How is it related to visualization, what can visualization reveal here?
  • What new ideas came out of the process of sketching and research?
  • What are the next steps?

Refine your list daily and let your initial set of ideas sit for at least a day before you take things a step further. Talk about your ideas with a peer outside of class (their level of expertise does not matter here).

Ideas & Sources (due 13 Apr)

List your ten ideas in the process/ideas.md file in your 4.final-project directory. Each idea can be described in 2 or 3 sentences, but make sure you outline the subject matter being considered, the question being asked, and the format (digital vs print, static vs interactive, diagram vs text vs map, etc.) you intend to use.

For five of these, find at least one datasource and record the details in your process/datasources.md file. There is a placeholder entry in the file describing the USGS feeds. Delete that entry, but use it as a model for the five new entries you'll be adding.

Concept Design & Development

Based on the feedback you’ve received for your 10 ideas from the ideation phase, choose one direction and create three different series of wireframe sketches (with at least 2 sketches per series) to illustrate how you plan to convey your subject visually. Your wireframes will be static representations of the eventual user interface plus schematic views of your visualization. The individual components of the sketch should be rough, functional approximations of the final placement, size, and interaction type (click, drag, hover, etc.) of UI elements but shouldn't yet be concerned with aesthetics.

Develop these sketches quickly enough to explore alternative approaches that present the subject in different ways. Experiment both with the visual representation of the data (in terms of the retinal variable mappings we've examined) and the different affordances your UI provides, allowing the user to pose different questions and filter/sort/focus the information in different ways.

In parallel with your design work, build a proof-of-concept illustration that your data-source is sound and will be able to provide the quantitative and qualitative information necessary for your final visualization. To that end, write a simple p5 script in which you read in one of your data files (whether CSV, JSON, or otherwise) and print/draw its values to a canvas. This sketch will be the basis for the visualization portion of your project and should not incorporate any of your UI design ideas until we get to the next step of the process.

Sketches and Data (due 20 Apr)

Select one idea to develop further for your final project and create a one-page proposal, 3 wireframe sketches, and a p5 script that loads in the data you’ve collected as either a JSON file or CSV table:

  • Summarize your final project ideas as a 1-page project proposal. Include rough visual sketches to illustrate your approach as needed.
    • The written description can be short (200–500 words) and should establish the subject matter, the question you’re trying to answer, and what you expect your intended audience to take away from the project. Be sure to address the questions listed above in the “Process” section.
    • The sketches are there only to help communicate your overall concept, so don’t spend too much time ‘designing’ them. If you don’t include a sketch, then describe your visualization approach thoroughly.
    • Create a PDF called proposal.pdf that combines your written description and supporting sketches and place it in the process folder.
  • Bring in three conceptually distinct approaches to visualizing your chosen dataset, depicted with at least 2 sketches apiece.
    • The sketches should illustrate both the representation of the data and the user interface approach.
  • Collect the data files (plural!) that contribute evidence to your subject (look in particular for CSV, TSV, JSON, XML, or SQL sources) and write a p5.js script that reads in your data file and generates something visual from it.
    • The form of the visual representation is not important, but it must include every field from the data source that you'll be using in your final visualization.

Initial Prototype (due 27 Apr)

Continue developing your wireframes and draw up a set of static representations of your user interface that demonstrate all of the ‘feature complete’ version’s eventual affordances, modes, and states. Devote particular attention to screen (or page) layout, typography, color mappings, contrast (both in terms of lightness and type hierarchy), and overall look and feel. These static mock-ups will serve as the ‘specification’ which the rest of your development work this semester will attempt to implement.

  • Collect your mock-ups in a single PDF file and place it in your repository as process/prototype.pdf.

  • Clean your data and convert it to a more programmatically usable (CSV, JSON, TXT, or XML) form. If you’re working off of a ‘live’ data source, learn the relevant API. In the case of relational databases, ensure you have a full understanding of its schema and prepare the queries you’ll be using. Submit your p5 sketch that successfully parses the data as data/sketch.js and place any local data files you’ll be using in data/assets.

Final (due 11 May)

You will present your completed projects at our last class meeting. Good luck!

]]>
<![CDATA[Sophie Fu]]>Specifically noting the perception of color, I am wondering whether or not certain ranges of colors would not be perceived as different from one another. Not to the extent of color blindness, in which green and red are highly indistinguishable, but rather in the case of the equiluminant colors chart

]]>
https://di.samizdat.co/2021/sophie-fu-4/606359100b95a66bc16264c8Tue, 30 Mar 2021 17:33:29 GMTSpecifically noting the perception of color, I am wondering whether or not certain ranges of colors would not be perceived as different from one another. Not to the extent of color blindness, in which green and red are highly indistinguishable, but rather in the case of the equiluminant colors chart where the spread is radial. The closer to the center of the circle the colors get, the more vague the color differences end up being. In that sense, we can see the same effect happen in our Retinal Variables charts, for the hue and value gradiations the initial start was nearly unperceivable.

In that sense, it seems to be necessary to avoid the usage of green, as the general blend of color seems to accentuate this problem even more so than other colors. Color palettes that have both a progression in lightness as well as a shift in hue seem to be the best solution as the two ends of the color spectrum, as a result, vary so drastically from one another that the middle increments have a more distinguishable step up from the previous.

On the otherhand, it is interesting that for Divergent Data, because the focus is more on the two ends rather than all values, the full palette tends to want more of the eye catching variances to avoid the center range. Categorical Data focuses on distinct hues for each value, with the reading stating that the maximum number is around 12, or possibly even fewer. To go over this estimation would mean that certain colors may start to resemble another specific color on the spectrum, which may cause confusion in perceiving the chart if it is a large scatter of colors.

]]>
<![CDATA[Reading 2]]>Subtleties of Color presents the dilemmas and their corresponding solutions of color choices in data visualization. Since, unlike computers, the human eyes perceive colors subjectively, color design precautions are necessary for a data visualization to serve its purpose of illuminating data.

Based on the three different types of data––sequential,

]]>
https://di.samizdat.co/2021/reading-2-4/606355ff0b95a66bc16264a3Tue, 30 Mar 2021 17:23:05 GMTSubtleties of Color presents the dilemmas and their corresponding solutions of color choices in data visualization. Since, unlike computers, the human eyes perceive colors subjectively, color design precautions are necessary for a data visualization to serve its purpose of illuminating data.

Based on the three different types of data––sequential, divergent, and qualitative, the three basic aspects of colors, hue, saturation, and lightness, are to be manipulated separately or collectively to fit the dataset. Sequential data is best displayed with a palette that varies in both lightness and saturation, with the supplementary addition of shift in hue; divergent data is best displayed with bifurcated palettes with a neutral central color; qualitative data should be represented by a set of easily distinguishable colors, or colors that vary sharply in hue, saturation, or lightness. In general, palettes appropriate for its content and distinguishable to human perceptions are the perfect palettes.

In addition to the fundamentals, there are some subtle aspects of color design for data visualization. Considering the perceptual norms of colors, the color palettes should be tailored for each dataset based on intuitions, layering, complementary datasets, breakpoints, separations, and hierarchies.

]]>
<![CDATA[The Subtleties of Color]]>The three different types of datasets discussed are sequential, divergent, and qualitative. The series of posts does a great job with analyzing the correct color palettes to use in each situation. When used correctly, color has the ability to enhance quality, aid storytelling, and draw a viewer in. How does

]]>
https://di.samizdat.co/2021/r2/6063488a0b95a66bc1626492Tue, 30 Mar 2021 16:03:41 GMTThe three different types of datasets discussed are sequential, divergent, and qualitative. The series of posts does a great job with analyzing the correct color palettes to use in each situation. When used correctly, color has the ability to enhance quality, aid storytelling, and draw a viewer in. How does color contribute to multi-level storytelling? In what ways can color create hierarchy? I thought it was most interesting to see how color placement can evoke a sense of depth. If used incorrectly, this sense of depth can skew data and create misleading information. Therefore, color truly does play an important part in making data easily accessible while maintaining the data’s integrity.

Sequential datasets require simultaneous shifts in hue, saturation, and lightness because they vary continuously from high to low.

Divergent datasets make use of continuous linear color palettes that rely on variations of hue, lightness, and saturation. Two contrasting hues with a neutral middle color is the best for this dataset.

Qualitative data are best to be represented by various colors since the increased amount of data requires a higher level of differentiation.

After reading the series of articles I am left with questions pertaining to how we can make color accessible for blind people? Especially since color is an integral component to understanding the information presented. Also, I wonder what determines if use of a color is qualitative or quantitative. Qualitative data uses color to separate areas into categories. Therefore, it is is best to have very different colors in the set to avoid confusion. However, I am curious how colors that are too similar can possibly affect the integrity of the data. It seems that color choice is important in providing a viewer with clarity.

]]>
<![CDATA[Reading 2 - Subtleties of Color]]>The blog explores the problem of color in data visualization, when most often a color palette is picked out with free whims and contaminates the data set.

One interesting point in the beginning of the blog is the inaccuracy of human's perception of color with the computer generated gradient. It

]]>
https://di.samizdat.co/2021/reading-2-2/605baeff0b95a66bc1626434Mon, 29 Mar 2021 21:51:00 GMTThe blog explores the problem of color in data visualization, when most often a color palette is picked out with free whims and contaminates the data set.

One interesting point in the beginning of the blog is the inaccuracy of human's perception of color with the computer generated gradient. It points out that among the 3 main variables, lightness is the most common and easiest one to spot. There're 3 main color palette put in use, sequential data palette, divergent color palette, and categorical color palette. The one thing I haven't noticed before is the accessibility of the colors. When color blindness is taken into consideration, the choice of accessible color palette can be come very challenging.

At the end of the blog, we've also get a sense of color's meaning is more or less cultural and relies heavily on context (the data set). When the color can intuitively communicate the data's information (eg. green as plants, grey as deserted lands, blue as water), it's highly recommended to follow such rules to help the general readers understand the data more easily.

]]>
<![CDATA[Final Project List]]>
  • Film visualization – Éric Rohmer's Tales of the Four Seasons
  • Celestial data visualization - Past and Future 50 years of Sky, 1970-2070
  • Emotion visualization - twitter keyword data set within 2020
  • New York City wild life (looking at Citizen app's alert involving wild life)
  • Bechdel test on past 10 years film
  • ]]>
    https://di.samizdat.co/2021/final-project-list/605e3b190b95a66bc1626470Mon, 29 Mar 2021 20:03:00 GMT
  • Film visualization – Éric Rohmer's Tales of the Four Seasons
  • Celestial data visualization - Past and Future 50 years of Sky, 1970-2070
  • Emotion visualization - twitter keyword data set within 2020
  • New York City wild life (looking at Citizen app's alert involving wild life)
  • Bechdel test on past 10 years film (more defined film category needed)
  • ]]>