PROJ 0: CULTURE ANALYTICS - MYSQL
Posted: Sat Dec 26, 2015 4:42 pm
CULTURE ANALYTICS - MYSQL
Media Theorist Lev Manovich defines Culture Analytics as "the analysis of massive cultural data sets and flows using computational and visualization techniques."
Create a MySQL query that creatively and insightfully reveals something about the Seattle Public Library dataset over a 10 year period (2006-2015)
----
CULTURE ANALYTICS - 2D VISUALIZATION SCHEDULE
1.05 Introduction to course and project
1.07 Software setup MYSQL, Processing, Introduction to MYSQL queries
1.12 Students present: MYSQL query, Lev Manovich article discussion
1.14 Intro to 2D demo in processing
1.19 Review of visual language basics: form, color, movement, etc.
1.21 Presentation of Culture Analytics project in 2D
----
The assignment in MySQL is to dig into the database of the Seattle Public Library consisting of over 80 million entries since September 2005 and to select a topic that reveals some cultural trends taking place in the past 10 years. Your assignment is to discover some interesting cultural content, trends and/or patterns within the data and then to visualize it in a 2D matrix. The library includes over 2 million items of books, cds, DVDs, and other forms of published material.
You will need to become familiar with 1) the actual data content of the library database, 2) the system by which the data is organized, including irregularities in the organizational structure 3) and how the items are used by the public – what is checked out, for how long, and how this may reveal cultural trends.
Approach:
. Get an overview of the dataset
. Explore topic specifics of personal interest
. Embrace detail to integrate complexity
Begin by asking what are people checking out, what may be of personal interest to you in books, movies, music, topics, etc. Go to the library site: https://seattle.bibliocommons.com/searc ... mit=search and search for items of interest to you, be it “Global warming”, “nuclear war”, “chinese cooking”, “beatles”, “Downtown Abbey”, etc. If there is some activity, then proceed to MySQL.
Methodologies:
1) Track the checkout history of the item or topic over the 10 year history to make sure there is adequate activity to study. Your queries should provide both general overview to details.
2) Compare the item/topic’s history to other similar ones. Select one or more items to see if performance patterns are comparable. Identify repeating patterns, and irregularities. Look for interesting, unexpected occurrences.
What to look for:
. Search for (unexpected) patterns in the data
. How often something happens (frequency)
. Anomalies in the system (errors, outliers, etc.) http://en.wikipedia.org/wiki/Anomaly_detection
. Association: Search for relationships between variables
. Do statistical analysis using MySQL aggregate functions: http://dev.mysql.com/doc/refman/5.6/en/ ... tions.html
Your Query Search Details
. Download a minimum 3 columns of metadata – this will be used for horizontal, vertical, and depth values for your visualization
. Standardize columns for output to .csv file: (vertical, horizontal, pixel value + more)
. Your search results should be sufficient to feature subtleties in the data to be expressed visually within a screen size between 1920x 1280 to 2560 x 1850 pixels
Process:
. Imagine a question
. Explore all fields of the database
. Test the query for logical and other errors
. Start with Booleans (AND, OR NOT), wildcards
. Make the query efficient (finetune)
. Interpret the results (data analysis)
. Export to .csv file to be used in visualization
Your Assignment Report:
. Concept/Question (describe what question you are exploring)
. Provide the Query
. Explain the Query
. Provide the results
. Give Processing Time
. Give an Analysis (report back on your results, explain how you did it, and what your outcomes are)
Post your Project:
. Print out as pdf and attach. Keep csv files in the ".csv" format. Post your report at http://www.mat.ucsb.edu/forum/ at MAT 259/Winter 2016
. PostReply to Proj 1 - Attach the pdf.
Grading
. Standard Completion of Project: B
. Revised, advanced functions: B+, A-
. Innovative question: A
Media Theorist Lev Manovich defines Culture Analytics as "the analysis of massive cultural data sets and flows using computational and visualization techniques."
Create a MySQL query that creatively and insightfully reveals something about the Seattle Public Library dataset over a 10 year period (2006-2015)
----
CULTURE ANALYTICS - 2D VISUALIZATION SCHEDULE
1.05 Introduction to course and project
1.07 Software setup MYSQL, Processing, Introduction to MYSQL queries
1.12 Students present: MYSQL query, Lev Manovich article discussion
1.14 Intro to 2D demo in processing
1.19 Review of visual language basics: form, color, movement, etc.
1.21 Presentation of Culture Analytics project in 2D
----
The assignment in MySQL is to dig into the database of the Seattle Public Library consisting of over 80 million entries since September 2005 and to select a topic that reveals some cultural trends taking place in the past 10 years. Your assignment is to discover some interesting cultural content, trends and/or patterns within the data and then to visualize it in a 2D matrix. The library includes over 2 million items of books, cds, DVDs, and other forms of published material.
You will need to become familiar with 1) the actual data content of the library database, 2) the system by which the data is organized, including irregularities in the organizational structure 3) and how the items are used by the public – what is checked out, for how long, and how this may reveal cultural trends.
Approach:
. Get an overview of the dataset
. Explore topic specifics of personal interest
. Embrace detail to integrate complexity
Begin by asking what are people checking out, what may be of personal interest to you in books, movies, music, topics, etc. Go to the library site: https://seattle.bibliocommons.com/searc ... mit=search and search for items of interest to you, be it “Global warming”, “nuclear war”, “chinese cooking”, “beatles”, “Downtown Abbey”, etc. If there is some activity, then proceed to MySQL.
Methodologies:
1) Track the checkout history of the item or topic over the 10 year history to make sure there is adequate activity to study. Your queries should provide both general overview to details.
2) Compare the item/topic’s history to other similar ones. Select one or more items to see if performance patterns are comparable. Identify repeating patterns, and irregularities. Look for interesting, unexpected occurrences.
What to look for:
. Search for (unexpected) patterns in the data
. How often something happens (frequency)
. Anomalies in the system (errors, outliers, etc.) http://en.wikipedia.org/wiki/Anomaly_detection
. Association: Search for relationships between variables
. Do statistical analysis using MySQL aggregate functions: http://dev.mysql.com/doc/refman/5.6/en/ ... tions.html
Your Query Search Details
. Download a minimum 3 columns of metadata – this will be used for horizontal, vertical, and depth values for your visualization
. Standardize columns for output to .csv file: (vertical, horizontal, pixel value + more)
. Your search results should be sufficient to feature subtleties in the data to be expressed visually within a screen size between 1920x 1280 to 2560 x 1850 pixels
Process:
. Imagine a question
. Explore all fields of the database
. Test the query for logical and other errors
. Start with Booleans (AND, OR NOT), wildcards
. Make the query efficient (finetune)
. Interpret the results (data analysis)
. Export to .csv file to be used in visualization
Your Assignment Report:
. Concept/Question (describe what question you are exploring)
. Provide the Query
. Explain the Query
. Provide the results
. Give Processing Time
. Give an Analysis (report back on your results, explain how you did it, and what your outcomes are)
Post your Project:
. Print out as pdf and attach. Keep csv files in the ".csv" format. Post your report at http://www.mat.ucsb.edu/forum/ at MAT 259/Winter 2016
. PostReply to Proj 1 - Attach the pdf.
Grading
. Standard Completion of Project: B
. Revised, advanced functions: B+, A-
. Innovative question: A