PROJ 0: CULTURE ANALYTICS - MYSQL

glegrady · Post by **glegrady** » Sat Dec 26, 2015 4:42 pm

CULTURE ANALYTICS - MYSQL

Media Theorist Lev Manovich defines Culture Analytics as "the analysis of massive cultural data sets and flows using computational and visualization techniques."

Create a MySQL query that creatively and insightfully reveals something about the Seattle Public Library dataset over a 10 year period (2006-2015)
----
CULTURE ANALYTICS - 2D VISUALIZATION SCHEDULE

1.05 Introduction to course and project
1.07 Software setup MYSQL, Processing, Introduction to MYSQL queries
1.12 Students present: MYSQL query, Lev Manovich article discussion
1.14 Intro to 2D demo in processing
1.19 Review of visual language basics: form, color, movement, etc.
1.21 Presentation of Culture Analytics project in 2D
----
The assignment in MySQL is to dig into the database of the Seattle Public Library consisting of over 80 million entries since September 2005 and to select a topic that reveals some cultural trends taking place in the past 10 years. Your assignment is to discover some interesting cultural content, trends and/or patterns within the data and then to visualize it in a 2D matrix. The library includes over 2 million items of books, cds, DVDs, and other forms of published material.

You will need to become familiar with 1) the actual data content of the library database, 2) the system by which the data is organized, including irregularities in the organizational structure 3) and how the items are used by the public – what is checked out, for how long, and how this may reveal cultural trends.

Approach:
. Get an overview of the dataset
. Explore topic specifics of personal interest
. Embrace detail to integrate complexity

Begin by asking what are people checking out, what may be of personal interest to you in books, movies, music, topics, etc. Go to the library site: https://seattle.bibliocommons.com/searc ... mit=search and search for items of interest to you, be it “Global warming”, “nuclear war”, “chinese cooking”, “beatles”, “Downtown Abbey”, etc. If there is some activity, then proceed to MySQL.

Methodologies:
1) Track the checkout history of the item or topic over the 10 year history to make sure there is adequate activity to study. Your queries should provide both general overview to details.

2) Compare the item/topic’s history to other similar ones. Select one or more items to see if performance patterns are comparable. Identify repeating patterns, and irregularities. Look for interesting, unexpected occurrences.

What to look for:
. Search for (unexpected) patterns in the data
. How often something happens (frequency)
. Anomalies in the system (errors, outliers, etc.) http://en.wikipedia.org/wiki/Anomaly_detection
. Association: Search for relationships between variables
. Do statistical analysis using MySQL aggregate functions: http://dev.mysql.com/doc/refman/5.6/en/ ... tions.html

Your Query Search Details
. Download a minimum 3 columns of metadata – this will be used for horizontal, vertical, and depth values for your visualization
. Standardize columns for output to .csv file: (vertical, horizontal, pixel value + more)
. Your search results should be sufficient to feature subtleties in the data to be expressed visually within a screen size between 1920x 1280 to 2560 x 1850 pixels

Process:
. Imagine a question
. Explore all fields of the database
. Test the query for logical and other errors
. Start with Booleans (AND, OR NOT), wildcards
. Make the query efficient (finetune)
. Interpret the results (data analysis)
. Export to .csv file to be used in visualization

Your Assignment Report:
. Concept/Question (describe what question you are exploring)
. Provide the Query
. Explain the Query
. Provide the results
. Give Processing Time
. Give an Analysis (report back on your results, explain how you did it, and what your outcomes are)

Post your Project:
. Print out as pdf and attach. Keep csv files in the ".csv" format. Post your report at http://www.mat.ucsb.edu/forum/ at MAT 259/Winter 2016
. PostReply to Proj 1 - Attach the pdf.

Grading
. Standard Completion of Project: B
. Revised, advanced functions: B+, A-
. Innovative question: A

changhe · Post by **changhe** » Wed Jan 13, 2016 6:59 pm

When “New” Books Are No Longer New

It’s a common situation that library buys new books every several months or each year, so does Seattle Public Library. The library has its own collection code defined. With the call number together with the collection code, it’s easy to know whether the book is a new one bought this year or not. Therefore the question is how does these “new” book attract readers over time. People might guess that those new books won’t be as popular as they arrived after 3 years. But is that the truth? How fast does the frequency pattern drop? Does the category of books affect as well? These are the concepts I try to explore.

akibmayadav · Post by **akibmayadav** » Wed Jan 13, 2016 8:30 pm

Is The Sun Out ?

Visiting a library is a recreational activity for some ,academic pursuit for some and research for a few others. But whether we humans take the step ahead and go visit the library for these activities depends on a lot of factors. A major factor that I thought would play a crucial role in this decision at an individual level would be the weather.I wanted to see how the number of checkouts in Seattle Public Library ,which is dependent on the peoples’ presence in library is affected by weather conditions.

ITERATION ONE :

What was done here ?
I started with comparing the the number of checkouts every month from 2006-2010.

What was observed?
There was a variation in the number of books being checked out every month.Two major conclusions were drawn from here.

1. The trend of decline of the number of books checked out in the months of December and January and the hike in the number of books checked out in the months of June-July has prevailed through years. This was because December and January are the months of extreme weather and June-July is when the weather is better.

2. There was a constant increase in the number of books checked out as the years passed.Particularly there was a hike in the number of books checked out during July 2009. July 2009 is historically known as the hottest month Seattle has ever seen.

In the month of July 2009, another aspect ,that I found out about when I searched about the 2009 Seattle heat wave , was that people found Seattle Public library to be the "one of the coolest spot to hang out in" during the Summer of 2009. It is evident from this post : https://weatherspark.com/history/29735/ ... ted-States

SQL QUERY :