Search found 8 matches

by shaokang
Mon Nov 28, 2022 10:44 pm
Forum: Fall 2022
Topic: wk10 - 11.29.22 Final Project
Replies: 4
Views: 13861

Re: wk10 - 11.29.22 Final Project

Final Project: Clustering & Dimensionality Reduction. Clustering and dimensionality reduction are two effective approaches used in data analysis. Clustering is often used to see whether there’s any grouping pattern in the data, while dimensionality reduction is helpful for visualizing high-dimensional data. ...
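As a rough illustration of how clustering can reveal a grouping pattern, here is a minimal k-means sketch in plain Python. The sample points, the function name `kmeans`, and the evenly spaced initialisation are all assumptions made for this example; the original post does not say which algorithm or data was used.

```python
import math

def kmeans(points, k, iterations=20):
    """Plain k-means (Lloyd's algorithm) on 2-D points; returns (centroids, labels)."""
    # Assumption: start from k evenly spaced points so the run is deterministic.
    step = len(points) // k
    centroids = [points[i * step] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return centroids, labels

# Hypothetical data: two visibly separate groups of points.
points = [(1.0, 1.1), (0.9, 1.0), (1.2, 0.8), (8.0, 8.2), (7.9, 8.1), (8.3, 7.7)]
centroids, labels = kmeans(points, k=2)
```

With real data, the initial centroids would usually be chosen at random and the run repeated several times, since k-means can settle on different groupings depending on where it starts.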
by shaokang
Sun Nov 20, 2022 8:02 pm
Forum: Fall 2022
Topic: wk9 - 11.22.22 Random Sampling
Replies: 4
Views: 4027

Re: wk9 - 11.22.22 Random Sampling

Random sampling is a method of estimating characteristics of a whole population by randomly sampling a subset of it. For this week’s assignment, I am interested in the questions below: * Using random sampling, I try to find whether there is any pattern within the sampled data...
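The idea of estimating a population characteristic from a random subset can be sketched as follows. The population here is made up for illustration, and `estimate_mean` is a hypothetical helper, not something from the original post.

```python
import random
import statistics

def estimate_mean(population, sample_size, seed=0):
    """Estimate the population mean from a simple random sample (without replacement)."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    sample = rng.sample(population, sample_size)
    return statistics.mean(sample)

# Hypothetical population: a numeric value for each of 10,000 items.
population = [i % 50 for i in range(10_000)]

true_mean = statistics.mean(population)
estimate = estimate_mean(population, sample_size=500)
```

The estimate will not match the true mean exactly, but with a sample of a few hundred items it typically lands close to it, which is what makes sampling useful on large tables.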
by shaokang
Wed Nov 16, 2022 11:54 pm
Forum: Fall 2022
Topic: wk8 - 11.15.22 Outliers
Replies: 4
Views: 4878

Re: wk8 - 11.15.22 Outliers

For this week’s assignment, I try to find outliers of different kinds. * Using the standard deviation of checkout times to find the most popular and unpopular items within the CD category * Using both purchase number and checkout times as indicators of popularity, applying algorithms to find out the outlier...
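A standard-deviation test of the kind mentioned in the first bullet might look like the sketch below. It assumes that "checkout times" means the number of checkouts per item; the item names, the counts, and the threshold of 2 standard deviations are invented for the example.

```python
import statistics

def find_outliers(counts, threshold=2.0):
    """Return (low, high): items more than `threshold` std devs below/above the mean."""
    values = list(counts.values())
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return [], []  # all values identical, so nothing stands out
    low = [name for name, c in counts.items() if (c - mean) / stdev < -threshold]
    high = [name for name, c in counts.items() if (c - mean) / stdev > threshold]
    return low, high

# Hypothetical checkout counts for ten CDs.
checkouts = {
    "cd_a": 12, "cd_b": 15, "cd_c": 14, "cd_d": 13, "cd_e": 11,
    "cd_f": 16, "cd_g": 14, "cd_h": 13, "cd_i": 12, "cd_j": 120,
}
low, high = find_outliers(checkouts)
```

Note that a single extreme value inflates the standard deviation itself, so this test tends to flag only the most extreme items; counts are also bounded below by zero, which makes unusually unpopular items harder to detect this way.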
by shaokang
Mon Nov 07, 2022 11:36 pm
Forum: Fall 2022
Topic: wk7 - 11.8.22 4 MySQL commands new to you
Replies: 4
Views: 4726

Re: wk7 - 11.8.22 4 MySQL commands new to you

In this week’s report, I use JSON operations, bit operations, and string functions to find some interesting titles and numerical patterns.
week07_01.csv
(20 Bytes) Downloaded 82 times
week07_03.csv
(93.78 KiB) Downloaded 82 times
week07_02.csv
(922 Bytes) Downloaded 84 times
week07_04.csv
(23.29 KiB) Downloaded 75 times
Week 07 New SQL Commands.pdf
(266.61 KiB) Downloaded 80 times
by shaokang
Tue Nov 01, 2022 2:06 am
Forum: Fall 2022
Topic: wk6 - 11.01.22 MidTerm Presentation
Replies: 4
Views: 4711

Re: wk6 - 11.01.22 MidTerm Presentation

Frequent-pattern mining is an analytical process that finds frequent patterns or associations in data sets. For example, grocery store transaction data might show a frequent pattern in which people usually buy chips and beer together. With this tool, I would like to check if there’...
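The chips-and-beer example can be made concrete with a simple pair-counting sketch. This is only the first step of frequent-pattern mining (counting co-occurring pairs against a minimum support), not a full Apriori or FP-growth implementation, and the transactions and the `frequent_pairs` helper are invented for illustration.

```python
from collections import Counter
from itertools import combinations

def frequent_pairs(transactions, min_support):
    """Return item pairs that co-occur in at least `min_support` transactions."""
    counts = Counter()
    for items in transactions:
        # Deduplicate and sort so each pair is counted once per transaction
        # and always appears in the same order.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}

# Hypothetical grocery transactions.
transactions = [
    ["chips", "beer", "salsa"],
    ["chips", "beer"],
    ["beer", "diapers"],
    ["chips", "salsa"],
    ["chips", "beer", "diapers"],
]
pairs = frequent_pairs(transactions, min_support=3)
```

Here only the chips and beer pair reaches the support threshold of three transactions; lowering `min_support` would admit weaker associations as well.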
by shaokang
Tue Oct 18, 2022 11:54 am
Forum: Fall 2022
Topic: wk4 - 10.18.22 Discover Patterns with MySQL
Replies: 4
Views: 4491

Re: wk4 - 10.18.22 Discover Patterns with MySQL

Over the past decades, numerous data storage devices have appeared and dominated the market at some point. For example, the VHS format, developed by JVC, was released in 1976, and in 2016, no one would make VHS products since there are much more advanced storage devices (DVD, Blu-ray, etc.). Instead of u...
by shaokang
Tue Oct 11, 2022 2:07 am
Forum: Fall 2022
Topic: wk3 - 10.11 .22 2nd Project in MySQL
Replies: 4
Views: 4855

Re: wk3 - 10.06 .22 MySQL queries studies

A report focused on two topics: SPL hotspot devices and the 2019 No Late Fee Policy. Thinking of adding one more topic, about the legacy items. Here's the report: (Revised as of 10.17) Assignment 02_revised.pdf Topic 01 results: SPL_overview.csv Hotspot_devices_type.csv models_over_years.csv Topic...
by shaokang
Mon Oct 03, 2022 1:57 pm
Forum: Fall 2022
Topic: wk2 - 09.29.22 1st Project in MySQL
Replies: 4
Views: 4282

Re: wk2 - 09.29.22 MySQL queries studies

Code and analysis are in this PDF: Assignment 01 Shaokang.pdf 1. Find the single longest and shortest borrow time 2. What’s the number of books in different Dewey classes (items with the same bibNumber count once)? 3. What’s the number of items in different Dewey classes (items with the same bibNumber count multi...