2022 Fall



SLAPDASH BOT

Curating art with a random word generator Twitter bot...

Read More

Documentary Resources Workflow

Workflow to pull moving images, videos, and clips from the Library of Congress (LOC) API to match a documentary theme...

Read More

Met Highlight Wikidata Project

Compare object records from the Metropolitan Museum of Art API with corresponding data from the Wikidata API...

Read More

Forsaken Bones

Uncovering forgotten histories of African American cemeteries in the United States...

Read More

WHCL FM What's Playing?

Webscraping tool to make a simple HTML list of songs being played ...

Read More

Fanfiction Folksonomy Case Study

Tracking folksonomy development over time by recording freeform tags applied to fanfiction on Archive of Our Own (AO3) for Our Flag Means Death...

Read More

DSpace Translator

Python script that translates the contents of a DSpace collection using Google's translation API...

Read More

Plotting Discogs Data in Plotly

A starting point for visualizing music data from Discogs....

Read More

Exploring NYC's Dirtiest Restaurants

Plotting restaurant datasets from the Department of Health and Mental Hygiene...

Read More

Is the Met becoming more Gender-inclusive?

Explore themes of inclusion and representation in culture, as reflected in the collections and acquisitions of the Metropolitan Museum of Art...

Read More

Library Social Media Use by State

Analysing social media use by public libraries...

Read More

2021 Fall



Contemporary Chinese Art Price Trends

Compiled 40 years of auction data to analyze and visualize...

Read More

College: The Cliff Notes

Exploring a variety of data related to higher education…

Read More

Covid Vaccination Sentiment Analysis using Twitter Feed

Sentiment analysis of Covid vaccination data using Twitter's API...

Read More

Nazi-Era Provenance Metadata and Linked Art Models

Examining the metadata available for Nazi-era Provenance paintings within The Metropolitan Museum of Art's Provenance Research Collection...

Read More

The State of Art in WIKIDATA (with a limit)

Attempt at a global survey of artwork data in Wikidata…

Read More

What Users Want: Python for User Survey Data Analysis

Analyzing survey results from students and faculty at The New School…

Read More

2021 Spring



Comic Book Sales Trends

Web scraping and analyzing 17 years of comic book data...

Read More

Harvard Art Museums Color Palettes

Web app that displays color palettes from objects at the Harvard Art Museums...

Read More

werdnerd

An etymology search engine...

Read More

Video Game Sales: An Analysis

Exploring video game sales from different regions of the world...

Read More

Correlating Movie Genres and Dates

Investigating movie data to see if there are defining correlations between genres and release dates...

Read More

Understanding the Plants of Central Park

Mapping project that shows the locations of, and information about, all of the plants in Central Park...

Read More

RomCom Thermom

Using sentiment analysis to map romance and comedy screenplays...

Read More

Filming New York

Exploring filming trends around the city using NYC Open Data’s Film Permits dataset...

Read More

ICON_bot: Visual Examples for Iconclass

Pairing Iconclass notations with examples drawn from the Rijksmuseum in Amsterdam...

Read More

NYC Landmark Buildings

Analyzing the characteristics of landmarked buildings in New York City...

Read More

Car Here Now History

Analyzing a year and a half of parking history in NYC...

Read More

Orchid Fever in the American Press

Searching digitized historic newspapers for articles on orchid hunting published between 1881 and 1960...

Read More

Human and Python NGrams

A foundation for generating word clouds from JStor Data for Research...

Read More

2020 Fall



Timeline of Roosters at the Brooklyn Museum

A timeline of rooster images at the Brooklyn Museum...

Read More

NYC Reported Crimes Visualization

Explore criminal activity in New York City with criminal complaint reports from 2006 to 2019...

Read More

Songwriter:Re

Generative text project that uses Python to remix Lorde lyrics pulled from a random assortment of 20 songs...

Read More

Weather Twitter Bots

Two companion Twitter bots that analyze weather patterns in Ithaca, NY...

Read More

Surreal Art Bot

A Twitter bot that creates new Surrealist works of art that don't exist...

Read More

Comparison of Male and Female Writers

Comparing the number of male and female writers born in the U.S. from 1900 to 2000 and the change of proportions...

Read More

Unforgotten Restaurants

Restaurants which have been temporarily or permanently closed due to the Covid-19 pandemic...

Read More

Sonic Youth Lyric Bot

Twitter bot posting random lyrics from Sonic Youth...

Read More

2020 Spring



MET Painting API Analysis

An analysis of the Metropolitan Museum of Art API service...

Read More

Startup Investments

An analysis of startup investment data...

Read More

Brooklyn Museum Cross Stitch Bot

A Twitter bot that would post cross-stitch patterns of objects in the Brooklyn Museum collection...

Read More

NYC Street Trees

Using Python to find insights about street trees throughout the NYC area...

Read More

Pratt listserv Work Opportunities

Analysis of the different types of work opportunities emailed out on the Pratt Institute School of Information listserv...

Read More

Women in Movies

Bechdel Test using The Movie Database API...

Read More

Just Keep Swimming

Tracking the movements of female Olive Ridley and Hawksbill Sea Turtles...

Read More

Building a Seed Library Catalogue

Web scraping and structuring a website that sells rare and historic seeds...

Read More

Auto-Poems

Generating new poems from the Poetry Foundation corpus...

Read More

Parentage on Mount Olympus

Web scraping to illustrate the overlapping and contradictory parentages of the Greek gods...

Read More

Dungeons & Python

Python programming to create a 5th Edition Dungeons & Dragons character generator...

Read More

21st-Century Collecting Trends

21st-Century Collecting Trends at The Met, Cleveland, and Harvard...

Read More

Oscar Winners of 2011-2020

Mining the data of the Oscar winners from 2011-2020...

Read More

2019 Fall



Planned Construction and Airbnb

Analyzing and geospatially visualizing planned construction in New York City in terms of potential future Airbnb revenue...

Read More

Time Series of Stocks

Using the Yahoo Finance API to compare companies' stocks over the last two years ...

Read More

Asian Restaurants Price Map

Mapping Chinese, Korean and Japanese restaurants and their prices in NYC...

Read More

Automated Image Classification and the Metropolitan Museum

Analysis of Google Vision, Amazon Rekognition, and human-generated tags for Met collections...

Read More

Mapping the 2018 Squirrel Census of Central Park

Explore data from the 2018 Squirrel Census of Central Park...

Read More

SAAM API Artists

Explores demographic data from the Smithsonian American Art API to illuminate the work of underrepresented artists...

Read More

Mapping NYC Music

Common artists listened to in New York, popular venues, streaming numbers and genres...

Read More

Lyrics Sentiment Analysis

Sentiment analysis on the top 40 songs of five artists - Drake, Ariana Grande, Mariah Carey, Ne-Yo, and Rihanna...

Read More

Hard to Count Populations & US Census

Exploring correlation between U.S. Census response rate and funding...

Read More

Duterte's Drug War

Examines data provided by the Columbia University School of Journalism's Stabile Center for Investigative Journalism...

Read More

Analysing sneakers sales in StockX

To understand the popularity, price stability and premium value of the Adidas Yeezy and Nike Jordan...

Read More

Pop science media & scholarly discourse

Bridging popular science media and scholarly discourse with linked open data...

Read More

MOMA Analysis

Using Python to parse the MOMA's collection...

Read More

2019 Spring



US Therapists

Analysis and visualization of therapist availability in the United States by state. ...

Read More

Robert Rauschenberg Foundation Archives EAD

ArchivesSpace Repository Finding Aid: EAD XML to CSV ...

Read More

The New Deal in New Orleans

Webscraping FSA and WPA photo collections ...

Read More

311 Live Map

Creating a 'live' map of 311 data for the City of New York ...

Read More

Mapping Pre-Prohibition Alcohol in Brooklyn

Mapping distilleries, breweries and residences of brewers and distillers in Brooklyn in 1869, 1903 and 1912 ...

Read More

Object Records at the Fogg Museum

Using the Harvard Fogg Museum API ...

Read More

RhinoScriptSyntax Exploration

Visualizing CSV data in 2D and 3D space ...

Read More

Native Arts at the Denver Art Museum

Scraping Denver Art Museum’s collections website ...

Read More

Human vs. Machine

Comparing human keyword tags with keyword tags provided by the Google Cloud Vision API ...

Read More

Top Songs Sentiment Analysis

Sentiment analysis on the lyrics of the top 20 songs for the last 10 years ...

Read More

Groundwater Contamination in Tennessee

Looking at the effect of coal ash contamination on groundwater in the state of Tennessee ...

Read More

MoMA's Collection: Gender & Artist Collectives

Analyze gender dynamics in MoMA's collections ...

Read More

Visit Like a New Yorker

Visualizing and mapping using Yelp API ...

Read More

2018



NYC Park Monuments

Pulls information from a publicly available CSV about New York park monuments and combines that information with data harvested from Wikidata into a single file ...

Read More

Organizing Image Collections

Explores how Python may be used as a tool to separate 100,000 photos into sub-collections ...

Read More

IMDB visualization

Use Python to get movie data from IMDB, from the 1980s to the present, and use that data to create a genre-by-year data visualization ...

Read More

Anxiety of Influence

Mapping and Visualization Project for Translations of Fantastic Literature into Spanish during the 19th-Century ...

Read More

citibike_ebike

Analyzing Changes in Citi Bike Trips after the Introduction of E-Bikes

Analysis of how Citi Bike trips changed after Citi Bike introduced about 200 e-bikes on August 20th, 2018. ...

Read More

Stock Ticker Capital Appreciation Comparison

Visualize financial information in a way that is more digestible for non-tech-savvy audiences ...

Read More

Exploring Chinese Traditional Medicine

Explore the topic of Traditional Chinese Medicine to learn which herb is most commonly used across all formulas ...

Read More

Some New Classics

Taking a close look at the NYRB Classics to understand the cultural makeup of this collection ...

Read More

Where are the MTA Art Works?

Scraping and mapping the MTA's subway station artworks ...

Read More

Sentiment Analysis of “#gene_editing” tweets

Sentiment analysis for a Twitter hashtag focused on CRISPR gene-edited twin babies ...

Read More

Scraping for Library Jobs

Scraping job postings from three popular professional associations' websites: ...

Read More

Images of the Solar System

This project uses APIs to collect image records of the solar system from cultural heritage and science institutions. ...

Read More

NYC HIV/AIDS Services and Facts Dashboard

This dashboard includes facts and figures related to HIV/AIDS data available through NYC Open Data ...

Read More

Web Scraping of Turner Paintings

Scraping the Tate Collection website for Joseph Mallord William Turner paintings ...

Read More

U.S. Documentaries, 1878 to 2017

Use data from IMDB to compare the growth of documentary versus non-documentary movies ...

Read More

Confronting Documentation of the US War on Terror

Takes existing metadata about US government documents collected by the nongovernmental organization the ACLU and re-presents it in new ways ...

Read More

2017



Hopper at the Theatre

This project investigates the cultural and theatrical landscape of Edward Hopper’s New York using linked open data (LOD) technologies ...

Read More

Time(s) Splitter

With Burroughs as inspiration, I chose to query the New York Times API as my resource, much in line with Burroughs’s cut-up material of choice, the newspaper ...

Read More

contamiNation

Mapping data on lead contamination of United States public water sources ...

Read More

Spotlight on New York Vaudeville

Investigation of what elements may be of greatest interest to researchers using NYPL's performing arts database Ensemble ...

Read More

Endangered Plants of Westchester and Fairfield Counties

Creating a list of endangered plants through web scraping ...

Read More

Peshawar Scrapin'

Processing documents from CIA FOIA Electronic Reading Room related to the Peshawar Seven ...

Read More

Hudson River Museum Data

Visualizing the collections of the Hudson River Museum ...

Read More

Documenting ICE

A tool to compile a central collection of records documenting U.S. Immigration and Customs Enforcement: FOIA requests, detention facilities, available government records, and resources ...

Read More

Visualizing the Paston Letters Network

Visualizing the Paston Letters Network displays the network generated between letter writers and recipients from the Paston Letters and Documents collection ...

Read More

Python and the Queer: Utilizing coding to analyze the American LGBTQ community through data and information

Exploration of Python coding to analyze what government and media information expresses about different aspects of the LGBTQ community ...

Read More

Failure to Communicate : Taxonomy vs. Folksonomy in Hip-Hop Cataloging

This project examines library cataloging practice versus user-generated metadata for hip-hop and rap music ...

Read More

Exploring Magic the Gathering (and Gathering) Data

Measuring aspects of the Magic the Gathering card game data ...

Read More

2016



Ghosts in The New York Times

For this project, I wanted to explore the coverage of ghosts and haunted houses by The New York Times ...

Read More

New York State Landmarks

For this project I chose to explore datasets related to historic landmarks in New York State ...

Read More

Monday Night Wars

"Monday Night Wars: Data From Wrestling's Golden Age" is a project that uses Python programming and data visualization techniques to unpack an era of professional wrestling called the "Monday Night Wars." ...

Read More

Mapping Brooklyn Historical Society

Webscraping and mapping the Brooklyn Historical Society's digital images collection ...

Read More

Mollusca

Exploring The Biodiversity Heritage Library holdings on Mollusca ...

Read More

Mapping Alan Lomax

Using the metadata created by the Association for Cultural Equity, this project aims to create a world map, where Lomax’s career is visible for users to trace ...

Read More

Visualizing The Rijksmuseum

This project utilizes the Rijksmuseum API and the Getty Research Institute’s Union List of Artist Names Vocabulary (ULAN) to take a closer look at the museum’s collection ...

Read More

Boston MFA Tsuba Collection

Scraping and analysis of the online tsuba collection of the Boston Museum of Fine Arts ...

Read More

Wikidata Exploration

Exploring Wikidata Properties With the Linked Jazz Name Directory ...

Read More

Photography Of James Van Der Zee

A timeline of some of James Van Der Zee’s photographs ...

Read More

Repurposing Archival Metadata

Repurposing Archival Metadata with the Python CSV Writer ...

Read More

Lyric Analysis

Using web scraping to extract song lyrics from Azlyrics.com for analysis ...

Read More

2015



DBO:INFLUENCE

Investigation focused on particularly subjective properties in the DBpedia ontology: influences ...

Read More

Linnaeus Tripe’s Photography

Analysis of V&A’s collection of Linnaeus Tripe’s photography of South India and Burma (Myanmar) ...

Read More

Linked Jazz Meets Carnegie Hall

Connecting Carnegie Hall's performance data with Linked Jazz ...

Read More

Metropolitan Hall of Fame

Connect images, often artistic renderings of baseball’s earliest stars, with the official player statistics for the 310 people in the Hall of Fame ...

Read More

Land of the Free Music Archive

A network of artists found in The Free Music Archive connected via sonic properties ...

Read More

Python Radio

Exploration and analysis of online data about Old Time Radio (between the 1920s and early 1960s) ...

Read More

Tack Spitter Circus

Analysis of posters from The Golden Age of American Circus ...

Read More

Linked Open Data at the Whitney

Mapping Artists from the Permanent Collection (1931-1948) ...

Read More

Artists’ Books Holdings

An attempt to analyze and visualize data about artists’ books holdings on an international scale ...

Read More

Persian Manuscripts in Cultural Institutions

Analysis of Persian Manuscripts holdings in two museums ...

Read More

2014



Analyzing Modernism

Modern and Contemporary Painting and Sculpture at The Metropolitan Museum of Art is a data analysis and visualization capstone project ...

Read More

Educators Twitter Network

Visualization of 500 self-identified teachers on Twitter ...

Read More

Tumblr Image Bot: a friendly social-media robot

Randomly-selected photo posts to Tumblr along with appropriate caption text and tags ...

Read More

JSON-LD for Cemetery Data

This project’s naissance was in the bottom shelves of a local history & genealogy department of a middling public library in Maine ...

Read More

NYT Timeline

This project is a timeline from 1915 to the present showing headlines with the words “birth control” from the New York Times. ...

Read More

Scraping & Analyzing Photography Auctions

I want to investigate the circumstances that have led to the establishment of photography as an "accepted high art form" and that conception's subsequent influence on the sales of photography at auction ...

Read More

Quipu: Chinese Exclusion Act

An attempt to build a dictionary of uniform metadata for primary sources on a single subject. It queries resources in digital archives and special collections to retrieve metadata ...

Read More

Spanish Artists Dictionary Project

The Spanish Artists Dictionary (SAD) is a reference source created by scholars at the Frick Art Reference Library. Originally a print publication, the dictionary was formatted as a FileMaker database in the early 1990s and made available through the Frick’s online research portal ...

Read More

Wiki Scrape

For my final project I scraped data from the Harry Potter Wiki site and transposed that information onto a timeline, using Knightlab’s TimelineJS program ...

Read More