Category Archives: Spring 2014

Beyond Citation: Building digital tools to explain digital tools

Over the last couple of weeks, the Beyond Citation team has transformed into a web production team of sorts, focused on making key decisions about platform, site architecture, user interaction, design, and communication.

Beyond Citation—a project to build a website that aggregates accessible, structured information about scholarly databases—has the potential to enhance how scholars approach, use, and interpret resources from some of today’s most widely used digital collections. While it would be straightforward for our team to simply gather and publish information about those resources, our challenge is to build a digital tool that supports meaningful interaction with that information, one that can also scale in the future and cater to a community of contributors.

In the project’s nascent stages, the tactical concerns before us are familiar: we’re taking on the common challenge of building and launching a website or web app. Thrust into the very practical realm of software, decisions, and constraints, we find that discussions of critical theory are set aside while we weigh the merits of WordPress and Drupal. These powerful tools place the project in a digital ecosystem much wider than academia. The platform we have chosen, WordPress, pushes us deeper still into the wide worlds of relational databases, server-side scripting, and content management: the digital tools that will allow us to explain other digital tools.

As we construct the basic building blocks for the site, we find that the best way to focus our approach is by seeking the advice of experts, reading blogs about WordPress customization, and learning more about MySQL and WordPress taxonomies. The robust open source community behind WordPress has enabled us to confirm that the technical requirements for the Beyond Citation website can be met many times over through combinations of WordPress plugins.

Something to consider while building this tool with WordPress is that we are seeking to publish data about proprietary tools by using open source technology. Perhaps this isn’t really so unusual: we see it in a similar vein as the increasingly popular APIs that allow for easier data aggregation or configuration from multiple sources. Toolsets that are hybrids of proprietary and open source systems are also extremely common.

But there’s an important depth to explore when thinking about Beyond Citation as a bridge between proprietary and open source systems. The idea of “exposed” information, built on “hidden” information, represents a direction that the project can try to push technically. For instance, if in a future iteration the team can uncover information about scholarly databases that’s not just hard to find, but not openly available (such as how search algorithms work, or the criteria behind publisher contracts), then I think the value of Beyond Citation increases in a direction most closely aligned with its original ambition. This would also allow the project to explore the similarities and differences in how scholarly databases work in more meaningful ways.

Before we can do that, everyone on the team is doing their part to fill in knowledge gaps, and discovering “how technology works” on multiple levels. Just as we are researching the types of information about scholarly databases that we want the project to highlight, we are also researching the types of data-driven web frameworks that could easily support such information. Like many Digital Humanities projects, Beyond Citation is about knowledge acquisition and aggregation for both developers and researchers. We are challenging ourselves to learn as much as we can about one set of digital tools before we can communicate new information about other sets of digital tools—both of which are moving targets, evolving in their own realms of authorship.

As we work towards a May launch date for an early version of the site, we realize that the authors of digital projects need a constant appetite for more knowledge—technical knowledge and subject-matter knowledge—in order to create and maintain an authoritative tool.

Follow us on Twitter as we get ready for May: @beyondcitation

It’s a Two-Fer!

Travelogue group members
Sarah – Project Manager
Amy – Technology and Design
Melanie – Outreach and Communication
Evonne – Research
Adam – Technology and Design

Last week, due to illness, the Travelogue’s outreach and communication person was ironically silenced.  However, that means this week there is twice as much Travelogue team blog fun to catch up on!

Travelogue’s Twitter page has a great new logo courtesy of Adam.  Initially, the first Travelogue logo did not look great when sized down for Twitter.  Adam also created the Travelogue logo that appears on Travelogue’s Commons page.  Throughout the design process, Adam shared drafts for input from the group.  Amy has been hard at work on the design and content of Travelogue’s Commons page.

Last Monday, March 3rd, the team, sans one under-the-weather outreach and communication member, presented an update on the project status to the DH Praxis class.  In preparation, Sarah created an action plan outlining how each team member could explain the progress the team has made so far.

Sarah met with our DH Praxis professor Matt Gold to go over the scope of the project and get his input on the team’s current ideas.  Sarah is working on the Travelogue website’s wireframe and created a mockup of the layout.  She is also continuously working on the project plan.  The team has been actively communicating; to organize that communication and each team member’s responsibilities, Sarah established an Asana page for the team.

Evonne has been compiling research resources, organizing the research conducted so far, noting what needs further research, and maintaining citations in a Travelogue Zotero page.  Using Evonne’s extensive research and the Gale database Directory of Special Libraries and Information Centers as a guide, Melanie has been reaching out to multiple academic institutions.  The preliminary goal is to introduce the Travelogue project, request info on the usage of content (for example from the Library of Congress), and build relationships from there.  Through the Travelogue Twitter account, Melanie has followed organizations working on mapping projects and will be actively creating engaging content in the pursuit of followers.

The team has been exploring ArcGIS Story Maps as the mapping tool for the project.  A schedule of meetings outside of class is being established so that we can brainstorm collaboratively face to face.  The team is looking into whether Travelogue will parallel the travel narratives of the chosen authors (Ernest Hemingway and Zora Neale Hurston), literally displaying the travel trajectories of both on the same map, or whether each author’s journey will be depicted on a separate map.  The website’s URL is also currently being decided upon.

If you want to contact us please do. Our project blog is at  travelogue.commons.gc.cuny.edu. Email us at dhtravelogue [at] gmail [dot] com or follow us on Twitter @DhTravelogue

New Friend, New Platform for DH Box

Cross-posted from: https://dhbox.commons.gc.cuny.edu/blog/2014/dh-box-new-friend-new-platform


This week the DH Box team reconsidered their choice of platform, with the help of Dennis Tenen, a professor at Columbia University in the Digital Humanities and New Media Studies program (and former developer with Microsoft).

A couple of weeks ago we were surprised and delighted to find that another team had come up with the idea for a portable tool that could help users quickly get going with DH applications. And this week we found that Professor Tenen and colleagues had also discussed how to tackle such a project and had come up with yet a different solution! In discussing that solution, we found that it matched our aim of making it easy for new users to set up an environment quickly, and it made us change our focus for both implementation and outreach.

Beyond Citation: Wireframes A Visual Tool

Web designers should aim to create a satisfactory and enjoyable user experience.  As I think about scholars and librarians, the individuals who are most likely to visit the Beyond Citation website, I wonder how its design will aid in the discovery of new information. Because wireframes guide the placement of rectangles on grids and the appropriate use of negative space (any space which is not in use by an item) as an integral part of the design, I imagine that, if they are implemented well, the user will be visually attracted to the website. As the website’s designer, I believe the best way to alleviate concerns around layout is to use wireframes, which are meant to support the purpose and main idea of the imagined website. Although the wireframe appears simplistic because it is often completed in black and white, once executed in HTML5 and CSS, the wireframe becomes the underlying structure that will ultimately point users to the discovery of information.

The placement of content and the function of fields should each complement the user’s experience, and promote ease of use.  Wireframes are building blocks that can aid in developing the personality of the website by emphasizing type size while minimizing the use of words and utilizing rectangles to describe content placement. The wireframe’s adaptive nature aids in responsive design, and may consider varied grid widths to accommodate computer screens, tablets and mobile phones. The website’s navigational roadmap is conceived through the developed wireframe, and is assisted by design that makes its primary statement within the confines of the wireframe.  Wireframes visually describe the construction of web pages.

Wireframes can be created on tablets and in apps, which makes it easier to revise and share them and turns them into a collaborative tool. Wireframes also alleviate worry as they create a complementary relationship between the idea, the design and pixels, and when completed, usher in the next stage of the website’s development, which is scripting.  Wireframes are the blueprint that will be utilized to create the Beyond Citation website.

Opening DH Box

This is it! The inaugural post of the DH Box blog (the DH stands for Digital Humanities). Here we intend to make the process of planning, creating, and publicizing the DH Box transparent for our readers. Hopefully this provides some inspiration, and even a blueprint, for future collaborative DH projects.

But let’s not get ahead of ourselves! First, some questions and answers:

What is DH Box?

Not much, so far. But we intend it to be a portable, customized environment for Digital Humanities learners that can rely on incredibly inexpensive technology. All you really need is a computer (and a monitor and keyboard, of course!) — but the platform that excites us most is the Raspberry Pi, a tiny computer that sells for just $35. Imagine a collection of DH tools, pre-installed and configured, and a set of texts for users to interrogate — all on a portable and inexpensive device.

What inspired the idea of DH Box?

Several ongoing humanities projects have begun to take advantage of the continuing miniaturization of computing technology. One in particular excited my imagination: Library Box, which repurposes a wireless router into a “portable digital file distribution tool…that enables delivery of educational, healthcare, and other vital information to individuals off the grid.” The possibilities for ‘embedded’, specialized miniature computers are massive.

What is needed to run DH Box?

Our first major goal is to get DH Box running on the Raspberry Pi. Once that’s done, DH Box will also be runnable on nearly any Linux computer! We are also targeting OS X.

Who do you think will use DH Box?

Anyone and everyone who is interested in learning Digital Humanities inquiry techniques, but especially those who may not have any prior programming experience. We hope that instructors will use our tools to set up almost instant DH labs, and that students will use DH Box to get an edge in their research.

We see DH Box as an example of what is likely to be a robust and interesting future field, ‘humanities hardware’.

Who are we?

We are an interdisciplinary team of learners and do-ers, librarians and developers and digital humanists and more — with an interest in making DH work more accessible. Find us:

dhbox.org
@DH_Box
hello@dhbox.org

More to come as we continue to develop DH Box!

Refining our focus and finding connections

The DH Box team has been working hard on defining the scope for DH Box and setting up our project plan. We’ve started using Asana as our project management tool. As the project manager, I’m really enjoying Asana. It’s flexible, easy, and it allows our team to collaborate on building the plan as we go. It’s also very nice that it tracks everything and sends out plenty of reminders!

Our scope has been narrowing down as we refine our concept of DH Box. We are thinking more about who will use DH Box and thinking about the best way to make it a valuable toolkit for introductory students in digital humanities classes.

Pedagogy is a key part of the digital humanities at the CUNY Graduate Center and the Praxis Network. Our focus for the first phase of development will be text analysis and topic modeling, including key tools such as Mallet, Natural Language Toolkit (NLTK), and the Stanford Named Entity Recognizer. We are going to build an interactive textbook using IPython Notebook. The textbook will be bundled with the DH Box install scripts and it will help orient students to the tools through interactive code execution. We have also thought more about our platform and what would be most useful for our users. We are going to make DH Box available for download not only for Raspberry Pi but also for Linux, Mac, and hopefully Windows.

As we have narrowed down our scope, we are also discovering a much wider range of connections to the DH community. Our professor, Matt Gold, has put us in touch with his colleague Dennis Tenen. GC Digital Fellow Micki Kaufman suggested we check out Ian Milligan’s work, and we’ve found amazing stuff in Big Digital History: Exploring Big Data through a Historian’s Macroscope, a co-written manuscript by Shawn Graham, Ian Milligan, and Scott Weingart. My library colleague Roxanne Shirazi, who edits the dh+lib blog, suggested we check out an idea for a project called DH creator stick which George Williams proposed at THATCamp Piedmont 2012 (see also a blog post by Mark Sample).

We’re amazed by the range of rich ideas we are beginning to discover. We hope to reach out to the DH community and ask for advice and feedback as DH Box takes shape.

Beyond Citation: Understanding Databases

Every year, more and more research is done by scholars online via academic databases. Print journals, scholarly monographs, newspapers, periodical indexes, and even ephemera and image collections are steadily transitioning from print to electronic.

Historically, research using print collections took place in library reading rooms with material owned by the library. Increasingly, research using electronic collections takes place outside of the library using proprietary digital platforms subscribed to by libraries. This change greatly affects how libraries function — an ownership model morphs into an access model — and how research is done. Database searches are crucial to uncovering information, but little is known about how these searches work. Additionally, it’s not always easy to find what full text content is covered in these database titles.

The goal of Beyond Citation is to help the researcher to better understand how academic databases work, and provide easier access to the database’s holdings information. For the CUNY Digital Praxis Seminar, the Beyond Citation team needed to determine which databases to feature in its initial launch, and what information to gather about each title.

First, we wanted to feature humanities databases and steer away from STEM (Science, Technology, Engineering, and Mathematics) titles. Second, we ideally wanted to cover titles that were available at the CUNY Graduate Center’s Mina Rees Library, and we wanted representation from the big three “e” vendors: EBSCO, Gale, and ProQuest. Additionally, we wanted to cover different kinds of content, including historical newspapers, scholarly journals, and historical e-books from both non-profit and for-profit companies.

After much discussion, the Beyond Citation team has decided to focus on the following databases and collections for its initial launch.

Google Books

HathiTrust

ArtStor

ProQuest Historical Newspapers

19th Century U.S. Newspapers (Gale)

Early English Books Online (EEBO) with TCP (Text Creation Partnership) (ProQuest)

Gale Artemis: Primary Sources – Nineteenth Century Collections Online (NCCO) and Eighteenth Century Collections Online (ECCO).

JSTOR

Project Muse (Johns Hopkins University Press)

Artemis Literature Resources (Gale)

EBSCO Humanities Source

We are open to and eager for feedback from users of these titles, or from any other researchers and librarians who use databases in their research. More to come in future posts on what information we hope to gather from each title, and how that information will be displayed. You can reach us at BeyondCitation [at] gmail.com

Travelogue team journal post #2

Travelogue group members
Sarah – Project Manager
Amy – Technology and Design
Melanie – Outreach and Communication
Evonne – Research
Adam – Technology and Design

Monday, February 24th

Since the last class meeting, the Travelogue team has decided to focus on two American authors, Zora Neale Hurston and Ernest Hemingway.

Amy has created the Travelogue Commons site, which includes photos of the two chosen authors, the Travelogue logo, Twitter button, contact form (including a Travelogue gmail account) and a bio page featuring photos of the Travelogue team members.  Each team member has been working on a short bio and those will be posted soon.  The Travelogue email includes a signature with the team’s Twitter handle.

Evonne has created a research plan for the project and added it to the Travelogue Google Drive folder.  She also created a Zotero folder for the project to track resources and references. Evonne will cross-post the resources and references in the Google Drive folders for each author.

Adam has updated the Travelogue logo that can now be seen on the Commons site and soon on the Twitter page.  He has continued to research Omeka+Neatline.  Adam is exploring HTML, CSS and other resources that will be helpful once a mapping platform has been chosen for the project.

Sarah has organized a consultation meeting for the team with Steven Romalewski.  The goal is to decide on a mapping platform that fits the Travelogue project scope.  Sarah has also provided the team with a list of “action items” and organized a schedule of weekly check-ins for the team.

In thinking about Travelogue as a pedagogical tool, but also an accessible resource for those outside of an academic environment, I have been exploring how to identify who the target audience is.  I have been using the Journal of Digital Humanities as a resource to research best publicity practices for a DH project.  I have continued to document the Travelogue team’s progress in journal posts and updated the team’s Twitter.

-Melanie

If you want to contact us please do. Our project blog is at  travelogue.commons.gc.cuny.edu. Email us at dhtravelogue [at] gmail [dot] com or follow us on Twitter @DhTravelogue

Travelogue team journal post #1

Travelogue group members
Sarah  – Project Manager
Amy  – Technology and Design
Melanie  – Outreach and Communication
Evonne  – Research
Adam  – Technology and Design

The Travelogue project will disrupt and broaden the expatriate narrative, while at the same time compiling American literary travel narratives and timelines with web mapping.  Mapping these journeys for display on an interactive website will provide both a visual and theoretical representation of modern literary movements in America, enabling the humanities community to gain a broader understanding of the history and underlying structure of these works.  It will also act as a pedagogical tool, allowing students to see narratives and literary movements represented through interactive, visual means, and as a general source of information for a wider public audience.

Thursday, February 20th

The team has been off to a successful start, communicating consistently through the Travelogue CUNY Commons group page that Amy created.  As a group, we have been discussing what the scope of the project is and what we would like it to look like.

Sarah created a Google folder for the project.  The folder features the project plan Excel spreadsheet and sheets for info on each of the four authors Travelogue will feature.  Sarah has been providing an outline for the project scope, noting details of the author’s “life journey” that Travelogue should be highlighting.

We have been exploring a diverse list of American authors who have traveled substantially and/or lived abroad.  This week we plan on solidifying the list of four authors.  Zora Neale Hurston http://chdr.cah.ucf.edu/hurstonarchive/ and Ernest Hemingway http://www.jfklibrary.org/Research/The-Ernest-Hemingway-Collection.aspx will most likely be featured.  Evonne has been researching the authors, narrowing down the list to those who fit the Travelogue criteria and have the greatest volume of digital content available.  She has created a Google doc with the data collected.

Amy and I have been researching tutorials and guides for the possible platforms.  We have been sharing the info and links on the group’s Commons page.  Amy and I have also researched possible authors to feature, focusing on female authors.  I created a Twitter account for Travelogue and shared the account info with the group.  During the next collaborative class session, I will inquire as to what the best practices are for sharing project progression details publicly through social media.

Possible platforms the group has discussed:

– CartoDB
– Mapbox
– Google Maps + Google Fusion Tables
– Omeka + Neatline

Adam sketched a logo for Travelogue.  We all agreed it was great.  He has scanned it and has been actively sharing drafts of the logo with the group as he works on the design.  Adam has also been researching Neatline+Omeka, along with other platforms and tutorials.  The group is looking forward to consulting with Steven Romalewski on which platform would be best and most feasible within the scope of the project.  The front runner, platform-wise, has been Omeka+Neatline.  Sarah has also been researching CartoDB, its functionality, and the cost involved in using it.

If you want to contact us please do. Our project blog is at  travelogue.commons.gc.cuny.edu. Email us at dhtravelogue [at] gmail [dot] com or follow us on Twitter @DhTravelogue

DH Box: Tackling Project Scope

We have this great Digital Humanities project idea, but what happens between now and launch time?

With an idea like DH Box (a customized Linux OS with preinstalled DH Tools and the flexibility to operate on a computer as cheap and portable as the Raspberry Pi), there are a number of directions we could take, and we will certainly consider them for further iterations of DH Box beyond the Spring term. (This blog currently documents the experiences of a project team enrolled in a graduate course in Digital Humanities Praxis at the Graduate Center, CUNY.)

In order to refine the scope of our tool, we asked ourselves some questions:

  • What approach will we take around educating users about coding, the infrastructure around the DH Box software, hardware, and operating system?
  • Which DH Tools should we include? See Alan Liu’s curated list for more info on the scope of DH tools out there
  • What user(s) are we building this for?

The success of our project hinges on our ability to carefully model the scope of the tool by shaping the answers to these questions . . . all by May 12th (public launch date)!

Educational Value

Beyond providing a collection of accessible DH Tools, we want DH Box to help bridge knowledge gaps by delivering a strong educational component. We’d like, for instance, undergraduate English students to gain exposure to and develop proficiency in Digital Humanities inquiry through the kind of guidance and practical experience DH Box will offer. To that end, we will begin an interactive textbook to provide instruction about the specific tools included in this first iteration of DH Box. We are most inspired by the Learn Code the Hard Way interactive textbook series by Zed Shaw.

Tools

We are gearing this version of DH Box to bring Topic Modeling and Text Analysis to Humanities students!

We began by considering the most popular DH Tools out there and quickly realized it made a lot of sense to whittle the list down for this current project phase. We’ve made choices based on optimal software performance with the Raspberry Pi. We also want to provide DH Tools that haven’t yet proliferated as widely as some of the more popular content management systems, such as WordPress.

Users

Undergraduate Humanities students currently have little familiarity with terms like tokenization, sentiment analysis, etc., and how these components of text analysis can open expansive modes of textual inquiry. As part of its mission, DH Box will work to make these methods accessible to a broad audience!
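For readers who have not met the term before, here is a rough idea of what tokenization looks like in practice. This is only a minimal sketch in plain Python, not the DH Box textbook material or the NLTK tokenizer; the `tokenize` function, its simple splitting rule, and the sample sentence are all made up for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens.

    Hypothetical helper for illustration: it keeps only runs of letters
    and apostrophes, which is far cruder than a real tokenizer.
    """
    return re.findall(r"[a-z']+", text.lower())


# Made-up sample sentence, not taken from any DH Box corpus.
sample = "The sun also rises. The sun also sets."

tokens = tokenize(sample)   # the text as a list of word tokens
counts = Counter(tokens)    # how often each token appears

print(tokens)
print(counts.most_common(3))
```

Once a text is broken into tokens like this, counting, comparing, and modeling words becomes possible, which is the starting point for the text analysis and topic modeling tools mentioned above.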

Stay tuned for exciting updates on implementing the install scripts, using IPython Notebook, and more!

 

Questions? Comments? Tweet us!