Welcome to our community.

In this community, you can submit ideas, vote on existing ideas, or add comments.

To submit an idea, click the Submit New Idea button at the top of the navigation sidebar. You will then be asked to add a title and choose a campaign for the new idea. You will also have the option to add tags to the idea. To vote on an idea, click the up or down arrow to the right of the idea title/description. To add a comment, click in the box below the idea.

If you would like to see all ideas created with a specific tag, click that word or phrase in the tag cloud in the navigation sidebar, under "What we're discussing". You can also view ideas sorted by campaign from the right navigation area. To return to this page, click the All Ideas link.

Campaign: 2014 Digital Stewardship Idea Challenge

File fix-it daemon

A lightweight common daemon that can run on any kind of OS, and be set up with a variety of parameters, with the goal of paging through files in a designated location (like a file system) and doing designated "fix-it" tasks, and then reporting back to humans on what it did.
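As a rough illustration of the idea above, here is a minimal Python sketch of one pass of such a daemon: it walks a designated directory, applies a list of "fix-it" tasks to each file, and returns a human-readable report. Everything named here (`FixTask`, `strip_trailing_whitespace`, `run_pass`) is hypothetical and not part of the submitted idea; the single example task is only a stand-in for whatever fixes an institution would actually configure.

```python
import os
from pathlib import Path
from typing import Callable, Iterable, List, Optional

# Hypothetical task interface: a task takes a file path and returns a short
# description of what it changed, or None if the file needed no fix.
FixTask = Callable[[Path], Optional[str]]


def strip_trailing_whitespace(path: Path) -> Optional[str]:
    """Example task (an assumption): strip trailing whitespace in .txt files."""
    if path.suffix != ".txt":
        return None
    original = path.read_text(encoding="utf-8")
    fixed = "\n".join(line.rstrip() for line in original.splitlines())
    if original.endswith("\n"):
        fixed += "\n"
    if fixed == original:
        return None
    path.write_text(fixed, encoding="utf-8")
    return "stripped trailing whitespace"


def run_pass(root: Path, tasks: Iterable[FixTask]) -> List[str]:
    """Page through every file under root once and report what was done."""
    task_list = list(tasks)
    report: List[str] = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = Path(dirpath) / name
            for task in task_list:
                try:
                    result = task(path)
                except (OSError, UnicodeError) as exc:
                    # Report the failure instead of stopping the whole pass.
                    result = f"error: {exc}"
                if result is not None:
                    report.append(f"{path}: {result}")
    return report
```

A long-running daemon could call `run_pass` on a schedule and send the returned report lines to a log or an email; that scheduling and delivery layer is left out of this sketch.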

Submitted by (@jman00)

Category: Applied Research for Cost Modeling and Audit Modeling

Voting

2 votes
Active

Campaign: 2014 Digital Stewardship Idea Challenge

Audio Content Analysis for Data Mining

How can we use audio content analysis algorithms to augment speech-to-text analysis and entity extraction when generating metadata for audio at scale? Pitch detection and spectrogram analysis could improve speech-to-text by providing speaker differentiation and identifying and filtering background noises during the transcription process. This research should result in identifying algorithms that are both accurate and ...

Category: Preservation at Scale

Voting

2 votes
Active

Campaign: 2014 Digital Stewardship Idea Challenge

Text and Audio Equivalence

Use a mixture of text-to-speech and speech-to-text algorithms against a test corpus of audio recordings, digitized transcripts of recordings, and digital transcripts of the recordings to develop a means to algorithmically identify relationships between audio, image, and text materials. Matching up these different file formats could then serve as a basis to map networks of relation between items and serve in the purpose ...

Submitted by (@trow00)

Category: Understanding Information Equivalence and Significance

Voting

1 vote
Active

Campaign: 2014 Digital Stewardship Idea Challenge

"The Brooklyn Corpus" Testbed for Digital Preservation Research

The Canterbury corpus (http://en.wikipedia.org/wiki/Canterbury_corpus) was created as a testbed of files for use as a benchmark in testing compression algorithms. This DSI Challenge Idea is for the creation of "The Brooklyn Corpus," a similar testbed of both obsolete and contemporary file types that can be used for a variety of purposes in digital preservation research. Working with a defined body of files encompassing ...

Category: The Evidence Base for Digital Preservation

Voting

1 vote
Active

Campaign: 2014 Digital Stewardship Idea Challenge

Technical Knowledge Transfer (TKT)

Problem: Scale is a key aspect of digital preservation, in the number of objects, the size of objects, and the timeframe for action. Yet most education deals with the same scenario: one object is passed into a tool (DROID, JHOVE, etc.) and the output is analyzed. At the same time, the community has created tools and projects working on a larger scale. However, the expertise to implement these tools remains locked in a range of inaccessible ...

Category: Preservation at Scale

Voting

1 vote
Active

Campaign: 2014 Digital Stewardship Idea Challenge

Using XML and XSLT as a repository for digital collections

This idea was originally designed as a business continuity solution, but it applies best to a trusted framework approach within digital collections. At their core, digital collections are really just digital items (PDF, JPEG, etc.), metadata (in XML), and a search. If you were to break the digital collection down into these core parts then the possibility of independence from single repositories or storage ...
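To make the "item, XML metadata, and a search" breakdown concrete, here is a small Python sketch of a search over XML metadata records. It uses Python's standard `xml.etree.ElementTree` rather than XSLT, and the element names, attribute names, and sample records are all invented for illustration; none of them come from the submitted idea.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical metadata: each <item> points at a digital file and carries
# a few descriptive fields. The schema is an assumption for this sketch.
SAMPLE = """<collection>
  <item file="report.pdf">
    <title>Annual Report</title>
    <subject>finance</subject>
  </item>
  <item file="harbor.jpg">
    <title>Harbor at Dawn</title>
    <subject>photography</subject>
  </item>
</collection>"""


def search(xml_text: str, term: str) -> List[str]:
    """Return the file names of items whose metadata text contains term."""
    root = ET.fromstring(xml_text)
    needle = term.lower()
    hits: List[str] = []
    for item in root.iter("item"):
        # Join all text inside the item (title, subject, ...) for matching.
        text = " ".join(t.strip() for t in item.itertext()).lower()
        if needle in text:
            hits.append(item.get("file", ""))
    return hits
```

Because the metadata is a plain XML document, the same records could in principle be moved between storage systems and searched or transformed by any XML-aware tool, which is the kind of independence the idea describes.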

Category: Policy Research on Trust Frameworks

Voting

0 votes
Active