KEMBAR78
Workpackage 1 Presentation at DM2E Project Meeting 3, London | PPTX
co-funded by the European Union
WP1 - Content
DM2E All WP Meeting
London, 11 June 2013
Doron Goldfarb, Austrian National Library (ONB)
Outline
1. WP1 Status
2. Specification for annotatable content
3. MINT Questionnaire
DM2E All WP Meeting: Work Package 111.06.2013 2
WP1 Status – Administrative issues
DM2E All WP Meeting: Work Package 111.06.2013
 Doron Goldfarb took over WP1 lead from Ewelina
Rockenbauer with 1st of March 2013
 New member at DM2E since 1st of May 2013: Kristin Dill
 Doron will provide technical lead for WP1 metadata ingestion and
content provision process
 Kristin will deal with the Digital Humanities aspects of WP1 –
Task 1.4: “Setup a test scenario for the prototype platform”
and also take over ONB’s role in WP3
Task 3.4: “Background research on scholarly principles”
3
WP1 Status – New associated partners
DM2E All WP Meeting: Work Package 111.06.2013
 National Library of the Netherlands (KB)
(Contact: Steven Claeyssens)
 Collection of digitized illuminations (= images) from
medieval manuscripts – Data already in Europeana!
 Metadata about 400 manuscripts and 11.141
illuminations & the 11.141 illuminations themselves
for annotation
4
WP1 Status – New associated partners
DM2E All WP Meeting: Work Package 111.06.2013
 Georg Eckert Institute for Textbook Research (GEI)
(Contact: Esther Chen)
 Collection of currently ~2.800 textbooks
encompassing ~500.000 pages
Data (parts of ?) already in Europeana
 Will provide full text for annotation
5
WP1 Status – New associated partners
DM2E All WP Meeting: Work Package 111.06.2013
• KB
– Content & Metadata Questionnaire returned
• GEI
– Content & Metadata Questionnaire returned
• BAS
– Content & Metadata Questionnaire sent, awaiting response
• BRANDEIS
– Content & Metadata Questionnaire sent, awaiting response
6
WP1 Status - SBB
DM2E All WP Meeting: Work Package 111.06.2013
 Maintained XSLT-Script for BBAW-TEI
 Wrote a script to query OAI-PMH-interfaces
(especially for BBAW)
 Set up D2R-Server for SBB-Kalliope
7
WP1 Status - MPIWG
DM2E All WP Meeting: Work Package 111.06.2013
 Contextualizing data „in-house“
 Would like to finish this step bevor performing mass
ingestion to DM2E
 Participated in creation of MINT questionnaire
8
WP1 Status - UIB
DM2E All WP Meeting: Work Package 111.06.2013
 Technical mapping of metadata
 Modified Kilian Schmidtner’s XSLT-Skript to match UIB’s
current version of transcriptions
 Ready to produce RDF
9
WP1 Status - EAJC
DM2E All WP Meeting: Work Package 111.06.2013
 EAJC and NLI maintain contact with EJ partners and
supported them with completing the content & metadata
questionnaires
 Expand of VIAF ID from authority records to bibliographic
records for NLI data
 Consider APEX EAD -> EDM conversion tools for JDC EAD
data
 Will have Skype conference with BRANDEIS this evening
regarding support for content & metadata questionnaires
10
WP1 Status - UBFFM
DM2E All WP Meeting: Work Package 111.06.2013
 Digitized 150 additional medieval manuscripts of about
55.000 pages – currently about 650 medieval
manuscripts online
 Performed XSLT-based mapping from METS/MODS to the
new DM2E-model
 Mapping on item- and page-level, including authority
data – will be presented tomorrow during breakout
sessions.
11
WP1 Status - ONB
DM2E All WP Meeting: Work Package 111.06.2013
 Metadata for Codices Manuscripts mapped to the latest
DM2E model using MINT and handcrafted XSLT, some
properties missing
 ABO – Google project
 Open questions how to deal with updated content and
how to prevent mass downloads
12
Outline
1. WP1 Status
2. Specification for annotatable content
3. MINT Questionnaire
DM2E All WP Meeting: Work Package 111.06.2013 13
WP1 Specification for annotatable content
DM2E All WP Meeting: Work Package 111.06.2013
• WP1, WP2 and WP3 collaborate on specification for
annotatable content for Pundit
• Original specification requires content to be provided in
form of HTML
• Additional option for content providers who can only
provide „raw“ content such as images of digitized pages
or plain text OCR/transcriptions of single pages.
14
WP1 Specification for annotatable content
DM2E All WP Meeting: Work Package 111.06.2013
• Three options for content providers:
1) Provide content according to initial WP3 specification
2) Provide raw content on page level (Digitized images, plain text)
3) Provide link to digital content on item level
Use Pundit bookmarklet tool for annotation - Fallback
solution, annotation functionality is not guaranteed !!!
15
WP1 Specification for annotatable content
DM2E All WP Meeting: Work Package 111.06.2013 16
WP1 Specification for annotatable content
DM2E All WP Meeting: Work Package 111.06.2013 17
WP1 Specification for annotatable content
DM2E All WP Meeting: Work Package 111.06.2013 18
Outline
1. WP1 Status
2. Specification for annotatable content
3. MINT Questionnaire
DM2E All WP Meeting: Work Package 111.06.2013 19
WP1 MINT Questionnaire
DM2E All WP Meeting: Work Package 111.06.2013 20
• „Kickoff“ for WP1 task 1.3:
„Testing the user interface for creating mapping,
interlinking heuristics and for configuring the workflow“
• Sent out on May 28th 2013
• ONB and MPIWG have performed initial tests
• Will discuss preliminary results tomorrow during
breakout sessions
co-funded by the European Union
Thank You !

Workpackage 1 Presentation at DM2E Project Meeting 3, London

  • 1.
    co-funded by theEuropean Union WP1 - Content DM2E All WP Meeting London, 11 June 2013 Doron Goldfarb, Austrian National Library (ONB)
  • 2.
    Outline 1. WP1 Status 2.Specification for annotatable content 3. MINT Questionnaire DM2E All WP Meeting: Work Package 111.06.2013 2
  • 3.
    WP1 Status –Administrative issues DM2E All WP Meeting: Work Package 111.06.2013  Doron Goldfarb took over WP1 lead from Ewelina Rockenbauer with 1st of March 2013  New member at DM2E since 1st of May 2013: Kristin Dill  Doron will provide technical lead for WP1 metadata ingestion and content provision process  Kristin will deal with the Digital Humanities aspects of WP1 – Task 1.4: “Setup a test scenario for the prototype platform” and also take over ONB’s role in WP3 Task 3.4: “Background research on scholarly principles” 3
  • 4.
    WP1 Status –New associated partners DM2E All WP Meeting: Work Package 111.06.2013  National Library of the Netherlands (KB) (Contact: Steven Claeyssens)  Collection of digitized illuminations (= images) from medieval manuscripts – Data already in Europeana!  Metadata about 400 manuscripts and 11.141 illuminations & the 11.141 illuminations themselves for annotation 4
  • 5.
    WP1 Status –New associated partners DM2E All WP Meeting: Work Package 111.06.2013  Georg Eckert Institute for Textbook Research (GEI) (Contact: Esther Chen)  Collection of currently ~2.800 textbooks encompassing ~500.000 pages Data (parts of ?) already in Europeana  Will provide full text for annotation 5
  • 6.
    WP1 Status –New associated partners DM2E All WP Meeting: Work Package 111.06.2013 • KB – Content & Metadata Questionnaire returned • GEI – Content & Metadata Questionnaire returned • BAS – Content & Metadata Questionnaire sent, awaiting response • BRANDEIS – Content & Metadata Questionnaire sent, awaiting response 6
  • 7.
    WP1 Status -SBB DM2E All WP Meeting: Work Package 111.06.2013  Maintained XSLT-Script for BBAW-TEI  Wrote a script to query OAI-PMH-interfaces (especially for BBAW)  Set up D2R-Server for SBB-Kalliope 7
  • 8.
    WP1 Status -MPIWG DM2E All WP Meeting: Work Package 111.06.2013  Contextualizing data „in-house“  Would like to finish this step bevor performing mass ingestion to DM2E  Participated in creation of MINT questionnaire 8
  • 9.
    WP1 Status -UIB DM2E All WP Meeting: Work Package 111.06.2013  Technical mapping of metadata  Modified Kilian Schmidtner’s XSLT-Skript to match UIB’s current version of transcriptions  Ready to produce RDF 9
  • 10.
    WP1 Status -EAJC DM2E All WP Meeting: Work Package 111.06.2013  EAJC and NLI maintain contact with EJ partners and supported them with completing the content & metadata questionnaires  Expand of VIAF ID from authority records to bibliographic records for NLI data  Consider APEX EAD -> EDM conversion tools for JDC EAD data  Will have Skype conference with BRANDEIS this evening regarding support for content & metadata questionnaires 10
  • 11.
    WP1 Status -UBFFM DM2E All WP Meeting: Work Package 111.06.2013  Digitized 150 additional medieval manuscripts of about 55.000 pages – currently about 650 medieval manuscripts online  Performed XSLT-based mapping from METS/MODS to the new DM2E-model  Mapping on item- and page-level, including authority data – will be presented tomorrow during breakout sessions. 11
  • 12.
    WP1 Status -ONB DM2E All WP Meeting: Work Package 111.06.2013  Metadata for Codices Manuscripts mapped to the latest DM2E model using MINT and handcrafted XSLT, some properties missing  ABO – Google project  Open questions how to deal with updated content and how to prevent mass downloads 12
  • 13.
    Outline 1. WP1 Status 2.Specification for annotatable content 3. MINT Questionnaire DM2E All WP Meeting: Work Package 111.06.2013 13
  • 14.
    WP1 Specification forannotatable content DM2E All WP Meeting: Work Package 111.06.2013 • WP1, WP2 and WP3 collaborate on specification for annotatable content for Pundit • Original specification requires content to be provided in form of HTML • Additional option for content providers who can only provide „raw“ content such as images of digitized pages or plain text OCR/transcriptions of single pages. 14
  • 15.
    WP1 Specification forannotatable content DM2E All WP Meeting: Work Package 111.06.2013 • Three options for content providers: 1) Provide content according to initial WP3 specification 2) Provide raw content on page level (Digitized images, plain text) 3) Provide link to digital content on item level Use Pundit bookmarklet tool for annotation - Fallback solution, annotation functionality is not guaranteed !!! 15
  • 16.
    WP1 Specification forannotatable content DM2E All WP Meeting: Work Package 111.06.2013 16
  • 17.
    WP1 Specification forannotatable content DM2E All WP Meeting: Work Package 111.06.2013 17
  • 18.
    WP1 Specification forannotatable content DM2E All WP Meeting: Work Package 111.06.2013 18
  • 19.
    Outline 1. WP1 Status 2.Specification for annotatable content 3. MINT Questionnaire DM2E All WP Meeting: Work Package 111.06.2013 19
  • 20.
    WP1 MINT Questionnaire DM2EAll WP Meeting: Work Package 111.06.2013 20 • „Kickoff“ for WP1 task 1.3: „Testing the user interface for creating mapping, interlinking heuristics and for configuring the workflow“ • Sent out on May 28th 2013 • ONB and MPIWG have performed initial tests • Will discuss preliminary results tomorrow during breakout sessions
  • 21.
    co-funded by theEuropean Union Thank You !