<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="https://ocr-d.de/feed.xml" rel="self" type="application/atom+xml" /><link href="https://ocr-d.de/" rel="alternate" type="text/html" /><updated>2026-04-08T12:05:07+02:00</updated><id>https://ocr-d.de/feed.xml</id><title type="html">OCR-D</title><subtitle>Write an awesome description for your new site here. You can edit this line in _config.yml. It will appear in your document head meta (for Google search results) and in your feed.xml site description.</subtitle><entry xml:lang="en"><title type="html">OCR-D Phase III started</title><link href="https://ocr-d.de/en/2021/08/06/kick-off-phase3.html" rel="alternate" type="text/html" title="OCR-D Phase III started" /><published>2021-08-06T00:00:00+02:00</published><updated>2021-08-06T00:00:00+02:00</updated><id>https://ocr-d.de/en/2021/08/06/kick-off-phase3</id><content type="html" xml:base="https://ocr-d.de/en/2021/08/06/kick-off-phase3.html"><![CDATA[<p>On 30 July, our kick-off workshop took place, heralding phase III of OCR-D.</p>
<p>The day before, the project participants met internally to get to know each other and coordinate their work. On the public workshop day, the team of the Coordination Project gave an introduction to the <a href="https://ocr-d.de/assets/kick-off/phase3.pdf">objectives in phase III and public communication channels of OCR-D</a>, the <a href="/assets/kick-off/spec_core_ocrd_all.pdf">current status and plans of the OCR-D software</a>, the <a href="/assets/kick-off/api.pdf">Web API</a> and the handling of <a href="/assets/kick-off/gt.pdf">Ground Truth Data in OCR-D</a>. The Coordination Project also gave an insight into best practices of <a href="/assets/kick-off/software-development.pdf">software development</a> in the past phase of OCR-D, as well as ideas on how the community can contribute.</p>
<p>In addition, the implementation and module projects presented themselves in <a href="/assets/kick-off/lightning-talks.pdf">short presentations</a> to the interested community and our <a href="/en/contact#cooperation-partners">cooperation partners</a>.</p>
<p>UB Braunschweig, SLUB Dresden and UB Mannheim are extending both OCR-D and Kitodo for productive mass digitisation; SUB Göttingen and GWDG are working on Performance Optimisation and Integration, deploying OCR-D on a High Performance Cluster; GEI Braunschweig, HCI and ZPD of the University of Würzburg will implement OCR-D features in OCR4all, making OCR-D available via their software; the ULB Sachsen-Anhalt will implement OCR-D in their Open Source mass digitisation infrastructure.
While these project partners will work on four implementation scenarios, we have three module projects, improving OCR-D processors: UB Mannheim enabling work-specific training with Tesseract and Calamari; JGU Mainz and FAU Erlangen-Nürnberg improving font group recognition for better fitting OCR-models; and OLA-HD by SUB Göttingen and GWDG, optimising reliability, searchability and fine-grained referencing of the OLA-HD long-term archiving repository.</p>
<p>In our chat channel, the <a href="https://gitter.im/OCR-D/Lobby">gitter lobby</a>, we always keep you informed about public OCR-D events. Further information about how to stay in touch and contribute to OCR-D can be found in our overview of <a href="/en/platforms">platforms</a>.</p>]]></content><author><name>Lena Hinrichsen</name></author><category term="en" /><category term="Phase 3" /><summary type="html"><![CDATA[On 30 July, our kick-off workshop took place, heralding phase III of OCR-D.]]></summary></entry><entry xml:lang="en"><title type="html">OCR-D at the Bibliothekartag 2021</title><link href="https://ocr-d.de/en/2021/06/11/bibtag.html" rel="alternate" type="text/html" title="OCR-D at the Bibliothekartag 2021" /><published>2021-06-11T00:00:00+02:00</published><updated>2021-06-11T00:00:00+02:00</updated><id>https://ocr-d.de/en/2021/06/11/bibtag</id><content type="html" xml:base="https://ocr-d.de/en/2021/06/11/bibtag.html"><![CDATA[<p>OCR-D will also be present at this year’s Bibliothekartag, which will take place virtually from 16-18 June 2021 and on two
days in Bremen. The OCR-D project is participating with two presentations on the current status of the funding initiative
and on the collaborative creation of training materials.</p>
<p><a href="https://dbt2021.abstractserver.com/program/#/details/presentations/70">Elisabeth Engl</a> describes the current status of
the OCR-D software and gives an outlook on the many planned application scenarios, which are at the centre of the third project phase.
<a href="https://dbt2021.abstractserver.com/program/#/details/presentations/184">Kay-Michael Würzner and Robert Sachunsky</a>
<a href="https://wrznr.github.io/bibliothekartag-2021">report</a> on their experiences with setting up and conducting a collaborative transcription initiative at SLUB Dresden to create OCR training material and models by means of OCR-D and low-threshold annotation tools.</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="Bibliothekartag" /><category term="presentation" /><category term="phase 3" /><summary type="html"><![CDATA[OCR-D will also be present at this year’s Bibliothekartag, which will take place virtually from 16-18 June 2021 and on two days in Bremen. The OCR-D project is participating with two presentations on the current status of the funding initiative and on the collaborative creation of training materials.]]></summary></entry><entry xml:lang="en"><title type="html">Implementation and module projects granted</title><link href="https://ocr-d.de/en/2021/06/10/projects.html" rel="alternate" type="text/html" title="Implementation and module projects granted" /><published>2021-06-10T00:00:00+02:00</published><updated>2021-06-10T00:00:00+02:00</updated><id>https://ocr-d.de/en/2021/06/10/projects</id><content type="html" xml:base="https://ocr-d.de/en/2021/06/10/projects.html"><![CDATA[<p>In addition to the coordination project, the DFG also approved seven implementation
and module projects that will begin their work in the coming months.</p>
<p>Of the eleven proposals submitted, which were developed in the course of extensive piloting
of the OCR-D software in the summer of 2020, four implementation and three module projects
will be funded by the DFG.</p>
<p>The module projects will further develop selected OCR-D tools.</p>
<ul>
<li>Workflow for work-specific training based on generic models with OCR-D as well as ground truth enhancement (UB Mannheim).</li>
<li>Font Group Recognition for Improved OCR (JGU Mainz, FAU Erlangen)</li>
<li>OLA-HD Service - A Generic Service for Long-Term Archiving of Historical Prints (SUB Göttingen, GWDG)</li>
</ul>
<p>Together with the ca. 65 other OCR-D tools, these form the basis for the work of the four implementation projects, which will prepare the existing OCR-D software for its productive use in mass digitisation.</p>
<ul>
<li>Integration of Kitodo and OCR-D for productive mass digitisation (UB Braunschweig, SLUB Dresden, UB Mannheim).</li>
<li>OPERANDI: OCR-D Performance Optimisation and Integration (SUB Göttingen, GWDG)</li>
<li>OCR-D Software in Modular Mass Digitisation Workflows (ULB Halle)</li>
<li>OCR4all libraries full text recognition of historical collections (GEI Braunschweig, HCI and ZPD of the University of Würzburg)</li>
</ul>
<p>We are very much looking forward to this next project phase and the (further) cooperation with the implementation and module projects, which will prepare the use of the OCR-D software in different usage scenarios.</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="DFG" /><category term="grant" /><category term="phase 3" /><summary type="html"><![CDATA[In addition to the coordination project, the DFG also approved seven implementation and module projects that will begin their work in the coming months.]]></summary></entry><entry xml:lang="en"><title type="html">OCR(-D) &amp; Co starting in May</title><link href="https://ocr-d.de/en/2021/04/26/barcamp.html" rel="alternate" type="text/html" title="OCR(-D) &amp; Co starting in May" /><published>2021-04-26T00:00:00+02:00</published><updated>2021-04-26T00:00:00+02:00</updated><id>https://ocr-d.de/en/2021/04/26/barcamp</id><content type="html" xml:base="https://ocr-d.de/en/2021/04/26/barcamp.html"><![CDATA[<p>The inaugural session of our new barcamp-like monthly event <strong>OCR(-D) & Co</strong> will take place on 7 May 2021.
In this barcamp format, developers, users and all other interested persons will be given the opportunity to talk
about OCR(-D).</p>
<p>OCR(-D) & Co will take place every first Friday of the month from May 7 onwards at 10-11 am CET in a <a href="https://meet.gwdg.de/b/kon-v6q-azq-3el">BBB room</a>.
At the beginning of each meeting, participants have the opportunity to suggest topics of interest, which can then
be discussed by small groups in breakout rooms. In addition to the <a href="https://hackmd.io/OOMgg3ZeSqK4vfKL1wRbwQ?view">open TechCall</a>,
where the OCR-D community discusses technical topics every second Wednesday, <strong>OCR(-D) & Co</strong> also offers participants without in-depth
OCR-D knowledge the opportunity to contribute their own questions and ideas to the discussion and to openly exchange ideas
with other OCR-interested people. We look forward to your participation!</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="open call" /><category term="phase 3" /><category term="community" /><summary type="html"><![CDATA[The inaugural session of our new barcamp-like monthly event OCR(-D) & Co will take place on 7 May 2021. In this barcamp format, developers, users and all other interested persons will be given the opportunity to talk about OCR(-D).]]></summary></entry><entry xml:lang="en"><title type="html">Phase III of the OCR-D-coordination project granted</title><link href="https://ocr-d.de/en/2021/01/19/phase3.html" rel="alternate" type="text/html" title="Phase III of the OCR-D-coordination project granted" /><published>2021-01-19T00:00:00+01:00</published><updated>2021-01-19T00:00:00+01:00</updated><id>https://ocr-d.de/en/2021/01/19/phase3</id><content type="html" xml:base="https://ocr-d.de/en/2021/01/19/phase3.html"><![CDATA[<p>The coordination project’s application for the third phase of the OCR-D funding initiative was approved by the DFG in January 2021.
In phase III, we will optimise the results of the previous module project phase and we will initiate the productive use of the
OCR-D software in mass digitisation both technically and organisationally.</p>
<p>The previous project partners from BBAW, HAB and SBB will now be joined by SUB Göttingen and GWDG, whereas the KIT has left the project.
Together, the partners can continue to support and coordinate the work of the other OCR-D projects in Phase III.
In addition, the OCR-D software will be optimised for its use in mass digitisation and the functioning of the
overall OCR-D workflow will be ensured. Great importance will be put on ensuring the permanent support and
further development of the OCR-D software and on communicating the results of the implementation work to a
broad circle of users who will use it for the efficient full-text digitisation of VD materials.</p>
<p>The coordination project is delighted to continue the work of the two previous project
phases and looks forward to working with the other OCR-D projects.</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="DFG" /><category term="grant" /><category term="phase 3" /><summary type="html"><![CDATA[The coordination project’s application for the third phase of the OCR-D funding initiative was approved by the DFG in January 2021. In phase III, we will optimise the results of the previous module project phase and we will initiate the productive use of the OCR-D software in mass digitisation both technically and organisationally.]]></summary></entry><entry xml:lang="en"><title type="html">OCR-D at the Mini-ELAG</title><link href="https://ocr-d.de/en/2020/10/02/elag.html" rel="alternate" type="text/html" title="OCR-D at the Mini-ELAG" /><published>2020-10-02T00:00:00+02:00</published><updated>2020-10-02T00:00:00+02:00</updated><id>https://ocr-d.de/en/2020/10/02/elag</id><content type="html" xml:base="https://ocr-d.de/en/2020/10/02/elag.html"><![CDATA[<p>On October 20, 2020 the <a href="https://elag.org/2020/09/24/mini-elag-program/">Mini-ELAG (European Library Automation Group)</a>
takes place, where librarians and IT professionals discuss new information technologies
and their application in libraries and documentation centers. OCR-D will be represented at
the virtual conference with a lecture by Clemens Neudecker (SBB) on
<em>OCR-D: An open ecosystem for improving OCR on historical documents</em>.</p>
<p>The talk will present the OCR-D software and its functions and discuss the open, participative
development strategy of the DFG project.</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="conference" /><category term="ELAG" /><summary type="html"><![CDATA[On October 20, 2020 the Mini-ELAG (European Library Automation Group) takes place, where librarians and IT professionals discuss new information technologies and their application in libraries and documentation centers. OCR-D will be represented at the virtual conference with a lecture by Clemens Neudecker (SBB) on OCR-D: An open ecosystem for improving OCR on historical documents.]]></summary></entry><entry xml:lang="en"><title type="html">OCR-D at the virtual workshop FAIR &amp; Co</title><link href="https://ocr-d.de/en/2020/09/22/fair.html" rel="alternate" type="text/html" title="OCR-D at the virtual workshop FAIR &amp; Co" /><published>2020-09-22T00:00:00+02:00</published><updated>2020-09-22T00:00:00+02:00</updated><id>https://ocr-d.de/en/2020/09/22/fair</id><content type="html" xml:base="https://ocr-d.de/en/2020/09/22/fair.html"><![CDATA[<p>From October 7 to 8, the <a href="https://www.akademienunion.de/en/working-groups/working-group-on-ehumanities/">eHumanities working group of the Union of German Academies of Sciences and Humanities</a>,
in cooperation with the <a href="https://adw-goe.de/en/home/">Göttingen Academy of Sciences and Humanities</a>,
is organizing the workshop <em>FAIR & Co: Visibility and Availability of Digital Academy Research in a Networked Scientific Landscape</em>.
The OCR-D project will be represented with a lecture on the topic <em>Digital Transformation: OCR-D, Offer and Vision</em> by Matthias Boenig (BBAW).
Using the example of the German Text Archive, it will be shown how the application spectrum of this reference
corpus can be extended by the area of machine learning to improve character and structure
recognition. For the whole program of the workshop see the <a href="https://workshop.adw-goe.de/programm/">website of this workshop</a>.</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="workshop" /><category term="FAIR" /><category term="DTA" /><summary type="html"><![CDATA[From October 7 to 8, the eHumanities working group of the Union of German Academies of Sciences and Humanities, in cooperation with the Göttingen Academy of Sciences and Humanities, is organizing the workshop FAIR & Co: Visibility and Availability of Digital Academy Research in a Networked Scientific Landscape. The OCR-D project will be represented with a lecture on the topic Digital Transformation: OCR-D, Offer and Vision by Matthias Boenig (BBAW). Using the example of the German Text Archive, it will be shown how the application spectrum of this reference corpus can be extended by the area of machine learning to improve character and structure recognition. For the whole program of the workshop see the website of this workshop.]]></summary></entry><entry xml:lang="en"><title type="html">Workshop for the implementation plans</title><link href="https://ocr-d.de/en/2020/08/01/implementation-workshop.html" rel="alternate" type="text/html" title="Workshop for the implementation plans" /><published>2020-08-01T00:00:00+02:00</published><updated>2020-08-01T00:00:00+02:00</updated><id>https://ocr-d.de/en/2020/08/01/implementation-workshop</id><content type="html" xml:base="https://ocr-d.de/en/2020/08/01/implementation-workshop.html"><![CDATA[<p>Following the successful first (virtual) meeting of those interested in the DFG-call for the
implementation of the OCR-D software, a further workshop will be held on <strong>7 August, 9 a.m. to 1 p.m.</strong>
to prepare the OCR-D grant proposals.</p>
<p>At this second meeting the applicants can inform each other about their previous tests of
the OCR-D software and exchange experiences. In addition, the project plans,
which have been further developed in the meantime, will be discussed in order to identify
possible synergies between the grant proposals.</p>
<p>We are looking forward to this new exchange and a continuing successful pilot phase!</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="DFG" /><category term="call" /><category term="pilottest" /><summary type="html"><![CDATA[Following the successful first (virtual) meeting of those interested in the DFG-call for the implementation of the OCR-D software, a further workshop will be held on 7 August, 9 a.m. to 1 p.m. to prepare the OCR-D grant proposals.]]></summary></entry><entry xml:lang="en"><title type="html">Kick-off pilot phase</title><link href="https://ocr-d.de/en/2020/06/04/pilot.html" rel="alternate" type="text/html" title="Kick-off pilot phase" /><published>2020-06-04T00:00:00+02:00</published><updated>2020-06-04T00:00:00+02:00</updated><id>https://ocr-d.de/en/2020/06/04/pilot</id><content type="html" xml:base="https://ocr-d.de/en/2020/06/04/pilot.html"><![CDATA[<p>We are very happy about the <a href="https://www.dfg.de/download/pdf/foerderung/programme/lis/absichtserklaerungen_ocrd_2020/ocrd_absichtserklaerungen_liste.pdf">great interest in the DFG call for proposals for the implementation of the OCR-D software</a>. As OCR-D coordination project
we will support the planned projects from the pilot phase onwards and promote the exchange of information among interested parties as desired by the DFG.
To kick off the pilot phase, we are organising a large <strong>video conference</strong> on <strong>19 June, 9-13 o’clock</strong>, at which all interested parties can get to know
each other and the pilot tests can be coordinated.</p>
<p>Interested parties who have not submitted a letter of intent themselves and who are still looking for a suitable partner with whom
they could engage in the third phase of OCR-D are also welcome to the video conference.</p>
<p>If you are interested, please register for the video conference <strong>by 12 June</strong> at engl@hab.de.</p>
<p>We look forward to working with you and to a successful pilot phase!</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="DFG" /><category term="call" /><category term="pilottest" /><summary type="html"><![CDATA[We are very happy about the great interest in the DFG call for proposals for the implementation of the OCR-D software. As OCR-D coordination project we will support the planned projects from the pilot phase onwards and promote the exchange of information among interested parties as desired by the DFG. To kick off the pilot phase, we are organising a large video conference on 19 June, 9-13 o’clock, at which all interested parties can get to know each other and the pilot tests can be coordinated. Interested parties who have not submitted a letter of intent themselves and who are still looking for a suitable partner with whom they could engage in the third phase of OCR-D are also welcome to the video conference. If you are interested, please register for the video conference by 12 June at engl@hab.de. We look forward to working with you and to a successful pilot phase!]]></summary></entry><entry xml:lang="en"><title type="html">Call for OCR-D Implementation online!</title><link href="https://ocr-d.de/en/2020/02/25/dfg-call.html" rel="alternate" type="text/html" title="Call for OCR-D Implementation online!" /><published>2020-02-25T00:00:00+01:00</published><updated>2020-02-25T00:00:00+01:00</updated><id>https://ocr-d.de/en/2020/02/25/dfg-call</id><content type="html" xml:base="https://ocr-d.de/en/2020/02/25/dfg-call.html"><![CDATA[<p>The <a href="https://www.dfg.de/download/pdf/foerderung/programme/lis/ausschreibung_ocr_implementierung.pdf">call for the implementation of the OCR-D software for the full text digitisation of historical prints</a>
is now available <a href="https://www.dfg.de/foerderung/programme/infrastruktur/lis/">on the website of the German Research Foundation (DFG)</a>.</p>
<p>The aim of the OCR-D coordination project, which was launched in autumn 2015,
is to describe procedures and develop guidelines in order to achieve an optimal
workflow and the greatest possible standardisation of OCR-related processes and
metadata. Furthermore, the complete transformation of the written German
cultural heritage into a machine-readable form (structured full text) is to be
prepared conceptually. Primarily, works from the Union Catalogue of Books
Printed in German Speaking Countries in the 16th-18th century (VD) as well as
books published in the 19th century in the German language area will be
considered. The VD projects comprise about 1 million titles that are currently
being digitized and are to be processed by means of OCR in the future.</p>
<p>The work done so far in the OCR-D initiative has led to significant
improvements in the optical character and layout recognition of historical
prints. The software prototype promises flexible integration into existing
(librarian) workflow and digitisation systems due to its modular design. The
implementation of the OCR-D software in libraries, archives and other
collection holding institutions is the next necessary step to ensure that
high-quality full texts can be produced.</p>
<p>The central goal of the new call is the development of (generic) implementation
packages with acceptable performance for different requirements. It is
primarily aimed at collection holding institutions. The application is due on
October 7, 2020. The DFG expects applicants to hand in a letter of intent by May <del>5</del> 22. If you are interested in participating, please don’t hesitate to
<a href="/en/contact">contact us</a>.</p>]]></content><author><name>Elisabeth Engl</name></author><category term="en" /><category term="DFG" /><category term="call" /><summary type="html"><![CDATA[The call for the implementation of the OCR-D software for the full text digitisation of historical prints is now available on the website of the German Research Foundation (DFG).]]></summary></entry></feed>