Quantcast
Channel: Work progress – Digital Intellectuals
Browsing latest articles
Browse All 15 View Live

Encoding an XML Tree model for my corpus

I work on a project called “Digital edition of historical manuscripts” which aims to diffuse on a public platform multiple corpora that all have one thing in common: they are ego documents, which...

View Article


Image may be NSFW.
Clik here to view.

Starting a new project – Discovering its source material

At the beginning of this month, I started to work at the French Institute for Research in Computer Science and Automation (INRIA) where I have been hired to work on the DAHN project. The project, that...

View Article

Image may be NSFW.
Clik here to view.

Working through minor issues

While I worked on the creation of my XML tree model, I realized that some recurrent elements of the letters will be more difficult to encode, mainly because they are neither mentioned nor explained on...

View Article

Image may be NSFW.
Clik here to view.

How to produce a model for the transcription

My work on the DAHN project can be divided into two major parts: transcribing and encoding. I already mentioned the encoding in two previous posts (there and there) and now it is time I talk a little...

View Article

Image may be NSFW.
Clik here to view.

Difficulties in creating the transcription model

In my last article, I presented and detailed the early stages for the production of a transcription model, explaining the technical method to follow in order to do so. I will now develop another part...

View Article


Image may be NSFW.
Clik here to view.

How to produce a model for the segmentation

In order to facilitate the work on the automatic transcription of d’Estournelles’ correspondence, we identified the need to develop a specific segmentation model, in addition to a transcription model....

View Article

Image may be NSFW.
Clik here to view.

Transcribing the corpus

In all the posts I have already published, I mentioned multiples stages of my work, as well as some difficulties I have encountered while trying to realize it. More recently, I expressed the problems...

View Article

Image may be NSFW.
Clik here to view.

Encoding the corpus

I am now at the second part of the trinity previously mentioned in this article: the encoding. This portion of the work has already been partly explained in some of the first posts (there and there)...

View Article


Image may be NSFW.
Clik here to view.

Publication of my digital edition – Working with TEI Publisher

In March, I started working on the DAHN Project and the Paul d’Estournelles de Constant corpus. Since then, as I presented and explained it in multiple articles, I have developed a model for the...

View Article


Image may be NSFW.
Clik here to view.

Storing and sharing the project

I have been working on the DAHN Project for more than a year now: I executed a lot of tasks over the year and developed the project extensively. All of this work represents many files created – be...

View Article

Image may be NSFW.
Clik here to view.

Publication of my digital edition – Developing my TEI Publisher application

In a recent blog post, I introduced the ‘publication’ phase of the digital edition pipeline I am currently developing within the DAHN project. I presented, explained and developed how we are using TEI...

View Article

Image may be NSFW.
Clik here to view.

Availability and high quality: distributing the facsimile with NAKALA

Through the DAHN project, we are developing a pipeline for the digital edition of archival documents with TEI Publisher as the final publication step. This pipeline is divided into six steps. I...

View Article

Image may be NSFW.
Clik here to view.

Publication of my digital edition – Online launch of the TEI Publisher...

A year ago, almost to the day, I wrote my first post about TEI Publisher, as I began working with it, to execute the ‘publication’ phase of the pipeline I mentioned many times in my posts. I had...

View Article


Image may be NSFW.
Clik here to view.

Recognizing and encoding the corpus’ named entities

Many months ago, I posted an article about the work I had to do to obtain a publishable edition of my corpus. I called it a trinity because it would be divided into three parts:  transcription;...

View Article

Image may be NSFW.
Clik here to view.

Overcoming challenges in DiScholEd’s development: a journey of...

As one of the interns involved in the development of the application, I encountered several challenges throughout the process. This blog post outlines the problems faced, the attempted solutions, and...

View Article

Browsing latest articles
Browse All 15 View Live