Applications Stack
Hi. In this post i'll talk about data pipeline in the library. We have raw data in different formats. It can be databases, txt files, pdfs and other text formats. In the next posts i refer detailed about each step in pipeline. First is a text transformation into json and saving it in Apache Solr.
1. I transform data to json to have an opportunity to save it in Apache Solr -> https://solr.apache.org/.
With java script library ajax-solr i have a web view.
Hier a link how it looks on our website: link
2. Fedora repository -> https://duraspace.org/fedora/
All data, not only text, but original formats, images, video and do on, i save in Fedora repository. In this case i modify already existing Json files into JSON-LD files and ingest it into Fedora.
3. I save their (Fedora repository) IIIF data too. I create collections and manifests and with IIIF Image Server display it with Madoc.
Here is the main steps in my stack











