{"id":3200,"date":"2018-06-15T22:24:00","date_gmt":"2018-06-15T22:24:00","guid":{"rendered":"https:\/\/notebooks.dataone.org\/?p=3200"},"modified":"2019-05-24T17:47:34","modified_gmt":"2019-05-24T17:47:34","slug":"week-3-custom-mimetypesmagic-file-for-the-dataone-file-formats-for-identification-using-apache-tikaapache-web-server","status":"publish","type":"post","link":"https:\/\/notebooks.dataone.org\/extending-libmagic\/week-3-custom-mimetypesmagic-file-for-the-dataone-file-formats-for-identification-using-apache-tikaapache-web-server\/","title":{"rendered":"Week 3: Custom mimetypes\/magic file for the DataONE file formats for identification using Apache Tika\/Apache web server"},"content":{"rendered":"

Hi All,<\/p>\n

This blog is in conjunction with my earlier blogs for the Project 4: Extending Libmagic for Identification of Science Resources<\/a>. In the last week we were able to create the magic file for the file <\/strong>command and the repository admins of it also accepted and committed the changes in the library. This week we wanted to explore the tool Apache Tika<\/a> in more depth and wanted to have a similar functionality available in it for identifying the DataONE file<\/a> formats. In this week, we completed the below tasks and will be setting new goals for the internship in the coming weeks.<\/p>\n