Week 9: Finializing the Structure of PBase Repository and Documenting the Project’s Results

This week I was mostly working on the convertor (from PROV-XML to Cypher scripts) code to make some changes in the way the traces are stored in PBase repository. Here is a quick look at the current structure of PBase repository: The blue boxes show the properties of each node. Continue reading Week 9: Finializing the Structure of PBase Repository and Documenting the Project’s Results

Week 7: Improving the Structure of PBase Repository (Continuation from Last Week)

This week I was mostly working on applying the changes that I discussed in my last week blog entry on PBase repository. Now we can convert the traces of multiple runs of a number of workflows from PROV-XML format to a series of Cypher creation commands (as well as Geoff Continue reading Week 7: Improving the Structure of PBase Repository (Continuation from Last Week)

Week 6: Extending the PBase Repository by Importing More Traces into It and Defining More Properties for Each Node

After looking through the final list of queries that have to be implemented in PBase project, I realized that some of them are based on nodes (actors and data entities) properties (e.g. type, version, …) and also, some of the queries refer to multiple runs of a workflow. After a Continue reading Week 6: Extending the PBase Repository by Importing More Traces into It and Defining More Properties for Each Node

Week 4: More on Cypher and its Comparison with other Graph Query Languages

The main focus of this week was on translating some of the main provenance queries into Cypher format and compare it with its equivalence in Gremlin. The first thing to do was to add some proper indexes to our Neo4j database. Currently, we have only one property for each node Continue reading Week 4: More on Cypher and its Comparison with other Graph Query Languages

Week 3: Creating PBase Repository and Working on Queries

The main focus of this week was to build PBase main repository, set up the development environment, and get more familiar with the format of Cypher queries. The first thing to do was to translate workflow provenance traces to a Neo4j compatible format. Some of the approaches are: Neo4j Import Continue reading Week 3: Creating PBase Repository and Working on Queries

Week 2: ProvWG Meeting and Plans for Next Actions on PBase

This week I attended ProvWG meeting at NYU-Poly. This was a two day meeting with the main focus on PBase and D-PPROV [MDB+13], a recent project going on in ProvWG that some of its ideas might be related to PBase project. D-PPROV is an extension to W3C PROV provenance model aimed at representing Continue reading Week 2: ProvWG Meeting and Plans for Next Actions on PBase