Changes

#142 (Apr 19, 2024 9:46:03 PM)

  1. [BETA] added the deployment instruction in the Jenkins file for beta related to the creation of the action set to include the results tagged with FoS without a DOI — Miriam Baglioni / detail
  2. [DUMP] added Jenkins file for the deployment of the dumps — Miriam Baglioni / detail
  3. indentation — Claudio Atzori / detail

#142 (Apr 19, 2024 9:46:03 PM)

  1. code formatted — Sandro La Bruzzo / detail
  2. added first part of refactoring of the code generating MAG, — Sandro La Bruzzo / detail
  3. Implemented class that generates a normalized table of MAG, which is the starting point for the creation of the mag source — Sandro La Bruzzo / detail
  4. added oozie workflow — Sandro La Bruzzo / detail
  5. update mag mapping — Sandro La Bruzzo / detail
  6. Moved Crossref Mapping on dhp-aggregations, — Sandro La Bruzzo / detail
  7. code refactor — Sandro La Bruzzo / detail
  8. applied cherry pick — Sandro La Bruzzo / detail
  9. update crossref mapping to be runnable separately as a single datasource outside doiboost — Sandro La Bruzzo / detail
  10. update crossref mapping to be transformed together with UnpayWall — Sandro La Bruzzo / detail
  11. Improved Crossref mapping to also include Unpaywall; tested — Sandro La Bruzzo / detail
  12. fixed the result_country definition — Antonis Lempesis / detail
  13. Add action set creation for Datacite affiliations — Serafeim Chatzopoulos / detail
  14. added new orgs in monitor — antleb / detail
  15. [UsageCount] add check in case the datasource is not matched against those present in the graph — Miriam Baglioni / detail
  16. [UsageCount] fixed error — Miriam Baglioni / detail
  17. Fix datacite input path in properties file — Serafeim Chatzopoulos / detail
  18. [OpenCitation] add compression option when writing the sequence file — Miriam Baglioni / detail
  19. base datainfo with trust=0.89 — Michele Artini / detail
  20. [NOAMI] removed entry for Health and Social Care Board from the list of funders. Modified IRC putting 1596 and 1597 as synonyms, as required in ticket 9635 — Miriam Baglioni / detail
  21. - Update the code which acquires the "IMPALA_HDFS_NODE", to test the "tmp"-dir, instead of the base-dir and introduce retries, to overcome potential file-system failures. This change was suggested by "Sebastian Tymkow" and "Grzegorz Bakalarski". — Lampros Smyrnaios / detail
  22. [MapToFunderLink] added references for HFRI and Erasmus+ for the creation of links for funders — Miriam Baglioni / detail
  23. Implemented first part of the new MAG mapping — Sandro La Bruzzo / detail
  24. [DataciteHostedByMap] added entry for EBRAINS — Miriam Baglioni / detail
  25. [DataciteHostedByMap] added entry for EBRAINS — Miriam Baglioni / detail
  26. completed mapping from paper to OAF, not tested — Sandro La Bruzzo / detail
  27. Updated mapping — Sandro La Bruzzo / detail
  28. mapping generated for MAG, — Sandro La Bruzzo / detail
  29. added instanceTypeMapping field on MAG — Sandro La Bruzzo / detail
  30. fixed duplicated property dhp-schemas.version — Claudio Atzori / detail
  31. Upgrade the copying operation to Impala Cluster: — Lampros Smyrnaios / detail
  32. Use the "HADOOP_USER_NAME" value from the "workflow-property", in "copyDataToImpalaCluster.sh", in "stats-monitor-updates". — Lampros Smyrnaios / detail
  33. updated wf of MAG and crossref to use transaction — Sandro La Bruzzo / detail
  34. added vocabulary to instanceTypeMapping of MAG — Sandro La Bruzzo / detail
  35. code formatted — Sandro La Bruzzo / detail
  36. Miscellaneous updates to the copying operation to Impala Cluster: — Lampros Smyrnaios / detail
  37. Minor updates to the copying operation to Impala Cluster: — Lampros Smyrnaios / detail
  38. - Bug fix in matchOrderedTokenAndAbbreviations algorithms where tokens with same initial character were always considered equal — Giambattista Bloisi / detail
  39. test — Michele Artini / detail
  40. added a couple more invalid author names — Claudio Atzori / detail
  41. Added Action set generation for the MAG organization — Sandro La Bruzzo / detail
  42. removed the funder id : 100011062 Asian Spinal Cord Network, wrongly associated to Ireland — Miriam Baglioni / detail
  43. Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common — Giambattista Bloisi / detail
  44. Various fixes for the stats DB update workflow, step16-createIndicatorsTables.sql — Claudio Atzori / detail
  45. integrating changes from PR#424 — Claudio Atzori / detail
  46. updated Ignore annotation that is deprecated to Disabled — Sandro La Bruzzo / detail
  47. [graph indexing] sets spark memoryOverhead in the join operations to the same value used for the memory executor — Claudio Atzori / detail
  48. [transformative agreement] including result-funder relations to the information imported from the TRs — Claudio Atzori / detail
  49. updated schema version — Claudio Atzori / detail

#137 (Apr 2, 2024 11:09:22 AM)

  1. added workflow for updating the dedup pivot history database — Claudio Atzori / detail
  2. updated deployment spec for PROD — Claudio Atzori / detail
  3. Updated deployments of orcid collection from api — Sandro La Bruzzo / detail
  4. Change DNET_HADOOP_REPO_BRANCH default value to "beta" for BETA pipelines — Giambattista Bloisi / detail
  5. added deployment specs for dhp-stats-monitor-update, dhp-stats-monitor-irish, dhp-stats-hist-snaps — Claudio Atzori / detail
  6. removed deployment spec for dhp-stats-monitor-update — Claudio Atzori / detail
  7. added deployment specs for dhp-stats-monitor-irish, dhp-stats-hist-snaps — Claudio Atzori / detail
  8. [BETA] added the deployment instruction in the Jenkins file for beta related to the creation of the action set to extend the results with the open apc transformative agreement file information — Miriam Baglioni / detail

#137 (Apr 2, 2024 11:09:22 AM)

  1. [UsageCount] split the count for result at the level of the datasource. For each indicator, one unit is specified for each datasource contributing to that indicator value. The datasource key is the value of the key element in the unit for the measure, while the count for that datasource is in the value — Miriam Baglioni / detail
  2. refactoring — Miriam Baglioni / detail
  3. [Transformative Agreement] added code to extract relations from the transformative agreement file for the IE products got from OpenAPC — Miriam Baglioni / detail
  4. [Transformative Agreement] removed not needed class. Read directly the json and no need to pass from the csv — Miriam Baglioni / detail
  5. [Transformative Agreement] added check to verify the APC were paid by the IReL funder — Miriam Baglioni / detail
  6. Changes to indicators and funders definition — dpierrakos / detail
  7. Monitor Irish Stats WF — dpierrakos / detail
  8. Historical Snapshots Workflow — dpierrakos / detail
  9. Update buildIrishMonitorDB.sql — dpierrakos / detail
  10. fixed the result_country definition — Antonis Lempesis / detail
  11. Changes to beta db names — dpierrakos / detail
  12. Changes to indicators — dpierrakos / detail
  13. Implemented Download update of ORCID — Sandro La Bruzzo / detail
  14. Added workflow — Sandro La Bruzzo / detail
  15. code refactor — Sandro La Bruzzo / detail
  16. added some useful comment — Sandro La Bruzzo / detail
  17. creating result_instances even when no pids exist for the instance — Antonis Lempesis / detail
  18. fix issue on FoS integration. Removing the null values from FoS — Miriam Baglioni / detail
  19. fixed missing parameter on download update — Sandro La Bruzzo / detail
  20. Fixed error of connection timeout — Sandro La Bruzzo / detail
  21. Reusable RunSQLSparkJob for executing SQL in Spark through Oozie Spark Actions — Giambattista Bloisi / detail
  22. [enrichment single step] refactoring to fix issue in disappeared result type — Miriam Baglioni / detail
  23. [enrichment single step] refactoring to fix issues in disappeared result type — Miriam Baglioni / detail
  24. [enrichment single step] remove parameter from execution — Miriam Baglioni / detail
  25. - — Miriam Baglioni / detail
  26. [enrichment single step] moving parameter file in correct location — Miriam Baglioni / detail
  27. [enrichment single step] adding <end> element in wf definition — Miriam Baglioni / detail
  28. increased shuffle partitions for publications in the country propagation workflow — Claudio Atzori / detail
  29. [orcid enrichment] drop paths before copying the non-modified contents — Claudio Atzori / detail
  30. [graph provision] obtain context info from the context API instead of from the ISLookUp service — Claudio Atzori / detail
  31. code formatting — Claudio Atzori / detail
  32. [graph provision] updated param specification for the XML converter job — Claudio Atzori / detail
  33. [BulkTagging] extend the definition of the pathMap to include also actions that should be performed on the value extracted from the result before applying the constraint — Miriam Baglioni / detail
  34. compilation after merging — Miriam Baglioni / detail
  35. logging added during download — Sandro La Bruzzo / detail
  36. [collection] increased logging from the oai-pmh metadata collection process — Claudio Atzori / detail
  37. [graph provision] retrieve all the context information by adding all=true to the requests issued to the API — Claudio Atzori / detail
  38. added code of conduct and contributing files — Claudio Atzori / detail
  39. minor — Claudio Atzori / detail
  40. Update 'CONTRIBUTING.md' — Claudio Atzori / detail
  41. max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%. — Antonis Lempesis / detail
  42. max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%. — Claudio Atzori / detail
  43. Added workflow to update ORCID and replaced some parsing, because the update works and employments xml differs from the dump one. — Sandro La Bruzzo / detail
  44. Changed step16-createIndicatorsTables to use a spark oozie action instead of hive — antleb / detail
  45. [collection] increased logging from the oai-pmh metadata collection process — Claudio Atzori / detail
  46. Fixed problem on missing author in crossref Mapping — Sandro La Bruzzo / detail
  47. Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf — Giambattista Bloisi / detail
  48. Added exception throwing in Hadoop transformation when TR is not syntactically valid — Sandro La Bruzzo / detail
  49. [UsageCount] code extension to include also the name of the datasource — Miriam Baglioni / detail
  50. [orcid-enrichment] change the value of parameters. — Miriam Baglioni / detail
  51. [bulkTagging] removing checks while performing the substring action so that it will fire an Exception if the parameters are wrongly set — Miriam Baglioni / detail
  52. changed orcid ids to all capital — antleb / detail
  53. test for Italian records from IRS repositories — Alessia Bardi / detail
  54. [orcid enrichment] fixed directory cleanup before distcp — Claudio Atzori / detail
  55. [graph cleaning] rule out datasources without an officialname — Claudio Atzori / detail
  56. [actionsets] introduced support for the PromoteAction strategy — Claudio Atzori / detail
  57. [actionsets] fixed join type — Claudio Atzori / detail
  58. fixed import of ORPs stored on HDFS in the internal graph format (e.g. Datacite) — Claudio Atzori / detail
  59. added 2 new institutions in monitor — antleb / detail
  60. Dedup aliases, created when a dedup in a previous build has been merged in a new dedup, need to be marked as "deletedbyinference", since they are "merged" in the new dedup — Giambattista Bloisi / detail
  61. [graph raw] fixed mapping of the original resource type from the Datacite format — Claudio Atzori / detail
  62. [Transformative Agreement] add results with information about the agreement and the country of the organization that paid for it — Miriam Baglioni / detail
  63. [Tagging Projects and Datasource] first extension of bulktagging to add the context to projects and datasource — Miriam Baglioni / detail
  64. [Tagging Projects and Datasource] added test to check datasource tagging. Fixed issue — Miriam Baglioni / detail
  65. - — Miriam Baglioni / detail
  66. Promote "Research" to a jolly instanceType in dedup comparisons — Giambattista Bloisi / detail
  67. Promote "Research" to a jolly instanceType in dedup comparisons — Giambattista Bloisi / detail
  68. Add Action Set creation for affiliations inferred from the OpenAPC data — Serafeim Chatzopoulos / detail
  69. [Tagging Projects and Datasource] changed the way the pathMap parameter is passed. It was too long and was truncated — Miriam Baglioni / detail
  70. [Transformative Agreement] removed the relations from the ActionSet waiting to have the green light from Ioanna — Miriam Baglioni / detail
  71. mapping of project PIDs — Michele Artini / detail
  72. Implemented workflow for updating table, added step to check if the new generated table is valid — Sandro La Bruzzo / detail
  73. Revised procedure when converting json data into xml: — Giambattista Bloisi / detail
  74. following the comment on the pull requests: — Sandro La Bruzzo / detail
  75. formatted code — Sandro La Bruzzo / detail
  76. WIP — Claudio Atzori / detail
  77. When converting json to XML, remove characters that are not allowed in the XML 1.0 specs, as they will cause xpath failures even if escaped — Giambattista Bloisi / detail
  78. [OCNEW] first implementation — Miriam Baglioni / detail
  79. using distinct apcs per publication to avoid huge sums — Antonis Lempesis / detail
  80. [OCNEW] added creation of the actionset for the results classified with FoS based on the OpenAIRE identifier — Miriam Baglioni / detail
  81. [FOSNEW] removed test class — Miriam Baglioni / detail
  82. fixed the irish result subset — antleb / detail
  83. WIP: extended provision workflow to create the JSON based payload — Claudio Atzori / detail
  84. Enrich authors with ORCID info using new matching algorithm — Giambattista Bloisi / detail
  85. selecting distinct peer_reviewed — antleb / detail
  86. WIP: updated provision workflow to create a JSON based representation of the payload — Claudio Atzori / detail
  87. [OC New] last fix — Miriam Baglioni / detail
  88. [OC New] last fix — Miriam Baglioni / detail
  89. including related organization url in the XML record serialization (ticket #9498) — Claudio Atzori / detail
  90. expanded paper abstract in the result/children XML element (ticket #9497) — Claudio Atzori / detail
  91. implemented changes from #9497: sort abstracts by string length, included author fullnames in the related results, expanded instance details within each children/result XML element — Claudio Atzori / detail
  92. cleanup — Claudio Atzori / detail
  93. new plugin to collect from a dump of BASE — Claudio Atzori / detail
  94. mapped oaf:country from results — Michele Artini / detail
  95. apply commits from master — Michele Artini / detail
  96. updated BASE filter param — Michele Artini / detail
  97. xslt rules — Michele Artini / detail
  98. Unify merge logic of entities in MergeUtils.class — Giambattista Bloisi / detail
  99. Commit monitor-updates-wf — dpierrakos / detail
  100. code cleanup — Antonis Lempesis / detail
  101. code cleanup — antleb / detail
  102. code cleanup — antleb / detail
  103. fixed an identifier xpath — Michele Artini / detail
  104. Fix conditions that prevented ORCID Enrichment — Giambattista Bloisi / detail
  105. refactoring the Oaf records merge utilities into dhp-common — Claudio Atzori / detail
  106. fixed a problem with multiple nodes — Michele Artini / detail
  107. xslt rules and tests — Michele Artini / detail
  108. implemented default merge procedure applied to result.instance — Claudio Atzori / detail
  109. align dhp-schema.version with the beta branch — Claudio Atzori / detail
  110. integrated minor change from beta branch — Claudio Atzori / detail
  111. align dhp-schema.version with the beta branch — Claudio Atzori / detail
  112. further follow up changes from integrating the mergeutils branch — Claudio Atzori / detail
  113. included new stats* workflows in parent pom list of modules, code formatting — Claudio Atzori / detail
  114. Use the ACTIVE HDFS NODE for Impala cluster, in "copyDataToImpalaCluster.sh" script. — Lampros Smyrnaios / detail
  115. Automatically select the ACTIVE HDFS NODE for Impala cluster, in all "copyDataToImpalaCluster.sh" scripts. — Lampros Smyrnaios / detail
  116. Generate tables with parquet-files, instead of csv, in "dhp-stats-update/.../contexts.sh" script. — Lampros Smyrnaios / detail
  117. [BulkTagging - tag datasource and projects] merging with branch beta — Miriam Baglioni / detail
  118. code formatting — Claudio Atzori / detail
  119. added missing EOS — Antonis Lempesis / detail
  120. fixed typo in indicator query — Antonis Lempesis / detail

#136 (Jan 11, 2024 10:17:29 PM)

  1. enrichment with subworkflows — Claudio Atzori / detail

#136 (Jan 11, 2024 10:17:29 PM)

  1. [graph cleaning] added cleaning for result.publisher and result.instance.license — Claudio Atzori / detail
  2. fixed doiboost process workflow, removed references to the ProcessORCID step — Claudio Atzori / detail
  3. Extracted the correct original type to pass to instanceTypeMapping in Crossref Mapping — Sandro La Bruzzo / detail
  4. code formatting — Claudio Atzori / detail
  5. [graph grouping] added isLookupUrl to the workflow definition, passed to the grouping spark action — Claudio Atzori / detail
  6. avoid NPEs in Vocabulary.getTermBySynonym — Claudio Atzori / detail
  7. avoid NPEs — Claudio Atzori / detail
  8. avoid NPEs — Claudio Atzori / detail
  9. [bulktagging] fixed workflow parameters — Claudio Atzori / detail
  10. [community_organization propagation] fixed workflow parameters — Claudio Atzori / detail
  11. added serialization for the new fields imported for the Irish tender — Claudio Atzori / detail
  12. [dedup] added isLookupUrl to the graph consistency workflow definition, required now by the entity grouping phase — Claudio Atzori / detail
  13. [orcid enrichment] fixed workflow definition — Claudio Atzori / detail
  14. first version of the workflow single step — Miriam Baglioni / detail
  15. [bulktagging] setting first step of bulktagging as the copy of the entities and relations not involved in the tagging — Miriam Baglioni / detail
  16. [community_result_propagation] adjusting starting point of workflow — Miriam Baglioni / detail
  17. [enrichment] passing the community API base URL — Claudio Atzori / detail
  18. logging typo — Claudio Atzori / detail
  19. [graph cleaning] avoid stack overflow error when navigating Oaf objects declaring an Enum — Claudio Atzori / detail
  20. code formatting — Claudio Atzori / detail
  21. adjusting workflow definition — Miriam Baglioni / detail
  22. removed not needed parameter — Miriam Baglioni / detail
  23. [graph provision] added tests for the new model fields — Claudio Atzori / detail
  24. [cleaning] allow enriched orcids to pass the cleaning, rule out non-orcid author pids — Claudio Atzori / detail
  25. code formatting — Claudio Atzori / detail
  26. [graph provision] added tests for new peerreviewed field — Claudio Atzori / detail
  27. - — Miriam Baglioni / detail
  28. [doiboost - preprocess] remove transition to orcid preparation from sequence of steps at the beginning of the workflow — Miriam Baglioni / detail
  29. - — Miriam Baglioni / detail
  30. updated the transformation Baseline workflow to include mdstore rollback/commit action — Sandro La Bruzzo / detail
  31. uploaded input parameters on CreateBaseline WF — Sandro La Bruzzo / detail
  32. added needed parameter — Miriam Baglioni / detail
  33. - — Miriam Baglioni / detail
  34. refactoring after compilation — Miriam Baglioni / detail
  35. added metaresourcetype to the result hive DB view — Claudio Atzori / detail
  36. adjustments for country propagation — Miriam Baglioni / detail
  37. adding the bulkTag parameter file in the folder for the oozie workflow for bulkTagging. Changes the path in the class — Miriam Baglioni / detail
  38. changed the path to the parameter file in the class for entitytoorganization propagation — Miriam Baglioni / detail
  39. added properties file in the folder for the workflow of orcid propagation. Changes the path in the classes implementing the propagation. Changed the path to the parameter file in the class for entitytoorganization propagation — Miriam Baglioni / detail
  40. changed in the classes the path for the property files for the propagation of community from project — Miriam Baglioni / detail
  41. added properties file in the folder for the workflow of project to result propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
  42. added properties file in the folder for the workflow of result to community from organization propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
  43. added properties file in the folder for the workflow of result to community from semrel propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
  44. added properties file in the folder for the workflow of result to organization from inst repo propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
  45. SparkCreateSimRels: — Giambattista Bloisi / detail
  46. No longer use dedupId information from pivotHistory Database — Giambattista Bloisi / detail
  47. Generate "merged" dedup id relations also for records that are filtered out by the cut parameters — Giambattista Bloisi / detail
  48. Use dedup_wf_002 in place of dedup_wf_001 to make explicit that a different algorithm has been used to generate those kinds of ids — Giambattista Bloisi / detail
  49. Create dedup record for "merged" pivots — Giambattista Bloisi / detail
  50. refined mapping for the extraction of the original resource type — Claudio Atzori / detail

#134 (Dec 1, 2023 3:53:07 PM)

  1. replaced bip scores workflow with the Software Heritage one — Claudio Atzori / detail
  2. added step resulttocommunityfromproject — Claudio Atzori / detail
  3. renamed step resulttocommunityfromproject — Claudio Atzori / detail
  4. added step resulttocommunityfromproject to the BETA deployment — Claudio Atzori / detail
  5. added deploy specs for stats_actionset, download_orcid_dump, horizontal orcid enrichment — Claudio Atzori / detail

#134 (Dec 1, 2023 3:53:07 PM)

  1. changes to use the API instead of the IS to get the information for the communities to be used during bulktagging and context propagation — Miriam Baglioni / detail
  2. refactoring — Miriam Baglioni / detail
  3. [raw graph] adopting the new COAR based vocabularies for the resource typing — Claudio Atzori / detail
  4. used the API instead of the IS for bulktagging and propagation for community through organization. Added a new propagation step for communities through projects. Still using the API and not the IS — Miriam Baglioni / detail
  5. [raw graph] WIP: mapping original resource types — Claudio Atzori / detail
  6. testing and fix some issues — Miriam Baglioni / detail
  7. new spark parameter updated — Sandro La Bruzzo / detail
  8. [raw graph] mapping original resource types — Claudio Atzori / detail
  9. more NPE checks — Claudio Atzori / detail
  10. [graph raw] URL Validator to accept double slashes — Claudio Atzori / detail
  11. Add actionset creation for pubmed affiliations — Serafeim Chatzopoulos / detail
  12. fixing issue on propagation organization. added --config to workflow definition. added oozie_app to community project — Miriam Baglioni / detail
  13. Change the description of the workflow — Serafeim Chatzopoulos / detail
  14. StatsDB workflow to export actionsets about OA routes, diamond, and publicly-funded — dpierrakos / detail
  15. Renaming input param for crossref input path — Serafeim Chatzopoulos / detail
  16. Adjust tests to new WF input params — Serafeim Chatzopoulos / detail
  17. [graph cleaning] implemented further suggestions from https://support.openaire.eu/issues/8898 — Claudio Atzori / detail
  18. [graph cleaning] cleanup — Claudio Atzori / detail
  19. test for project propagation — Miriam Baglioni / detail
  20. removed not needed test class — Miriam Baglioni / detail
  21. - — Miriam Baglioni / detail
  22. refactoring and test — Miriam Baglioni / detail
  23. changing test for new implementation — Miriam Baglioni / detail
  24. refactoring — Miriam Baglioni / detail
  25. - — Miriam Baglioni / detail
  26. Changes to actionsets — dpierrakos / detail
  27. Implemented ORCID Workflow on DHP-Aggregation for retrieving ORCID DUMP and generating tables — Sandro La Bruzzo / detail
  28. - — Miriam Baglioni / detail
  29. Changes for tables and creation of the new indicator indi_is_result_accessible — dpierrakos / detail
  30. [graph cleaning] applying coar based vocabularies in bulk — Claudio Atzori / detail
  31. Update StatsAtomicActionsJob.java — dpierrakos / detail
  32. Implemented ORCID Enrichment — Sandro La Bruzzo / detail
  33. changed the parameter from production to baseURL. Fixed issue in tagging configuration — Miriam Baglioni / detail
  34. refactoring — Miriam Baglioni / detail
  35. Implemented Author Merger for ORCID that takes into account the case when name and surname are swapped — Sandro La Bruzzo / detail
  36. added comment — Sandro La Bruzzo / detail
  37. Changed implementation of check similarity to verify exact match of name instead of the first char — Sandro La Bruzzo / detail
  38. added test — Sandro La Bruzzo / detail
  39. added instanceTypeMapping original field in the mapping of — Sandro La Bruzzo / detail
  40. added vocabulary in instanceTypeMapping for — Sandro La Bruzzo / detail
  41. removed Orcid intersection on DOIBoost — Sandro La Bruzzo / detail
  42. Added copy of the untouched entities of the graph — Sandro La Bruzzo / detail
  43. code formatting — Sandro La Bruzzo / detail
  44. Update StatsAtomicActionsJob.java — dpierrakos / detail
  45. Removed unused function — Sandro La Bruzzo / detail
  46. Changes to indicators — dpierrakos / detail
  47. Add new indicator — dpierrakos / detail
  48. New institutions added — dpierrakos / detail
  49. using objectSubType as originalType in Crossref2Oaf, code formatting — Claudio Atzori / detail
  50. code formatting — Claudio Atzori / detail

#133 (Oct 20, 2023 10:30:49 PM)

  1. replaced bip scores workflow with the Software Heritage one — Claudio Atzori / detail

#133 (Oct 20, 2023 10:30:49 PM)

  1. Changes — dpierrakos / detail
  2. Update step15.sql — dpierrakos / detail
  3. Changes in indicators step, monitor step — dpierrakos / detail
  4. Update step16-createIndicatorsTables.sql — dpierrakos / detail
  5. Add collecting software code repository URLs — Serafeim Chatzopoulos / detail
  6. Update step16-createIndicatorsTables.sql — dpierrakos / detail
  7. Add steps to collect last visit data && archive not found repository URLs — Serafeim Chatzopoulos / detail
  8. Add step for archiving repoUrls to SWH — Serafeim Chatzopoulos / detail
  9. extending the coverage of the peer non-unknown refereed instances — Claudio Atzori / detail
  10. Add action for creating actionsets — Serafeim Chatzopoulos / detail
  11. Add param for limiting repo Urls — Serafeim Chatzopoulos / detail
  12. Restructure workflow parameters — Serafeim Chatzopoulos / detail
  13. Add actionsetsPath as a global WF param — Serafeim Chatzopoulos / detail
  14. Add SWH in the collectedFrom field — Serafeim Chatzopoulos / detail
  15. Move SWH API Key from constants to workflow param — Serafeim Chatzopoulos / detail
  16. cleanup and refinements — Claudio Atzori / detail
  17. Add prefix in SWH ID — Serafeim Chatzopoulos / detail
  18. ignored jenv prop — Sandro La Bruzzo / detail
  19. implemented relation to irish funder from a Json list — Sandro La Bruzzo / detail
  20. code formatting — Claudio Atzori / detail
  21. [SWH] aligned parameter name — Claudio Atzori / detail
  22. [SWH] compress the output actionset — Claudio Atzori / detail
  23. Fix cleaning of Pmid where parsing of numbers stopped at the first non-leading '0' character — Claudio Atzori / detail
  24. [OC] compress the output actionset — Claudio Atzori / detail
  25. [OC] using the common pid cleaning function — Claudio Atzori / detail
  26. [Doiboost] removed linkage to SFI unidentified project — Claudio Atzori / detail
  27. Update step16-createIndicatorsTables.sql — dpierrakos / detail
  28. Update step20-createMonitorDB.sql — dpierrakos / detail
  29. code formatting — Claudio Atzori / detail
  30. extend the fos model to include the level4 and the scores for level3 and level4. removed bip indicators from the instance — Miriam Baglioni / detail
  31. removed module dhp-stats-monitor-update — Claudio Atzori / detail
  32. leftover for the properties and removal of bipfinder — Miriam Baglioni / detail
  33. [UnresolvedEntities] updated action name — Claudio Atzori / detail
  34. [graph cleaning] avoid NPEs — Claudio Atzori / detail
  35. [AMF] docs — Claudio Atzori / detail
  36. cleanup & docs — Claudio Atzori / detail
  37. [SWH] renamed 'Software Heritage Identifier' to 'Software Hash Identifier' — Claudio Atzori / detail
  38. [dedup] use common saveParquet and save methods to ensure outputs are compressed — Claudio Atzori / detail
  39. FIX: GroupEntitiesSparkJob deletes whole graph outputPath instead of its temporary folder — Giambattista Bloisi / detail
  40. avoid NPEs — Claudio Atzori / detail
  41. added defaults to the graph resolution workflow config-default.xml — Claudio Atzori / detail
  42. depending on dhp-schemas:3.17.2 — Claudio Atzori / detail