Changes
#142 (Apr 19, 2024 9:46:03 PM)
- [BETA] added the deployment instruction in the jenkins file for beta related to the creation of the action set to include the results tagged with FoS without a doi — Miriam Baglioni / detail
- [DUMP] added Jenkins file for the deployment of the dumps — Miriam Baglioni / detail
- indentation — Claudio Atzori / detail
#142 (Apr 19, 2024 9:46:03 PM)
- code formatted — Sandro La Bruzzo / detail
- added first part of refactoring of the code generating MAG, — Sandro La Bruzzo / detail
- Implemented class that generates a normalized table of MAG, which is the starting point for the creation of the mag source — Sandro La Bruzzo / detail
- added oozie workflow — Sandro La Bruzzo / detail
- update mag mapping — Sandro La Bruzzo / detail
- Moved Crossref Mapping on dhp-aggregations, — Sandro La Bruzzo / detail
- code refactor — Sandro La Bruzzo / detail
- applied cherry pick — Sandro La Bruzzo / detail
- update crossref mapping to be runnable separately as a single datasource outside doiboost — Sandro La Bruzzo / detail
- update crossref mapping to be transformed together with UnpayWall — Sandro La Bruzzo / detail
- Improved Crossref mapping to include also unpaywall tested — Sandro La Bruzzo / detail
- fixed the result_country definition — Antonis Lempesis / detail
- Add action set creation for Datacite affiliations — Serafeim Chatzopoulos / detail
- added new orgs in monitor — antleb / detail
- [UsageCount] add check in case the datasource is not matched against those present in the graph — Miriam Baglioni / detail
- [UsageCount] fixed error — Miriam Baglioni / detail
- Fix datacite input path in properties file — Serafeim Chatzopoulos / detail
- [OpenCitation] add compression option when writing the sequence file — Miriam Baglioni / detail
- base datainfo with trust=0.89 — Michele Artini / detail
- [NOAMI] removed entry for Health and Social Care Board from the list of funders. Modified IRC putting 1596 and 1597 as synonyms, as required in ticket 9635 — Miriam Baglioni / detail
- - Update the code which acquires the "IMPALA_HDFS_NODE", to test the "tmp"-dir, instead of the base-dir and introduce retries, to overcome potential file-system failures. This change was suggested by "Sebastian Tymkow" and "Grzegorz Bakalarski". — Lampros Smyrnaios / detail
- [MapToFunderLink]added references for HFRI and Erasmus+ for the creation of links for funders — Miriam Baglioni / detail
- Implemented first part of the new MAG mapping — Sandro La Bruzzo / detail
- [DataciteHostedByMap] added entry for EBRAINS — Miriam Baglioni / detail
- [DataciteHostedByMap] added entry for EBRAINS — Miriam Baglioni / detail
- completed mapping from paper to OAF, not tested — Sandro La Bruzzo / detail
- Updated mapping — Sandro La Bruzzo / detail
- mapping generated for MAG, — Sandro La Bruzzo / detail
- added instanceTypeMapping field on MAG — Sandro La Bruzzo / detail
- fixed duplicated property dhp-schemas.version — Claudio Atzori / detail
- Upgrade the copying operation to Impala Cluster: — Lampros Smyrnaios / detail
- Use the "HADOOP_USER_NAME" value from the "workflow-property", in "copyDataToImpalaCluster.sh", in "stats-monitor-updates". — Lampros Smyrnaios / detail
- updated wf of MAG and crossref to use transaction — Sandro La Bruzzo / detail
- added vocabulary tu instanceTypeMApping of Mag — Sandro La Bruzzo / detail
- code formatted — Sandro La Bruzzo / detail
- Miscellaneous updates to the copying operation to Impala Cluster: — Lampros Smyrnaios / detail
- Minor updates to the copying operation to Impala Cluster: — Lampros Smyrnaios / detail
- - Bug fix in matchOrderedTokenAndAbbreviations algorithms where tokens with same initial character were always considered equal — Giambattista Bloisi / detail
- test — Michele Artini / detail
- added a couple more invalid author names — Claudio Atzori / detail
- Added Action set generation for the MAG organization — Sandro La Bruzzo / detail
- removed the funder id : 100011062 Asian Spinal Cord Network, wrongly associated to Ireland — Miriam Baglioni / detail
- Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common — Giambattista Bloisi / detail
- Various fixes for the stats DB update workflow, step16-createIndicatorsTables.sql — Claudio Atzori / detail
- integrating changes from PR#424 — Claudio Atzori / detail
- updated Ignore annotation that is deprecated to Disabled — Sandro La Bruzzo / detail
- [graph indexing] sets spark memoryOverhead in the join operations to the same value used for the memory executor — Claudio Atzori / detail
- [transformative agreement] including reuslt-funder relations to the information imported from the TRs — Claudio Atzori / detail
- updated schema version — Claudio Atzori / detail
#137 (Apr 2, 2024 11:09:22 AM)
- added workflow for updating the dedup pivot history database — Claudio Atzori / detail
- updated deployment spec for PROD — Claudio Atzori / detail
- Updated deployments of orcid collection from api — Sandro La Bruzzo / detail
- Change DNET_HADOOP_REPO_BRANCH default value to "beta" for BETA pipelines — Giambattista Bloisi / detail
- added deployment specs for dhp-stats-monitor-update, dhp-stats-monitor-irish, dhp-stats-hist-snaps — Claudio Atzori / detail
- removed deployment spec for dhp-stats-monitor-update — Claudio Atzori / detail
- added deployment specs for dhp-stats-monitor-irish, dhp-stats-hist-snaps — Claudio Atzori / detail
- [BETA] added the deployment instruction in the jenkins file for beta related to the creation of the action set to extend the results with the open apc transformative agreemnt file information — Miriam Baglioni / detail
#137 (Apr 2, 2024 11:09:22 AM)
- [UsageCount] split the count for result at the level of the datasource. for each indicator one unit is specified for each datasource contrinuting to that indicator value. The datasource key is the value of the key element in the unit for the measure, while the count for that datasource is in the value — Miriam Baglioni / detail
- refactoring — Miriam Baglioni / detail
- [Transformative Agreement] added code to extract relations from the transformative agreement file for the IE products got from OpenAPC — Miriam Baglioni / detail
- [Transformative Agreement] removed not needed class. Read directly the json and no need to pass from the csv — Miriam Baglioni / detail
- [Transformative Agreement] added check to verify the APC were paid byu the IReL funder — Miriam Baglioni / detail
- Changes to indicators and funders definition — dpierrakos / detail
- Monitor Irish Stats WF — dpierrakos / detail
- Historical Snapshots Workflow — dpierrakos / detail
- Update buildIrishMonitorDB.sql — dpierrakos / detail
- fixed the result_country definition — Antonis Lempesis / detail
- Changes to beta db names — dpierrakos / detail
- Changes to indicators — dpierrakos / detail
- Implemented Download update of ORCID — Sandro La Bruzzo / detail
- Added workflow — Sandro La Bruzzo / detail
- code refactor — Sandro La Bruzzo / detail
- added some useful comment — Sandro La Bruzzo / detail
- creating result_instances even when no pids exist for the instance — Antonis Lempesis / detail
- fix issue on FoS integration. Removing the null values from FoS — Miriam Baglioni / detail
- fixed missing parameter on download update — Sandro La Bruzzo / detail
- Fixed error of connection timeout — Sandro La Bruzzo / detail
- Reusable RunSQLSparkJob for executing SQL in Spark through Oozie Spark Actions — Giambattista Bloisi / detail
- [enrichment single step] refactoring to fix issue in disappeared result type — Miriam Baglioni / detail
- [enrichment single step] refactoring to fix issues in disappeared result type — Miriam Baglioni / detail
- [enrichment single step] remove parameter from execution — Miriam Baglioni / detail
- - — Miriam Baglioni / detail
- [enrichment single step] moving parameter file in correct location — Miriam Baglioni / detail
- [enrichment single step] adding <end> element in wf definition — Miriam Baglioni / detail
- increased shuffle partitions for publications in the country propagation workflow — Claudio Atzori / detail
- [orcid enrichment] drop paths before copying the non-modifyed contents — Claudio Atzori / detail
- [graph provision] obtain context info from the context API instead from the ISLookUp service — Claudio Atzori / detail
- code formatting — Claudio Atzori / detail
- [graph provision] updated param specification for the XML converter job — Claudio Atzori / detail
- [BulkTagging] extend the definition of the pathMap to include also actions that should be performed of the value extracted from the result befor applying the constraint — Miriam Baglioni / detail
- compilation after merging — Miriam Baglioni / detail
- logg added during download — Sandro La Bruzzo / detail
- [collection] increased logging from the oai-pmh metadata collection process — Claudio Atzori / detail
- [graph provision] retrieve all the context information by adding all=true to the requests issued to thr API — Claudio Atzori / detail
- added code of conduct and contributing files — Claudio Atzori / detail
- minor — Claudio Atzori / detail
- Update 'CONTRIBUTING.md' — Claudio Atzori / detail
- max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%. — Antonis Lempesis / detail
- max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%. — Claudio Atzori / detail
- Added workflow to update ORCID and replaced some parsing, because the update works and employments xml differs from the dump one. — Sandro La Bruzzo / detail
- Changed step16-createIndicatorsTables to use a spark oozie action instead of hive — antleb / detail
- [collection] increased logging from the oai-pmh metadata collection process — Claudio Atzori / detail
- Fixed problem on missing author in crossref Mapping — Sandro La Bruzzo / detail
- Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf — Giambattista Bloisi / detail
- Added exception throwing in Hadoop transformation when TR is not syntactically valid — Sandro La Bruzzo / detail
- [UsageCount] code extention to include also the name of the datasource — Miriam Baglioni / detail
- [orcid-enrichment] change the value of parameters. — Miriam Baglioni / detail
- [bulkTagging] removing checks while performing the substring action so that it will fire an Exception if the paramneters are wrongly set — Miriam Baglioni / detail
- changed orcid ids to all capital — antleb / detail
- test for Italian records from IRS repositories — Alessia Bardi / detail
- [orcid enrichment] fixed directory cleanup before distcp — Claudio Atzori / detail
- [graph cleaning] rule out datasources without an officialname — Claudio Atzori / detail
- [actiosets] introduced support for the PromoteAction strategy — Claudio Atzori / detail
- [actiosets] fixed join type — Claudio Atzori / detail
- fixed import of ORPs stored on HDFS in the internal graph format (e.g. Datacite) — Claudio Atzori / detail
- added 2 new institutions in monitor — antleb / detail
- Dedup aliases, created when a dedup in a previous build has been merged in a new dedup, need to be marked as "deletedbyinference", since they are "merged" in the new dedup — Giambattista Bloisi / detail
- [graph raw] fixed mapping of the original resource type from the Datacite format — Claudio Atzori / detail
- [Transformative Agreement] add results with information abount the agreement and the country of the organization paid for it — Miriam Baglioni / detail
- [Tagging Projects and Datasource] first extention of bulktagging to add the context to projects and datasource — Miriam Baglioni / detail
- [Tagging Projects and Datasource] added test to check datasource tagging. Fixed issue — Miriam Baglioni / detail
- - — Miriam Baglioni / detail
- Promote "Research" to a jolly instanceType in dedup comparisons — Giambattista Bloisi / detail
- Promote "Research" to a jolly instanceType in dedup comparisons — Giambattista Bloisi / detail
- Add Action Set creation for affiliations inferred from the OpenAPC data — Serafeim Chatzopoulos / detail
- [Tagging Projects and Datasource] changed the way the pathMap parameter is passed. It was too long and was truncated — Miriam Baglioni / detail
- [Transformative Agreement] removed the relations from the ActionSet waiting to have the gree light from Ioanna — Miriam Baglioni / detail
- mapping of project PIDs — Michele Artini / detail
- Implemented workflow for updating table , added step to check if the new generated table is valid — Sandro La Bruzzo / detail
- Revised procedure when converting json data into xml: — Giambattista Bloisi / detail
- following the comment on the pull requests: — Sandro La Bruzzo / detail
- formatted code — Sandro La Bruzzo / detail
- WIP — Claudio Atzori / detail
- When converting json to XML, remove characters that are not allowed in the XML 1.0 specs, as they will cause xpath failures even if escaped — Giambattista Bloisi / detail
- [OCNEW] first implementation — Miriam Baglioni / detail
- using distinct apcs per publication to avoid huge sums — Antonis Lempesis / detail
- [OCNEW] added creation of the actionset for the results classified with FoS based ont he OpenAIRE identifier — Miriam Baglioni / detail
- [FOSNEW] removed test class — Miriam Baglioni / detail
- fixed the irish result subset — antleb / detail
- WIP: extended provision workflow to create the JSON based payload — Claudio Atzori / detail
- Enrich authors with ORCID info using new matching algorithm — Giambattista Bloisi / detail
- selecting distinct peer_reviewed — antleb / detail
- WIP: updated provision workflow to create a JSON based representation of the payload — Claudio Atzori / detail
- [OC New] last fix — Miriam Baglioni / detail
- [OC New] last fix — Miriam Baglioni / detail
- including related organization url in the XML record serialization (ticket #9498) — Claudio Atzori / detail
- expanded paper abstract in the result/children XML element (ticket #9497) — Claudio Atzori / detail
- implemented changes from #9497: sort abstracts by string length, included author fullnames in the related results, expanded instance details within each children/result XML element — Claudio Atzori / detail
- cleanup — Claudio Atzori / detail
- new plugin to collect from a dump of BASE — Claudio Atzori / detail
- mapped oaf:country from results — Michele Artini / detail
- apply commits from master — Michele Artini / detail
- updated BASE filter param — Michele Artini / detail
- xslt rules — Michele Artini / detail
- Unify merge logic of entities in MergeUtils.class — Giambattista Bloisi / detail
- Commit monitor-updates-wf — dpierrakos / detail
- code cleanup — Antonis Lempesis / detail
- code cleanup — antleb / detail
- code cleanup — antleb / detail
- fixed an identifier xpath — Michele Artini / detail
- Fix conditions that prevented ORCID Enrichment — Giambattista Bloisi / detail
- refactoring the Oaf records merge utilities into dhp-common — Claudio Atzori / detail
- fixed a problem with multiple nodes — Michele Artini / detail
- xslt rules and tests — Michele Artini / detail
- implemented default merge procedure applied to result.instance — Claudio Atzori / detail
- align dhp-schema.version with the beta branch — Claudio Atzori / detail
- integrated minor change from beta branch — Claudio Atzori / detail
- align dhp-schema.version with the beta branch — Claudio Atzori / detail
- further follow up changes from integrating the mergeutils branch — Claudio Atzori / detail
- included new stats* workflows in parent pom list of modules, code formatting — Claudio Atzori / detail
- Use the ACTIVE HDFS NODE for Impala cluster, in "copyDataToImpalaCluster.sh" script. — Lampros Smyrnaios / detail
- Automatically select the ACTIVE HDFS NODE for Impala cluster, in all "copyDataToImpalaCluster.sh" scripts. — Lampros Smyrnaios / detail
- Generate tables with parquet-files, instead of csv, in "dhp-stats-update/.../contexts.sh" script. — Lampros Smyrnaios / detail
- [BulkTagging - tag datasource and projects]merging with branch beta — Miriam Baglioni / detail
- code formatting — Claudio Atzori / detail
- added missing EOS — Antonis Lempesis / detail
- fixed typo in indicator query — Antonis Lempesis / detail
#136 (Jan 11, 2024 10:17:29 PM)
- enrichment with subworkflows — Claudio Atzori / detail
#136 (Jan 11, 2024 10:17:29 PM)
- [graph cleaning] added cleaning for result.publisher and result.instance.license — Claudio Atzori / detail
- fixed doiboost process workflow, removed references to the ProcessORCID step — Claudio Atzori / detail
- Extracted the correct original type to pass to instanceTypeMapping in Crossref Mapping — Sandro La Bruzzo / detail
- code formatting — Claudio Atzori / detail
- [graph grouping] added isLookupUrl to the workflow definition, passed to the grouping spark aciton — Claudio Atzori / detail
- avoid NPEs in Vocabulary.getTermBySynonym — Claudio Atzori / detail
- avoid NPEs — Claudio Atzori / detail
- avoid NPEs — Claudio Atzori / detail
- [bulktagging] fixed workflow parameters — Claudio Atzori / detail
- [community_organization propagation] fixed workflow parameters — Claudio Atzori / detail
- added serialization for the new fields imported for the Irish tender — Claudio Atzori / detail
- [dedup] added isLookupUrl to the graph consistency workflow definition, required now by the entity grouping phase — Claudio Atzori / detail
- [orcid enrichment] fixed workflow definition — Claudio Atzori / detail
- first version of the workflow single step — Miriam Baglioni / detail
- [bulktagging] setting first step of bulktaggin as the copy of the entities and relations not involved in the tagging' — Miriam Baglioni / detail
- [community_result_propagation] adjusting starting poit of workflow — Miriam Baglioni / detail
- [enrichment] passing the community API base URL — Claudio Atzori / detail
- logging typo — Claudio Atzori / detail
- [graph cleaning] avoid stack overflow error when navigating Oaf objects declaring an Enum — Claudio Atzori / detail
- code formatting — Claudio Atzori / detail
- adjusting workflow definition — Miriam Baglioni / detail
- removed not needed parameter — Miriam Baglioni / detail
- [graph provision] added tests for the new model fields — Claudio Atzori / detail
- [cleaning] allow enriched orcids to pass the cleaning, rule out non-orcid author pids — Claudio Atzori / detail
- code formatting — Claudio Atzori / detail
- [graph provision] added tests for new peerreviewed field — Claudio Atzori / detail
- - — Miriam Baglioni / detail
- [doiboost - preprocess] remove transition to orcid preparation from sequence of steps at the beginning of the workflow — Miriam Baglioni / detail
- - — Miriam Baglioni / detail
- updated the transformation Baseline workflow to include mdstore rollback/commit action — Sandro La Bruzzo / detail
- uploaded input parameters on CreateBaseline WF — Sandro La Bruzzo / detail
- added needed parameter — Miriam Baglioni / detail
- - — Miriam Baglioni / detail
- refactoring after compiletion — Miriam Baglioni / detail
- added metaresourcetype to the result hive DB view — Claudio Atzori / detail
- adjustments for country propagation — Miriam Baglioni / detail
- adding the bulkTag parameter file in the folder for the oozie workflow for bulkTagging. Changes the path in the class — Miriam Baglioni / detail
- changed the path to the parameter file in the class for entitytoorganization propagation — Miriam Baglioni / detail
- added properties file in the forlder for the workflow of orcid propagation. Changes the path in the classes implementing the propagationchanged the path to the parameter file in the class for entitytoorganization propagation — Miriam Baglioni / detail
- changed in the classes the path for the property files for the propagation of community from project — Miriam Baglioni / detail
- added properties file in the forlder for the workflow of project to result propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
- added properties file in the forlder for the workflow of result to community from organization propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
- added properties file in the forlder for the workflow of result to community from semrel propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
- added properties file in the forlder for the workflow of result to organization from inst repo propagation. Changes the path in the classes implementing the propagation — Miriam Baglioni / detail
- SparkCreateSimRels: — Giambattista Bloisi / detail
- Do no longer use dedupId information from pivotHistory Database — Giambattista Bloisi / detail
- Generate "merged" dedup id relations also for records that are filtered out by the cut parameters — Giambattista Bloisi / detail
- Use dedup_wf_002 in place of dedup_wf_001 to make explicit a different algorithm has been used to generate those kind of ids — Giambattista Bloisi / detail
- Create dedup record for "merged" pivots — Giambattista Bloisi / detail
- refined mapping for the extraction of the original resource type — Claudio Atzori / detail
#134 (Dec 1, 2023 3:53:07 PM)
- replaced bip scores workflow with the Software Heritage one — Claudio Atzori / detail
- added step resulttocommunityfromproject — Claudio Atzori / detail
- renamed step resulttocommunityfromproject — Claudio Atzori / detail
- added step resulttocommunityfromproject to the BETA deployment — Claudio Atzori / detail
- added deploy specs for stats_actionset, download_orcid_dump, horizontal orcid enrichment — Claudio Atzori / detail
#134 (Dec 1, 2023 3:53:07 PM)
- changes to use the API instead of the IS the get the information for the communities to be used during bulktagging and context propagation — Miriam Baglioni / detail
- refactoring — Miriam Baglioni / detail
- [raw graph] adopting the new COAR based vocabularies for the resource typing — Claudio Atzori / detail
- used the API instead of the IS for bulktagging and propagation for community through organization. Added a new propagation step for communities through projects. Still using the API and not the IS — Miriam Baglioni / detail
- [raw graph] WIP: mapping original resource types — Claudio Atzori / detail
- testing and fix some issues — Miriam Baglioni / detail
- new spark parrameter updated — Sandro La Bruzzo / detail
- [raw graph] mapping original resource types — Claudio Atzori / detail
- more NPE checks — Claudio Atzori / detail
- [graph raw] URL Validator to accept double slashes — Claudio Atzori / detail
- Add actionset creation for pubmed affiliations — Serafeim Chatzopoulos / detail
- fixing issue on propagation organization. added --config to workflow definition. added oozie_app to communtiy project — Miriam Baglioni / detail
- Change the description of the workflow — Serafeim Chatzopoulos / detail
- StatsDB workflow to export actionsets about OA routes, diamond, and publicly-funded — dpierrakos / detail
- Renaming input param for crossref input path — Serafeim Chatzopoulos / detail
- Adjust tests to new WF input params — Serafeim Chatzopoulos / detail
- [graph cleaning] implemented further suggestions from https://support.openaire.eu/issues/8898 — Claudio Atzori / detail
- [graph cleaning] cleanup — Claudio Atzori / detail
- test for project propagation — Miriam Baglioni / detail
- removed not needed test class — Miriam Baglioni / detail
- - — Miriam Baglioni / detail
- refactoring and test — Miriam Baglioni / detail
- changing test for new implementation — Miriam Baglioni / detail
- refactoring — Miriam Baglioni / detail
- - — Miriam Baglioni / detail
- Changes to actionsets — dpierrakos / detail
- Implemented ORCID Workflow on DHP-Aggregation for retrieving ORCID DUMP and generating tables — Sandro La Bruzzo / detail
- - — Miriam Baglioni / detail
- Changes for tables and creation of the new indicator indi_is_result_accessible — dpierrakos / detail
- [graph cleaning] applying coar based vocabularies in bulk — Claudio Atzori / detail
- Update StatsAtomicActionsJob.java — dpierrakos / detail
- Implemented ORCID Enrichment — Sandro La Bruzzo / detail
- changed the parameter from production to baseURL. Fixed issue in tagging configuration — Miriam Baglioni / detail
- refactoring — Miriam Baglioni / detail
- Implemented Author MErger for ORCID that takes in account the case when name and surname are swapped — Sandro La Bruzzo / detail
- added comment — Sandro La Bruzzo / detail
- Changed implementation of check similarity to verify exact match of name instead of the first char — Sandro La Bruzzo / detail
- added test — Sandro La Bruzzo / detail
- added instanceTypeMapping original field in the mapping of — Sandro La Bruzzo / detail
- added vocabulary in instanceTypeMapping for — Sandro La Bruzzo / detail
- removed Orcid intersection on DOIBoost — Sandro La Bruzzo / detail
- Added copy of the untouched entities of the graph — Sandro La Bruzzo / detail
- code formatting — Sandro La Bruzzo / detail
- Update StatsAtomicActionsJob.java — dpierrakos / detail
- Removed unused function — Sandro La Bruzzo / detail
- Changes to indicators — dpierrakos / detail
- Add new indicator — dpierrakos / detail
- New institutions added — dpierrakos / detail
- using objectSubType as originalType in Crossref2Oaf, code formatting — Claudio Atzori / detail
- code formatting — Claudio Atzori / detail
#133 (Oct 20, 2023 10:30:49 PM)
- replaced bip scores workflow with the Software Heritage one — Claudio Atzori / detail
#133 (Oct 20, 2023 10:30:49 PM)
- Changes — dpierrakos / detail
- Update step15.sql — dpierrakos / detail
- Changes in indicators step, monitor step — dpierrakos / detail
- Update step16-createIndicatorsTables.sql — dpierrakos / detail
- Add collecting software code repository URLs — Serafeim Chatzopoulos / detail
- Update step16-createIndicatorsTables.sql — dpierrakos / detail
- Add steps to collect last visit data && archive not found repository URLs — Serafeim Chatzopoulos / detail
- Add step for archiving repoUrls to SWH — Serafeim Chatzopoulos / detail
- extending the coverage of the peer non-unknown refereed instances — Claudio Atzori / detail
- Add action for creating actionsets — Serafeim Chatzopoulos / detail
- Add param for limiting repo Urls — Serafeim Chatzopoulos / detail
- Restructure workflow parameters — Serafeim Chatzopoulos / detail
- Add actionsetsPath as a global WF param — Serafeim Chatzopoulos / detail
- Add SWH in the collectedFrom field — Serafeim Chatzopoulos / detail
- Move SWH API Key from constants to workflow param — Serafeim Chatzopoulos / detail
- cleanup and refinements — Claudio Atzori / detail
- Add prefix in SWH ID — Serafeim Chatzopoulos / detail
- ignored jenv prop — Sandro La Bruzzo / detail
- implemented relation to irish funder from a Json list — Sandro La Bruzzo / detail
- code formatting — Claudio Atzori / detail
- [SWH] aligned parameter name — Claudio Atzori / detail
- [SWH] compress the output actionset — Claudio Atzori / detail
- Fix cleaning of Pmid where parsing of numbers stopped at first not leading 0' character — Claudio Atzori / detail
- [OC] compress the output actionset — Claudio Atzori / detail
- [OC] using the common pid cleaning function — Claudio Atzori / detail
- [Doiboost] removed linkage to SFI unidentified project — Claudio Atzori / detail
- Update step16-createIndicatorsTables.sql — dpierrakos / detail
- Update step20-createMonitorDB.sql — dpierrakos / detail
- code formatting — Claudio Atzori / detail
- extend the fos model to include the level4 and the scores for level3 and level4. removed bip indicators from the instance — Miriam Baglioni / detail
- removed module dhp-stats-monitor-update — Claudio Atzori / detail
- leftover for the properties and removal of bipfinder — Miriam Baglioni / detail
- [UnresolvedEntities] updated action name — Claudio Atzori / detail
- [graph cleaning] avoid NPEs — Claudio Atzori / detail
- [AMF] docs — Claudio Atzori / detail
- cleanup & docs — Claudio Atzori / detail
- [SWH] renamed 'Software Heritage Identifier' to 'Software Hash Identifier' — Claudio Atzori / detail
- [dedup] use common saveParquet and save methods to ensure outputs are compressed — Claudio Atzori / detail
- FIX: GroupEntitiesSparkJob deletes whole graph outputPath instead of its temporary folder — Giambattista Bloisi / detail
- avoid NPEs — Claudio Atzori / detail
- added defaults to the graph resolution workflow config-default.xml — Claudio Atzori / detail
- depending on dhp-schemas:3.17.2 — Claudio Atzori / detail