Extract IDs from ORCID profiles

This tool was developed to extract IDs from ORCID profiles. It is written in Bash and Perl and uses XSL Transformations to produce its output, so it can be adapted for other purposes.

At the moment the output is in CSV format, which can be opened in OpenOffice, LibreOffice or MS Excel.

Something like this:

|        ORCID        |          Name         | ResearcherID  |      Scopus Author ID    |
| ------------------- | --------------------- | ------------- | ------------------------ |
| 0000-0001-5006-0738 | Martim                | J-3442-2183   | 11540149010              |
| 0000-0001-5005-8758 | Paulo                 |               | 27412014820              |
| 0000-0001-5005-7265 | Mafalda               |               |                          |
| 0000-0001-5005-610X | Maria                 |               | 55966341300||51951231450 |
| 0000-0001-5004-8517 | William               |               |                          |
| 0000-0001-5005-1201 | João                  |               | 21019375500              |
| 0000-0001-5005-2404 | Emanuel               |               |                          |
| 0000-0001-5005-3594 | Sergio                | M-2054-4313   | 6288906719               |
| 0000-0001-5006-3006 | Carla                 |               | 55970003233              |

Installation

This tool requires Perl to be installed on your system, along with some libXML libraries for parsing and transforming XML.

For Ubuntu systems:

sudo apt-get install libxml-libxslt-perl libxml-xpath-perl libxml-perl
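
To confirm that the Perl modules are available after installing the packages, a quick check along these lines may help. It is only a sketch: `have_perl_module` is a hypothetical helper, and it assumes the tool loads `XML::LibXSLT` and `XML::XPath`, which are the modules shipped by the first two packages above.

```shell
# Return success if the given Perl module can be loaded.
have_perl_module() {
    perl -M"$1" -e 1 >/dev/null 2>&1
}

# Assumed module list; adjust it to whatever the scripts actually use.
for module in XML::LibXSLT XML::XPath; do
    if have_perl_module "$module"; then
        echo "ok: $module"
    else
        echo "missing: $module"
    fi
done
```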

How to use

The tool works in two steps. The first downloads the profile files for the ORCIDs given in an input file or as arguments. The second processes each file in the download directory and prints the result.

bash get_orcid_data.sh --file /path/to/orcid_list.txt >> /path/to/output_data.csv

output_data.csv result:

"0000-0001-5006-0738","Martim","J-3442-2183","11540149010"
"0000-0001-5005-8758","Paulo","","27412014820"
"0000-0001-5005-7265","Mafalda","",""
"0000-0001-5005-610X","Maria","","55966341300||51951231450"
"0000-0001-5004-8517","William","",""
"0000-0001-5005-1201","João","","21019375500"
"0000-0001-5005-2404","Emanuel","",""
"0000-0001-5006-3006","Carla","","55970003233"

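When a profile has more than one Scopus Author ID, the values share one field separated by `||`. If you need one row per ID, something like the following sketch could post-process the CSV. `split_scopus_ids` is a hypothetical helper, not part of this tool, and it assumes the four-column layout shown above with no commas or double quotes inside the values.

```shell
# Read the CSV on stdin and print "ORCID<TAB>Scopus ID", one line per ID.
split_scopus_ids() {
    awk -F, '{
        gsub(/"/, "")                 # drop the quotes; awk re-splits the fields
        n = split($4, ids, /\|\|/)    # the fourth column may hold several IDs
        for (i = 1; i <= n; i++)
            if (ids[i] != "") print $1 "\t" ids[i]
    }'
}
```

For example, `split_scopus_ids < /path/to/output_data.csv` would list Maria's two Scopus IDs on separate lines and skip profiles without one.
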
Other uses

You can edit orcid-map.xslt to output the data in other formats. You can also replace orcid-bio in the URL http://pub.orcid.org/v1.2/${1}/orcid-bio with affiliations to retrieve the user's affiliations, or with another resource to retrieve any other type of information provided by ORCID.
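
The URL change can be pictured with a small helper that takes the resource as a parameter. This is only an illustration: `orcid_url` is a hypothetical function, not something the scripts define, and the resource name `affiliations` is taken from the text above rather than checked against the API.

```shell
# Build the public API URL for one ORCID.
# $1: ORCID, $2: resource name (defaults to orcid-bio)
orcid_url() {
    printf 'http://pub.orcid.org/v1.2/%s/%s\n' "$1" "${2:-orcid-bio}"
}

orcid_url 0000-0001-5006-0738
orcid_url 0000-0001-5006-0738 affiliations
```

Note that a different resource returns different XML, so orcid-map.xslt would most likely need to be adapted as well.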

Need help or want to contribute?

Please contact us at FCT|FCCN or any committer.