Knowledge Graphs Generation from unstructured text
bash
$ python3 common/stanfordcorenlp/server.py
(syntax: python3 common/stanfordcorenlp/server.py -h)
bash
$ python3 pipeline.py text.txt -p senna -s -k cso -ng
(syntax: python3 pipeline.py -h)
bash
$ cd preprocessor
$ python3 preprocessor.py text.txt
(syntax: python3 preprocessor.py -h)
bash
$ cd facts_extractor
$ python3 extractor.py text_preprocessed.txt -p senna -s
(syntax: python3 extractor.py -h)
bash
$ cd kb_linker
$ python3 linker.py text_preprocessed.txt -k cso
(syntax: python3 linker.py -h)
bash
$ cd rdf_maker
$ python3 maker.py text_preprocessed_triples.txt -l text_preprocessed_links.txt
(syntax: python3 maker.py -h)
bash
$ cd graph_generator
$ python3 generator.py text_preprocessed_kg.ttl
(syntax: python3 generator.py -h)
bash
$python3 common/stanfordcorenlp/server.py -k
(or simply Ctrl+C in its shell)
Rossanez, A.; Dos Reis, J. C.; Torres, R. S.; De Ribaupierre, H. KGen: A Knowledge Graph Generator from Biomedical Scientific Literature. BMC Medical Informatics and Decision Making, v. 20, p. 314, 2020.
Rossanez, A.; Dos Reis, J. C. Generating Knowledge Graphs from Scientific Literature of Degenerative Diseases. In Proceedings of the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019), co-located with the 18th International Semantic Web Conference (ISWC 2019). Aachen: CEUR Workshop Proceedings, 2019. v. 2427. p. 12-23.
```bash $ du -h /Users/b/KGen/ | sort -hr | head -n 10
3.1G /Users/b/KGen/ 2.9G /Users/b/KGen/.git/objects/pack 2.9G /Users/b/KGen/.git/objects 2.9G /Users/b/KGen/.git 183M /Users/b/KGen/examples 152M /Users/b/KGen/examples/cs 146M /Users/b/KGen/examples/cs/ISWC 31M /Users/b/KGen/examples/biomedical 19M /Users/b/KGen/examples/biomedical/tentative 12M /Users/b/KGen/examples/biomedical/reduced ```
Environment: macOS Monterey 12.1 (21C52)
```bash $ python3 common/stanfordcorenlp/server.py
Starting Stanford CoreNLP Server from /Users/b/kgen/common/stanfordcorenlp
Stanford CoreNLP Server startup command: java -Djava.io.tmpdir="/tmp/" -mx5g -cp "/Users/b/kgen/common/stanfordcorenlp/stanford-corenlp.jar:/Users/b/kgen/common/stanfordcorenlp/stanford-corenlp-models.jar:/Users/b/kgen/common/stanfordcorenlp/slf4j-api.jar:/Users/b/kgen/common/stanfordcorenlp/slf4j-simple.jar:/Users/b/kgen/common/stanfordcorenlp/ejml.jar" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 9000 -timeout 15000
Error: Could not find or load main class edu.stanford.nlp.pipeline.StanfordCoreNLPServer
Caused by: java.lang.ClassNotFoundException: edu.stanford.nlp.pipeline.StanfordCoreNLPServer
Traceback (most recent call last):
File "/Users/b/kgen/common/stanfordcorenlp/server.py", line 119, in
Hi @rossanez I came across this repo while doing some research for a hack-a-thon in the EU, and there is no license specified.
Here's a guide from GitHub on adding an open source license using their templates:
https://docs.github.com/en/communities/setting-up-your-project-for-healthy-contributions/adding-a-license-to-a-repository
Hi @rossanez, I came across your repo while doing some research on KG extraction from unstructured text. It seems there is no requirements file included in this repo, it would be a great help if you can include that and few instructions on installation of stanford nlp server. Thanks in Advance...:)