BatsResearch, updated 2022-01-21 20:12:49

Weakly Supervised Sequence Tagging from Noisy Rules - Reproducibility Code

Getting Started

These instructions will get you a copy of our experiments up and running on your local machine for development and testing purposes.

Installing

In your virtual environment, please install the required dependencies using

pip install -r requirements.txt

Or alternatively

conda install --file requirements.txt
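After installing, it can be useful to confirm that the environment matches the pinned versions. The following is a minimal sketch, not part of the repository: it assumes requirements.txt uses exact `name==version` pins, and the helper names `parse_pins` and `find_mismatches` are hypothetical. Lines in any other form are skipped rather than checked.

```python
from importlib import metadata
from pathlib import Path


def parse_pins(text):
    """Return {name: version} for lines of the form `name==version`."""
    pins = {}
    for raw in text.splitlines():
        # Drop trailing comments and environment markers, then whitespace.
        line = raw.split("#", 1)[0].split(";", 1)[0].strip()
        if "==" not in line:
            continue  # blank line, or not an exact pin (assumption: skip it)
        name, version = line.split("==", 1)
        pins[name.strip()] = version.strip()
    return pins


def find_mismatches(pins):
    """Return {name: (wanted, installed_or_None)} for pins that are not met."""
    mismatches = {}
    for name, wanted in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    req = Path("requirements.txt")
    if req.is_file():
        problems = find_mismatches(parse_pins(req.read_text()))
        for name, (wanted, installed) in problems.items():
            print(f"{name}: wanted {wanted}, found {installed}")
    else:
        print("requirements.txt not found in the current directory")
```

Run it from the repository root inside the same virtual environment. No output means every exact pin that was found is satisfied.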

Datasets

Our experiments depend on the following datasets and resources. Most need to be downloaded separately; the UMLS dictionaries are provided in our code.

  • BC5CDR: Download and install the train, development, and test BioCreative V CDR corpus data files. Place the three separate files inside data/BC5CD

  • NCBI Disease: Download and install the complete training, development, and testing sets. Place the three separate files inside data/NCBI.

  • LaptopReview: Download the train data V2.0 for the Laptops and Restaurants dataset, and place the Laptop_Train_v2.xml file inside data/LaptopReview. Then, download the test data - phase B, and place the Laptops_Test_Data_phaseB.xml file inside the same directory.

  • CoNLL v5: Download and compile the English dataset version 5.0, and place it in data/conll-formatted-ontonotes-5.0.

  • Scibert: Download the scibert-scivocab-uncased version of the Scibert embeddings, and place the files weights.tar.gz and vocab.txt inside data/scibert_scibocab_uncased.

  • UMLS: The UMLS dictionaries have been extracted from the UMLS 2018AB dataset and are provided in our code. They are distributed according to the License Agreement for Use of the UMLS® Metathesaurus®.

  • AutoNER Dictionaries: The AutoNER dictionaries for the BC5CDR, LaptopReview, and NCBI datasets have been generously provided by Jingbo Shang et al. They have been sourced from the EMNLP 2018 paper "Learning Named Entity Tagger using Domain-Specific Dictionary".
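Because the files come from several sources, a quick check of the directory layout can catch a misplaced download before running anything. The sketch below is an illustration rather than repository code: the `EXPECTED` list and the `missing_paths` helper are hypothetical names, and the paths are copied from the list above exactly as written there. BC5CDR is omitted because its target directory is not spelled out in full above, and the individual NCBI and CoNLL file names are not listed, so only the directories are checked for those.

```python
from pathlib import Path

# Paths taken from the dataset list. Entries ending in "/" are directories;
# all other entries are expected to be regular files.
EXPECTED = [
    "data/NCBI/",
    "data/LaptopReview/Laptop_Train_v2.xml",
    "data/LaptopReview/Laptops_Test_Data_phaseB.xml",
    "data/conll-formatted-ontonotes-5.0/",
    "data/scibert_scibocab_uncased/weights.tar.gz",
    "data/scibert_scibocab_uncased/vocab.txt",
]


def missing_paths(root, expected=EXPECTED):
    """Return the expected entries that do not exist under `root`."""
    root = Path(root)
    missing = []
    for entry in expected:
        path = root / entry
        found = path.is_dir() if entry.endswith("/") else path.is_file()
        if not found:
            missing.append(entry)
    return missing


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    for entry in missing_paths("."):
        print(f"missing: {entry}")
```

An empty result only means the listed paths exist; it does not validate the contents of the files.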

Citation

Please cite the following paper if you are using our tool. Thank you!

Esteban Safranchik, Shiying Luo, Stephen H. Bach. "Weakly Supervised Sequence Tagging From Noisy Rules". In 34th AAAI Conference on Artificial Intelligence, 2020.

@inproceedings{safranchik2020weakly,
  title = {Weakly Supervised Sequence Tagging From Noisy Rules},
  author = {Safranchik, Esteban and Luo, Shiying and Bach, Stephen H.},
  booktitle = {AAAI},
  year = 2020,
}

Bats Research

We are a machine learning research group at Brown University. We work on improving the processes by which humans teach computers.
