How to annotate a vertical file?

1) As a first step, you need to create a corpus in vertical format. It means that every token / structure needs to be on a separate line and attributes (such as tag, lemma…) are added to the next columns (separated by a tab). Each column then contains one attribute, each line contains all attributes belonging to one token.

If you are annotating a corpus which already exists in Sketch Engine, the best way is to download the corpus as the vertical file and add one more column for each attribute you want to include. To check the columns are separated correctly, you can paste the content into a spreadsheet (e.g. MS Excel). Each attribute (word, tag, lemma…) should appear in separate cells.

2) As a second step (before uploading the corpus), it is needed to create your own Template (My Sketch Engine -> My Templates). We can help you with this step, please contact-us.

Generally, the best practice saving your time is to create a small sample of your data which you can send to us. We will check and confirm that the selected format is correct. Also, we can prepare a corpus template for your data and create an example corpus from your sample so you could check how the results look like. Then you can decide if you want to continue preparing data in that way or you need to change something first.

related topic: