--add_parses

Switch

--add_parses

Description

Creates parsed versions of the message table.

Argument and Default Value

None

Details

This switch creates three tables (for a given table name TABLE): TABLE_const, TABLE_pos, TABLE_dep. Each table contains the same columns and contents as the original table except for the contents of the message column.

The message column in TABLE_const is now a tree structure corresponding to the grammatical structure of the message. For example:

  • original message: Everything is breaking apart.
  • parsed message: (ROOT (S (NP (NNP Everything)) (VP (VBZ is) (VP (VBG breaking) (ADVP (RB apart)))) (. .)))

The message column in TABLE_dep is a list of dependencies which provide a representation of grammatical relations between words in a sentence. For example:

  • original message: Everything is breaking apart.
  • parsed message: ['nsubj(breaking-3, Everything-1)', 'aux(breaking-3, is-2)', 'root(ROOT-0, breaking-3)', 'advmod(breaking-3, apart-4)']

The message column in TABLE_pos is a part of speech tagged version of the original message. For example:

  • original message: Everything is breaking apart.
  • parsed message: Everything/NNP is/VBZ breaking/VBG apart/RB ./.

Note that TABLE_pos is tagged according to the Penn Treebank Project tags.

For more details, download the Part-of-speech tagging annotation style manual.

Other Switches

Required Switches:

Example Commands

# creates the table msgs_const, msgs_pos, msgs_dep
./dlatkInterface.py -d dla_tutorial -t msgs -c message_id --add_parses
mysql> select message from msgs_const limit 1;
+-------------------------------------------------------------------------------------------------+
| message                                                                                         |
+-------------------------------------------------------------------------------------------------+
| (ROOT (S (VP (VB urlLink) (NP (DT The) (NNP Obligatory) (NNP Field) (NNP Shot) (NN urlLink))))) |
+-------------------------------------------------------------------------------------------------+

mysql> select message from msgs_pos limit 1;
+----------------------------------------------------------------+
| message                                                        |
+----------------------------------------------------------------+
| urlLink/VB The/DT Obligatory/NNP Field/NNP Shot/NNP urlLink/NN |
+----------------------------------------------------------------+

mysql> select message from msgs_dep limit 1;
+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| message                                                                                                                                                              |
+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| ['root(ROOT-0, urlLink-1)', 'det(urlLink-6, The-2)', 'nn(urlLink-6, Obligatory-3)', 'nn(urlLink-6, Field-4)', 'nn(urlLink-6, Shot-5)', 'dobj(urlLink-1, urlLink-6)'] |
+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+