class allennlp.models.semantic_parsing.atis.atis_semantic_parser.AtisSemanticParser(vocab:, utterance_embedder: allennlp.modules.text_field_embedders.text_field_embedder.TextFieldEmbedder, action_embedding_dim: int, encoder: allennlp.modules.seq2seq_encoders.seq2seq_encoder.Seq2SeqEncoder, decoder_beam_search: allennlp.state_machines.beam_search.BeamSearch, max_decoding_steps: int, input_attention: allennlp.modules.attention.attention.Attention, add_action_bias: bool = True, training_beam_size: int = None, decoder_num_layers: int = 1, dropout: float = 0.0, rule_namespace: str = 'rule_labels', database_file='/atis/atis.db')[source]

Bases: allennlp.models.model.Model


Embedder for utterances.


Dimension to use for action embeddings.


The encoder to use for the input utterance.


Beam search used to retrieve best sequences after training.


When we’re decoding with a beam search, what’s the maximum number of steps we should take? This only applies at evaluation time, not during training.

input_attention: ``Attention``

We compute an attention over the input utterance at each step of the decoder, using the decoder hidden state as the query. Passed to the transition function.

add_action_biasbool, optional (default=True)

If True, we will learn a bias weight for each action that gets used when predicting that action, in addition to its embedding.

dropoutfloat, optional (default=0)

If greater than 0, we will apply dropout with this probability after all encoders (pytorch LSTMs do not apply dropout to their last layer).

rule_namespacestr, optional (default=rule_labels)

The vocabulary namespace to use for production rules. The default corresponds to the default used in the dataset reader, so you likely don’t need to modify this.

database_file: ``str``, optional (default=/atis/atis.db)

The path of the SQLite database when evaluating SQL queries. SQLite is disk based, so we need the file location to connect to it.

decode(self, output_dict: Dict[str, torch.Tensor]) → Dict[str, torch.Tensor][source]

This method overrides Model.decode, which gets called after Model.forward, at test time, to finalize predictions. This is (confusingly) a separate notion from the “decoder” in “encoder/decoder”, where that decoder logic lives in TransitionFunction.

This method trims the output predictions to the first end symbol, replaces indices with corresponding tokens, and adds a field called predicted_actions to the output_dict.

forward(self, utterance: Dict[str, torch.LongTensor], world: List[allennlp.semparse.worlds.atis_world.AtisWorld], actions: List[List[]], linking_scores: torch.Tensor, target_action_sequence: torch.LongTensor = None, sql_queries: List[List[str]] = None) → Dict[str, torch.Tensor][source]

We set up the initial state for the decoder, and pass that state off to either a DecoderTrainer, if we’re training, or a BeamSearch for inference, if we’re not.

utteranceDict[str, torch.LongTensor]

The output of TextField.as_array() applied on the utterance TextField. This will be passed through a TextFieldEmbedder and then through an encoder.


We use a MetadataField to get the World for each input instance. Because of how MetadataField works, this gets passed to us as a List[AtisWorld],


A list of all possible actions for each World in the batch, indexed into a ProductionRule using a ProductionRuleField. We will embed all of these and use the embeddings to determine which action to take at each timestep in the decoder.

linking_scores: ``torch.Tensor``

A matrix of the linking the utterance tokens and the entities. This is a binary matrix that is deterministically generated where each entry indicates whether a token generated an entity. This tensor has shape (batch_size, num_entities, num_utterance_tokens).

target_action_sequencetorch.Tensor, optional (default=None)

The action sequence for the correct action sequence, where each action is an index into the list of possible actions. This tensor has shape (batch_size, sequence_length, 1). We remove the trailing dimension.

sql_queriesList[List[str]], optional (default=None)

A list of the SQL queries that are given during training or validation.

get_metrics(self, reset: bool = False) → Dict[str, float][source]

We track four metrics here:

1. exact_match, which is the percentage of the time that our best output action sequence matches the SQL query exactly.

2. denotation_acc, which is the percentage of examples where we get the correct denotation. This is the typical “accuracy” metric, and it is what you should usually report in an experimental result. You need to be careful, though, that you’re computing this on the full data, and not just the subset that can be parsed. (make sure you pass “keep_if_unparseable=True” to the dataset reader, which we do for validation data, but not training data).

3. valid_sql_query, which is the percentage of time that decoding actually produces a valid SQL query. We might not produce a valid SQL query if the decoder gets into a repetitive loop, or we’re trying to produce a super long SQL query and run out of time steps, or something.

4. action_similarity, which is how similar the action sequence predicted is to the actual action sequence. This is basically a soft measure of exact_match.

static is_nonterminal(token: str)[source]