|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectjava_cup.runtime.lr_parser
public abstract class lr_parser
This class implements a skeleton table driven LR parser. In general, LR parsers are a form of bottom up shift-reduce parsers. Shift-reduce parsers act by shifting input onto a parse stack until the Symbols matching the right hand side of a production appear on the top of the stack. Once this occurs, a reduce is performed. This involves removing the Symbols corresponding to the right hand side of the production (the so called "handle") and replacing them with the non-terminal from the left hand side of the production.
To control the decision of whether to shift or reduce at any given point, the parser uses a state machine (the "viable prefix recognition machine" built by the parser generator). The current state of the machine is placed on top of the parse stack (stored as part of a Symbol object representing a terminal or non terminal). The parse action table is consulted (using the current state and the current lookahead Symbol as indexes) to determine whether to shift or to reduce. When the parser shifts, it changes to a new state by pushing a new Symbol (containing a new state) onto the stack. When the parser reduces, it pops the handle (right hand side of a production) off the stack. This leaves the parser in the state it was in before any of those Symbols were matched. Next the reduce-goto table is consulted (using the new state and current lookahead Symbol as indexes) to determine a new state to go to. The parser then shifts to this goto state by pushing the left hand side Symbol of the production (also containing the new state) onto the stack.
This class actually provides four LR parsers. The methods parse() and debug_parse() provide two versions of the main parser (the only difference being that debug_parse() emits debugging trace messages as it parses). In addition to these main parsers, the error recovery mechanism uses two more. One of these is used to simulate "parsing ahead" in the input without carrying out actions (to verify that a potential error recovery has worked), and the other is used to parse through buffered "parse ahead" input in order to execute all actions and re-synchronize the actual parser configuration.
This is an abstract class which is normally filled out by a subclass generated by the JavaCup parser generator. In addition to supplying the actual parse tables, generated code also supplies methods which invoke various pieces of user supplied code, provide access to certain special Symbols (e.g., EOF and error), etc. Specifically, the following abstract methods are normally supplied by generated code:
Symbol
,
Symbol
,
virtual_parse_stack
Field Summary | |
---|---|
protected boolean |
_done_parsing
Internal flag to indicate when parser should quit. |
protected static int |
_error_sync_size
The default number of Symbols after an error we much match to consider it recovered from. |
protected short[][] |
action_tab
Direct reference to the action table. |
protected Symbol |
cur_token
The current lookahead Symbol. |
protected Symbol[] |
lookahead
Lookahead Symbols used for attempting error recovery "parse aheads". |
protected int |
lookahead_pos
Position in lookahead input buffer used for "parse ahead". |
protected short[][] |
production_tab
Direct reference to the production table. |
protected short[][] |
reduce_tab
Direct reference to the reduce-goto table. |
protected java.util.Stack |
stack
The parse stack itself. |
SymbolFactory |
symbolFactory
|
protected int |
tos
Indication of the index for top of stack (for use by actions). |
Constructor Summary | |
---|---|
lr_parser()
Simple constructor. |
|
lr_parser(Scanner s)
Constructor that sets the default scanner. |
|
lr_parser(Scanner s,
SymbolFactory symfac)
Constructor that sets the default scanner and a SymbolFactory |
Method Summary | |
---|---|
abstract short[][] |
action_table()
The action table (supplied by generated subclass). |
protected boolean |
advance_lookahead()
Advance to next "parse ahead" input Symbol. |
protected Symbol |
cur_err_token()
Return the current lookahead in our error "parse ahead" buffer. |
void |
debug_message(java.lang.String mess)
Write a debugging message to System.err for the debugging version of the parser. |
Symbol |
debug_parse()
Perform a parse with debugging output. |
void |
debug_reduce(int prod_num,
int nt_num,
int rhs_size)
Do debug output for a reduce. |
void |
debug_shift(Symbol shift_tkn)
Do debug output for shift. |
void |
debug_stack()
Do debug output for stack state. |
abstract Symbol |
do_action(int act_num,
lr_parser parser,
java.util.Stack stack,
int top)
Perform a bit of user supplied action code (supplied by generated subclass). |
void |
done_parsing()
This method is called to indicate that the parser should quit. |
void |
dump_stack()
Dump the parse stack for debugging purposes. |
abstract int |
EOF_sym()
The index of the end of file terminal Symbol (supplied by generated subclass). |
protected boolean |
error_recovery(boolean debug)
Attempt to recover from a syntax error. |
abstract int |
error_sym()
The index of the special error Symbol (supplied by generated subclass). |
protected int |
error_sync_size()
The number of Symbols after an error we much match to consider it recovered from. |
protected boolean |
find_recovery_config(boolean debug)
Put the (real) parse stack into error recovery configuration by popping the stack down to a state that can shift on the special error Symbol, then doing the shift. |
protected short |
get_action(int state,
int sym)
Fetch an action from the action table. |
protected short |
get_reduce(int state,
int sym)
Fetch a state from the reduce-goto table. |
Scanner |
getScanner()
Simple accessor method to get the default scanner. |
SymbolFactory |
getSymbolFactory()
Whenever creation of a new Symbol is necessary, one should use this factory. |
protected abstract void |
init_actions()
Initialize the action object. |
protected void |
parse_lookahead(boolean debug)
Parse forward using stored lookahead Symbols. |
Symbol |
parse()
This method provides the main parsing routine. |
abstract short[][] |
production_table()
Table of production information (supplied by generated subclass). |
protected void |
read_lookahead()
Read from input to establish our buffer of "parse ahead" lookahead Symbols. |
abstract short[][] |
reduce_table()
The reduce-goto table (supplied by generated subclass). |
void |
report_error(java.lang.String message,
java.lang.Object info)
Report a non fatal error (or warning). |
void |
report_fatal_error(java.lang.String message,
java.lang.Object info)
Report a fatal error. |
protected void |
restart_lookahead()
Reset the parse ahead input to one Symbol past where we started error recovery (this consumes one new Symbol from the real input). |
Symbol |
scan()
Get the next Symbol from the input (supplied by generated subclass). |
void |
setScanner(Scanner s)
Simple accessor method to set the default scanner. |
protected boolean |
shift_under_error()
Determine if we can shift under the special error Symbol out of the state currently on the top of the (real) parse stack. |
abstract int |
start_production()
The index of the start production (supplied by generated subclass). |
abstract int |
start_state()
The index of the start state (supplied by generated subclass). |
void |
syntax_error(Symbol cur_token)
This method is called when a syntax error has been detected and recovery is about to be invoked. |
protected boolean |
try_parse_ahead(boolean debug)
Do a simulated parse forward (a "parse ahead") from the current stack configuration using stored lookahead input and a virtual parse stack. |
protected static short[][] |
unpackFromStrings(java.lang.String[] sa)
Utility function: unpacks parse tables from strings |
void |
unrecovered_syntax_error(Symbol cur_token)
This method is called if it is determined that syntax error recovery has been unsuccessful. |
void |
user_init()
User code for initialization inside the parser. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public SymbolFactory symbolFactory
protected static final int _error_sync_size
protected boolean _done_parsing
protected int tos
protected Symbol cur_token
protected java.util.Stack stack
protected short[][] production_tab
protected short[][] action_tab
protected short[][] reduce_tab
protected Symbol[] lookahead
protected int lookahead_pos
Constructor Detail |
---|
public lr_parser()
public lr_parser(Scanner s)
public lr_parser(Scanner s, SymbolFactory symfac)
Method Detail |
---|
public SymbolFactory getSymbolFactory()
protected int error_sync_size()
public abstract short[][] production_table()
public abstract short[][] action_table()
get_action(int, int)
public abstract short[][] reduce_table()
get_reduce(int, int)
public abstract int start_state()
public abstract int start_production()
public abstract int EOF_sym()
public abstract int error_sym()
public void done_parsing()
public void setScanner(Scanner s)
public Scanner getScanner()
public abstract Symbol do_action(int act_num, lr_parser parser, java.util.Stack stack, int top) throws java.lang.Exception
act_num
- the internal index of the action to be performed.parser
- the parser object we are acting for.stack
- the parse stack of that object.top
- the index of the top element of the parse stack.
java.lang.Exception
public void user_init() throws java.lang.Exception
java.lang.Exception
protected abstract void init_actions() throws java.lang.Exception
java.lang.Exception
public Symbol scan() throws java.lang.Exception
java.lang.Exception
public void report_fatal_error(java.lang.String message, java.lang.Object info) throws java.lang.Exception
message
- an error message.info
- an extra object reserved for use by specialized subclasses.
java.lang.Exception
public void report_error(java.lang.String message, java.lang.Object info)
message
- an error message.info
- an extra object reserved for use by specialized subclasses.public void syntax_error(Symbol cur_token) throws java.lang.Exception
cur_token
- the current lookahead Symbol.
java.lang.Exception
public void unrecovered_syntax_error(Symbol cur_token) throws java.lang.Exception
cur_token
- the current lookahead Symbol.
java.lang.Exception
protected final short get_action(int state, int sym)
state
- the state index of the action being accessed.sym
- the Symbol index of the action being accessed.protected final short get_reduce(int state, int sym)
state
- the state index of the entry being accessed.sym
- the Symbol index of the entry being accessed.public Symbol parse() throws java.lang.Exception
java.lang.Exception
public void debug_message(java.lang.String mess)
mess
- the text of the debugging message.public void dump_stack()
public void debug_reduce(int prod_num, int nt_num, int rhs_size)
prod_num
- the production we are reducing with.nt_num
- the index of the LHS non terminal.rhs_size
- the size of the RHS.public void debug_shift(Symbol shift_tkn)
shift_tkn
- the Symbol being shifted onto the stack.public void debug_stack()
public Symbol debug_parse() throws java.lang.Exception
java.lang.Exception
protected boolean error_recovery(boolean debug) throws java.lang.Exception
debug
- should we produce debugging messages as we parse.
java.lang.Exception
protected boolean shift_under_error()
protected boolean find_recovery_config(boolean debug)
debug
- should we produce debugging messages as we parse.protected void read_lookahead() throws java.lang.Exception
java.lang.Exception
protected Symbol cur_err_token()
protected boolean advance_lookahead()
protected void restart_lookahead() throws java.lang.Exception
java.lang.Exception
protected boolean try_parse_ahead(boolean debug) throws java.lang.Exception
debug
- should we produce debugging messages as we parse.
java.lang.Exception
protected void parse_lookahead(boolean debug) throws java.lang.Exception
debug
- should we produce debugging messages as we parse.
java.lang.Exception
protected static short[][] unpackFromStrings(java.lang.String[] sa)
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |