package FSA::Rules; # $Id: Rules.pm 3126 2006-08-29 16:46:40Z theory $ use strict; $FSA::Rules::VERSION = '0.26'; =begin comment Fake-out Module::Build. Delete if it ever changes to support =head1 headers other than all uppercase. =head1 NAME FSA::Rules - Build simple rules-based state machines in Perl =end comment =head1 Name FSA::Rules - Build simple rules-based state machines in Perl =head1 Synopsis my $fsa = FSA::Rules->new( ping => { do => sub { print "ping!\n"; my $state = shift; $state->result('pong'); $state->machine->{count}++; }, rules => [ game_over => sub { shift->machine->{count} >= 20 }, pong => sub { shift->result eq 'pong' }, ], }, pong => { do => sub { print "pong!\n" }, rules => [ ping => 1, ], # always goes back to ping }, game_over => { do => sub { print "Game Over\n" } } ); $fsa->start; $fsa->switch until $fsa->at('game_over'); =head1 Description This class implements a simple state machine pattern, allowing you to quickly build rules-based state machines in Perl. As a simple implementation of a powerful concept, it differs slightly from an ideal DFA model in that it does not enforce a single possible switch from one state to another. Rather, it short circuits the evaluation of the rules for such switches, so that the first rule to return a true value will trigger its switch and no other switch rules will be checked. (But see the C attribute and parameter to C.) It differs from an NFA model in that it offers no back-tracking. But in truth, you can use it to build a state machine that adheres to either model--hence the more generic FSA moniker. FSA::Rules uses named states so that it's easy to tell what state you're in and what state you want to go to. Each state may optionally define actions that are triggered upon entering the state, after entering the state, and upon exiting the state. They may also define rules for switching to other states, and these rules may specify the execution of switch-specific actions. All actions are defined in terms of anonymous subroutines that should expect an FSA::State object itself to be passed as the sole argument. FSA::Rules objects and the FSA::State objects that make them up are all implemented as empty hash references. This design allows the action subroutines to use the FSA::State object passed as the sole argument, as well as the FSA::Rules object available via its C method, to stash data for other states to access, without the possibility of interfering with the state or the state machine itself. =head2 Serialization As of version 0.24, FSA::Rules supports serialization by L 2.05 and later. In other words, FSA::Rules can function as a persistent state machine. However, FSA::Rules stores data outside of FSA::Rules objects, in private data structures inside the FSA::Rules module itself. Therefore, unless you want to clone your FSA::Rules object, you must let it fall out of scope after you serialize it, so that its data will be cleared from memory. Otherwise, if you freeze and thaw an FSA::Rules object in a single process without Cing the original, there will be I copies of the object stored by FSA::Rules. So how does it work? Because the rules are defined as code references, you must use Storable 2.05 or later and set its C<$Deparse> and C<$Eval> variables to true values: use Storable qw(freeze thaw); local $Storable::Deparse = 1; local $Storable::Eval = 1; my $frozen = freeze($fsa); $fsa = thaw($frozen); The only caveat is that, while Storable can serialize code references, it doesn't properly reference closure variables. So if your rules code references are closures, you'll have to serialize the data that they refer to yourself. =cut ############################################################################## =head1 Class Interface =head2 Constructor =head3 new my $fsa = FSA::Rules->new( foo_state => { ... }, bar_state => { ... }, ); $fsa = FSA::Rules->new( \%params, foo_state => { ... }, bar_state => { ... }, ); Constructs and returns a new FSA::Rules object. An optional first argument is a hash reference that may contain one or more of these keys: =over =item start Causes the C method to be called on the machine before returning it. =item done A value to which to set the C attribute. =item strict A value to which to set the C attribute. =item state_class The name of the class to use for state objects. Defaults to "FSA::State". Use this parameter if you want to use a subclass of FSA::State. =back All other parameters define the state table, where each key is the name of a state and the following hash reference defines the state, its actions, and its switch rules. These state specifications will be converted to FSA::State objects available via the C method. The first state parameter is considered to be the start state; call the C method to automatically enter that state. The supported keys in the state definition hash references are: =over =item label label => 'Do we have a username?', label => 'Create a new user', A label for the state. It might be the question that is being asked within the state (think decsision tree), the answer to which determins which rule will trigger the switch to the next state. Or it might merely describe what's happening in the state. =item on_enter on_enter => sub { ... } on_enter => [ sub {... }, sub { ... } ] Optional. A code reference or array reference of code references. These will be executed when entering the state, after any switch actions defined by the C of the previous state. The FSA::State for which the C actions are defined will be passed to each code reference as the sole argument. =item do do => sub { ... } do => [ sub {... }, sub { ... } ] Optional. A code reference or array reference of code references. These are the actions to be taken while in the state, and will execute after any C actions. The FSA::State object for which the C actions are defined will be passed to each code reference as the sole argument. =item on_exit on_exit => sub { ... } on_exit => [ sub {... }, sub { ... } ] Optional. A code reference or array reference of code references. These will be executed when exiting the state, before any switch actions (defined by C). The FSA::State object for which the C actions are defined will be passed to each code reference as the sole argument. =item rules Optional. The rules for switching from the state to other states. This is an array reference but shaped like a hash. The keys are the names of the states to consider moving to, while the values are the rules for switching to that state. The rules will be executed in the order specified in the array reference, and I unless the C attribute has been set to a true value. So for the sake of efficiency it's worthwhile to specify the switch rules most likely to evaluate to true before those more likely to evaluate to false. Rules themselves are best specified as hash references with the following keys: =over =item rule A code reference or value that will be evaluated to determine whether to switch to the specified state. The value must be true or the code reference must return a true value to trigger the switch to the new state, and false not to switch to the new state. When executed, it will be passed the FSA::State object for the state for which the rules were defined, along with any other arguments passed to C or C--the methods that execute the rule code references. These arguments may be inputs that are specifically tested to determine whether to switch states. To be polite, rules should not transform the passed values if they're returning false, as other rules may need to evaluate them (unless you're building some sort of chaining rules--but those aren't really rules, are they?). =item message An optional message that will be added to the current state when the rule specified by the C parameter evaluates to true. The message will also be used to label switches in the output of the C method. =item action A code reference or an array reference of code references to be executed during the switch, after the C actions have been executed in the current state, but before the C actions execute in the new state. Two arguments will be passed to these code references: the FSA::State object for the state for which they were defined, and the FSA::State object for the new state (which will not yet be the current state). =back A couple of examples: rules => [ foo => { rule => 1 }, bar => { rule => \&goto_bar, message => 'Have we got a bar?', }, yow => { rule => \&goto_yow, message => 'Yow!', action => [ \&action_one, \&action_two], } ] A rule may also simply be a code reference or value that will be evaluated when FSA::Rules is determining whether to switch to the new state. You might want just specify a value or code reference if you don't need a message label or switch actions to be executed. For example, this C specification: rules => [ foo => 1 ] Is equivalent to this C specification: rules => [ foo => { rule => 1 } ] And finally, you can specify a rule as an array reference. In this case, the first item in the array will be evaluated to determine whether to switch to the new state, and any other items must be code references that will be executed during the switch. For example, this C specification: rules => [ yow => [ \&check_yow, \&action_one, \&action_two ] ] Is equivalent to this C specification: rules => [ yow => { rule => \&check_yow, action => [ \&action_one, \&action_two ], } ] =back =cut my (%machines, %states); sub new { my $class = shift; my $self = bless {}, $class; my $params = ref $_[0] ? shift : {}; my $fsa = $machines{$self} = { done => undef, notes => {}, stack => [], table => {}, self => $self, }; $params->{state_class} ||= 'FSA::State'; while (@_) { my $state = shift; my $def = shift; $self->_croak(qq{The state "$state" already exists}) if exists $fsa->{table}{$state}; # Setup enter, exit, and do actions. for (qw(on_enter do on_exit)) { if (my $ref = ref $def->{$_}) { $def->{$_} = [$def->{$_}] if $ref eq 'CODE'; } else { $def->{$_} = []; } } # Create the state object and cache the state data. my $obj = $params->{state_class}->new; $def->{name} = $state; $def->{machine} = $self; $fsa->{table}{$state} = $obj; push @{$fsa->{ord}}, $obj; $states{$obj} = $def; } # Setup rules. We process the table a second time to catch invalid # references. while (my ($key, $obj) = each %{$fsa->{table}}) { my $def = $states{$obj}; if (my $rule_spec = $def->{rules}) { my @rules; while (@$rule_spec) { my $state = shift @$rule_spec; $self->_croak( qq{Unknown state "$state" referenced by state "$key"} ) unless $fsa->{table}{$state}; my $rules = shift @$rule_spec; my $exec = ref $rules eq 'ARRAY' ? $rules : [$rules]; my $rule = shift @$exec; my $message; if (ref $rule eq 'HASH') { $self->_croak( qq{In rule "$state", state "$key": you must supply a rule.} ) unless exists $rule->{rule}; $exec = ref $rule->{action} eq 'ARRAY' ? $rule->{action} : [$rule->{action}] if exists $rule->{action}; $message = $rule->{message} if exists $rule->{message}; $rule = $rule->{rule}; } # Used to convert a raw value to a code reference here, but as # it ended up as a closure and these don't serialize very # well, I pulled it out. Now try_switch has to check to see if # a rule is a literal value each time it's called. This # actually makes it faster for literal values, but a little # slower for code references. push @rules, { state => $fsa->{table}{$state}, rule => $rule, exec => $exec, message => $message, }; } $def->{rules} = \@rules; } else { $def->{rules} = []; } } # Handle any parameters. $self->start if $params->{start}; $self->done($params->{done}) if exists $params->{done}; $self->strict($params->{strict}) if exists $params->{strict}; return $self; } ############################################################################## =head1 Instance Interface =head2 Instance Methods =head3 start my $state = $fsa->start; Starts the state machine by setting the state to the first state defined in the call to C. If the machine is already in a state, an exception will be thrown. Returns the start state FSA::State object. =cut sub start { my $self = shift; my $fsa = $machines{$self}; $self->_croak( 'Cannot start machine because it is already running' ) if $fsa->{current}; my $state = $fsa->{ord}[0] or return $self; $self->curr_state($state); return $state; } ############################################################################## =head3 at $fsa->switch until $fsa->at('game_over'); Requires a statename. Returns false if the current machine state does not match the name. Otherwise, it returns the state. =cut sub at { my ($self, $name) = @_; $self->_croak("You must supply a state name") unless defined $name; my $fsa = $machines{$self}; $self->_croak(qq{No such state "$name"}) unless exists $fsa->{table}{$name}; my $state = $self->curr_state or return; return unless $state->name eq $name; return $state; } ############################################################################## =head3 curr_state my $curr_state = $fsa->curr_state; $fsa->curr_state($curr_state); Get or set the current FSA::State object. Pass a state name or object to set the state. Setting a new state will cause the C actions of the current state to be executed, if there is a current state, and then execute the C and C actions of the new state. Returns the new FSA::State object when setting the current state. =cut sub curr_state { my $self = shift; my $fsa = $machines{$self}; my $curr = $fsa->{current}; return $curr unless @_; my $state = shift; unless (ref $state) { my $name = $state; $state = $fsa->{table}{$name} or $self->_croak(qq{No such state "$name"}); } # Exit the current state. $curr->exit if $curr; # Run any switch actions. if (my $exec = delete $fsa->{exec}) { $_->($curr, $state) for @$exec; } # Push the new state onto the stack and cache the index. push @{$fsa->{stack}} => [$state->name => { result => undef, message => undef}]; push @{$states{$state}->{index}}, $#{$fsa->{stack}}; # Set the new state. $fsa->{current} = $state; $state->enter; $state->do; return $state; } ############################################################################## =head3 state Deprecated alias for C. This method will issue a warning and will be removed in a future version of FSA::Rules. Use C, instead. =cut sub state { require Carp; Carp::carp( 'The state() method has been deprecated. Use curr_state() instead' ); shift->curr_state(@_); } ############################################################################## =head3 prev_state my $prev_state = $fsa->prev_state; Returns the FSA::State object representing the previous state. This is useful in states where you need to know what state you came from, and can be very useful in "fail" states. =cut sub prev_state { my $self = shift; my $stacktrace = $self->raw_stacktrace; return unless @$stacktrace > 1; return $machines{$self}->{table}{$stacktrace->[-2][0]}; } ############################################################################## =head3 states my @states = $fsa->states; my $states = $fsa->states; my $state = $fsa->states($state_name); @states = $fsa->states(@state_names); $states = $fsa->states(@state_names); Called with no arguments, this method returns a list or array reference of all of the FSA::State objects that represent the states defined in the state machine. When called with a single state name, it returns the FSA::State object object for that state. When called with more than one state name arguments, it returns a list or array reference of those states. If called with any state names that did not exist in the original definition of the state machine, this method will C. =cut sub states { my $self = shift; my $fsa = $machines{$self}; return wantarray ? @{$fsa->{ord}} : $fsa->{ord} unless @_; if (my @errors = grep { not exists $fsa->{table}{$_} } @_) { $self->_croak("No such state(s) '@errors'"); } return $fsa->{table}{+shift} unless @_ > 1; return wantarray ? @{$fsa->{table}}{@_} : [ @{$fsa->{table}}{@_} ]; } ############################################################################## =head3 try_switch my $state = $fsa->try_switch; $state = $fsa->try_switch(@inputs); Checks the switch rules of the current state and switches to the first new state for which a rule returns a true value. The evaluation of switch rules short-circuits to switch to the first state for which a rule evaluates to a true value unless the C attribute is set to a true value. If is set to a true value, I rules will be evaluated, and if more than one returns a true statement, an exception will be thrown. This approach guarntees that every attempt to switch from one state to another will have one and only one possible destination state to which to switch, thus satisfying the DFA pattern. All arguments passed to C will be passed to the switch rule code references as inputs. If a switch rule evaluates to true and there are additional switch actions for that rule, these actions will be executed after the C actions of the current state (if there is one) but before the C actions of the new state. They will be passed the current state object and the new state object as arguments. Returns the FSA::State object representing the state to which it switched and C if it cannot switch to another state. =cut sub try_switch { my $self = shift; my $fsa = $machines{$self}; my $state = $fsa->{current}; # XXX Factor this out to the state class to evaluate the rules? my @rules = $state->_rules; my $next; while (my $rule = shift @rules) { my $code = $rule->{rule}; next unless ref $code eq 'CODE' ? $code->($state, @_) : $code; # Make sure that no other rules evaluate to true in strict mode. if (@rules && $self->strict) { if ( my @new = grep { my $c = $_->{rule}; ref $c eq 'CODE' ? $c->( $state, @_ ) : $c } @rules ) { $self->_croak( 'Attempt to switch from state "', $state->name, '"', ' improperly found multiple destination states: "', join('", "', map { $_->{state}->name } $rule, @new), '"' ); } } # We're good to go. $fsa->{exec} = $rule->{exec}; $state->message($rule->{message}) if defined $rule->{message}; $next = $self->curr_state($rule->{state}); last; } return $next; } ############################################################################## =head3 switch my $state = eval { $fsa->switch(@inputs) }; print "No can do" if $@; The fatal form of C. This method attempts to switch states and returns the FSA::State object on success and throws an exception on failure. =cut sub switch { my $self = shift; my $ret = $self->try_switch(@_); return $ret if defined $ret; $self->_croak( 'Cannot determine transition from state "', $machines{$self}->{current}->name, '"' ); } ############################################################################## =head3 done my $done = $fsa->done; $fsa->done($done); $fsa->done( sub {...} ); Get or set a value to indicate whether the engine is done running. Or set it to a code reference to have that code reference called each time C is called without arguments and have I return value returned. A code reference should expect the FSA::Rules object passed in as its only argument. Note that this varies from the pattern for state actions, which should expect the relevant FSA::State object to be passed as the argument. Call the C method on th FSA::Rules object if you want the current state in your C code reference. This method can be useful for checking to see if your state engine is done running, and calling C when it isn't. States can set it to a true value when they consider processing complete, or you can use a code reference that evaluates "done-ness" itself. Something like this: my $fsa = FSA::Rules->new( foo => { do => { $_[0]->machine->done(1) if ++$_[0]->{count} >= 5 }, rules => [ foo => 1 ], } ); Or this: my $fsa = FSA::Rules->new( foo => { do => { ++shift->machine->{count} }, rules => [ foo => 1 ], } ); $fsa->done( sub { shift->{count} >= 5 }); Then you can just run the state engine, checking C to find out when it's, uh, done. $fsa->start; $fsa->switch until $fsa->done; Although you could just use the C method if you wanted to do that. Note that C will be reset to C by a call to C when it's not a code reference. If it I a code reference, you need to be sure to write it in such a way that it knows that things have been reset (by examining states, for example, all of which will have been removed by C). =cut sub done { my $self = shift; my $fsa = $machines{$self}; if (@_) { $fsa->{done} = shift; return $self; } my $code = $fsa->{done}; return $code unless ref $code eq 'CODE'; return $code->($self); } ############################################################################## =head3 strict my $strict = $fsa->strict; $fsa->strict(1); Get or set the C attribute of the state machine. When set to true, the strict attribute disallows the short-circuiting of rules and allows a transfer if only one rule returns a true value. If more than one rule evaluates to true, an exception will be thrown. =cut sub strict { my $self = shift; return $machines{$self}->{strict} unless @_; $machines{$self}->{strict} = shift; return $self; } ############################################################################## =head3 run $fsa->run; This method starts the FSA::Rules engine (if it hasn't already been set to a state) by calling C, and then calls the C method repeatedly until C returns a true value. In other words, it's a convenient shortcut for: $fsa->start unless $self->curr_state; $fsa->switch until $self->done; But be careful when calling this method. If you have no failed switches between states and the states never set the C attribute to a true value, then this method will never die or return, but run forever. So plan carefully! Returns the FSA::Rules object. =cut sub run { my $self = shift; $self->start unless $self->curr_state; $self->switch until $self->done; return $self; } ############################################################################## =head3 reset $fsa->reset; The C method clears the stack and notes, sets the current state to C, and sets C to C (unless C is a code reference). Also clears any temporary data stored directly in the machine hash reference and the state hash references. Use this method when you want to reuse your state machine. Returns the DFA::Rules object. my $fsa = FSA::Rules->new(@state_machine); $fsa->done(sub {$done}); $fsa->run; # do a bunch of stuff $fsa->{miscellaneous} = 42; $fsa->reset->run; # $fsa->{miscellaneous} does not exist =cut sub reset { my $self = shift; my $fsa = $machines{$self}; $fsa->{current} = undef; $fsa->{notes} = {}; $fsa->{done} = undef unless ref $fsa->{done} eq 'CODE'; @{$fsa->{stack}} = (); for my $state ($self->states) { @{$states{$state}->{index}} = (); delete $state->{$_} for keys %$state; } delete $self->{$_} for keys %$self; return $self; } ############################################################################## =head3 notes $fsa->notes($key => $value); my $val = $fsa->notes($key); my $notes = $fsa->notes; The C method provides a place to store arbitrary data in the state machine, just in case you're not comfortable using the FSA::Rules object itself, which is an empty hash. Any data stored here persists for the lifetime of the state machine or until C is called. Conceptually, C contains a hash of key-value pairs. C<< $fsa->notes($key => $value) >> stores a new entry in this hash. C<< $fsa->notes->($key) >> returns a previously stored value. C<< $fsa->notes >>, called without arguments, returns a reference to the entire hash of key-value pairs. Returns the FSA::Rules object when setting a note value. =cut sub notes { my $self = shift; my $fsa = $machines{$self}; return $fsa->{notes} unless @_; my $key = shift; return $fsa->{notes}{$key} unless @_; $fsa->{notes}{$key} = shift; return $self; } ############################################################################## =head3 last_message my $message = $fsa->last_message; $message = $fsa->last_message($state_name); Returns the last message of the current state. Pass in the name of a state to get the last message for that state, instead. =cut sub last_message { my $self = shift; return $self->curr_state->message unless @_; return $self->states(@_)->message; } ############################################################################## =head3 last_result my $result = $fsa->last_result; $result = $fsa->last_result($state_name); Returns the last result of the current state. Pass in the name of a state to get the last result for that state, instead. =cut sub last_result { my $self = shift; return $self->curr_state->result unless @_; return $self->states(@_)->result; } ############################################################################## =head3 stack my $stack = $fsa->stack; Returns an array reference of all states the machine has been in since it was created or since C was last called, beginning with the first state and ending with the current state. No state name will be added to the stack until the machine has entered that state. This method is useful for debugging. =cut sub stack { my $self = shift; return [map { $_->[0] } @{$machines{$self}->{stack}}]; } ############################################################################## =head3 raw_stacktrace my $stacktrace = $fsa->raw_stacktrace; Similar to C, this method returns an array reference of the states that the machine has been in. Each state is an array reference with two elements. The first element is the name of the state and the second element is a hash reference with two keys, "result" and "message". These are set to the values (if used) set by the C and C methods on the corresponding FSA::State objects. A sample state: [ some_state, { result => 7, message => 'A human readable message' } ] =cut sub raw_stacktrace { $machines{shift()}->{stack} } ############################################################################## =head3 stacktrace my $trace = $fsa->stacktrace; Similar to C, except that the Cs and Cs are output in a human readable format with nicely formatted data (using Data::Dumper). Functionally there is no difference from C unless your states are storing references in their Cs or Cs For example, if your state machine ran for only three states, the output may resemble the following: State: foo { message => 'some message', result => 'a' } State: bar { message => 'another message', result => [0, 1, 2] } State: bar { message => 'and yet another message', result => 2 } =cut sub stacktrace { my $states = shift->raw_stacktrace; my $stacktrace = ''; require Data::Dumper; local $Data::Dumper::Terse = 1; local $Data::Dumper::Indent = 1; local $Data::Dumper::Quotekeys = 0; local $Data::Dumper::Sortkeys = 0; foreach my $state (@$states) { $stacktrace .= "State: $state->[0]\n"; $stacktrace .= Data::Dumper::Dumper($state->[1]); $stacktrace .= "\n"; } return $stacktrace; } ############################################################################## =head3 graph my $graph_viz = $fsa->graph(@graph_viz_args); $graph_viz = $fsa->graph(\%params, @graph_viz_args); Constructs and returns a L object useful for generating graphical representations of the complete rules engine. The parameters to C are all those supported by the GraphViz constructor; consult the L documentation for details. Each node in the graph represents a single state. The label for each node in the graph will be either the state label or if there is no lable, the state name. Each edge in the graph represents a rule that defines the relationship between two states. If a rule is specified as a hash reference, the C key will be used as the edge label; otherwise the label will be blank. An optional hash reference of parameters may be passed as the first argument to C. The supported parameters are: =over =item with_state_name This parameter, if set to true, prepends the name of the state and two newlines to the label for each node. If a state has no label, then the state name is simply used, regardless. Defaults to false. =item wrap_nodes =item wrap_node_labels This parameter, if set to true, will wrap the node label text. This can be useful if the label is long. The line length is determined by the C parameter. Defaults to false. =item wrap_edge_labels =item wrap_labels This parameter, if set to true, will wrap the edge text. This can be useful if the rule message is long. The line length is determined by the C parameter. Defaults to false C is deprecated and will be removed in a future version. =item text_wrap =item wrap_length The line length to use for wrapping text when C or C is set to true. C is deprecated and will be removed in a future version. Defaults to 25. =item node_params A hash reference of parameters to be passed to the GraphViz C method when seting up a state as a node. Only the C