[Subversion] / PEAK-Rules / Criteria.txt  

View of /PEAK-Rules/Criteria.txt

Parent Directory | Revision Log
Revision: 2737 - (download)
Sat Apr 4 03:15:22 2015 UTC (9 years ago) by pje
File size: 39046 byte(s)
More Python 3 repr fixes
================
Logical Criteria
================

In order to process arbitrary expression-based rules, PEAK-Rules needs to
"understand" the way that conditions logically relate to each other.  This
document describes the design (and tests the implementation) of its logical
criteria management.  You do not need to read this unless you are extending or
interfacing with this subsystem directly, or just want to understand how this
stuff actually works!

The most important ideas here are implication, intersection, and disjunctive
normal form.  But don't panic if you don't know what those terms mean!  They're
really quite simple.

Implication means that if one thing is true, then so is another.  A implies B
if B is always true whenever A is true.  It doesn't matter what B is when A is
not true, however.  It could be true or false, we don't care.  Implication is
important for prioritizing which rules are "more specific" than others.

Intersection just means that both things have to be true for a condition to
be true - it's like the "and" of two conditions.  But rather than performing
an actual "and", we're creating a *new condition* that will only be true when
the two original conditions would be true.

And finally, disjunctive normal form (DNF) means "an OR of ANDs".  For example,
this expression is in DNF::

    (A and C) or (B and C) or (A and D) or (B and D)

But this equivalent expression is **not** in DNF::

    (A or B) and (C or D)

The criteria used to define generic function methods are likely to look more
like this, than they are to be in disjunctive normal form.  Therefore, we must
convert them in order to implement the Chambers & Chen dispatch algorithm
correctly (see `Indexing.txt`_).

.. _Indexing.txt: http://peak.telecommunity.com/DevCenter/PEAK-Rules/Indexing

We do this using the ``DisjunctionSet`` and ``OrElse`` classes to represent
overall expressions (sets or sequences of "ors"), and the ``Signature`` and
``Conjunction`` classes to represent sequences or sets of "and"-ed conditions.

Within a ``Signature``, the things that are "and"-ed together are a sequence
of ``Test`` instances.  A ``Test`` pairs a "dispatch expression" with a
"criterion".  For example, this expression::

    isinstance(x, Y)

would be represented internally as a ``Test`` instance like this::

    Test(IsInstance(Local('x')), Class(Y))

``Conjunction`` instances, on the other hand, are used to "and" together
criteria that apply to the same dispatch expression.  For example, this
expression::

    isinstance(x, Y) and isinstance(x, Z)

would be represented internally like this::

    Test(IsInstance(Local('x')), Conjunction([Class(Y), Class(Z)]))

The rest of this document describes how predicates, signatures, tests, dispatch
expressions, and criteria work together to create expressions in disjunctive
normal form, and whose implication of other expressions can be determined.

The basic logical functions we will use are ``implies()``, ``intersect()``,
``disjuncts()``, and ``negate()``, all of which are defined in
``peak.rules.core``::

    >>> from peak.rules.core import implies, intersect, disjuncts, negate, long



----------------------------------------
Boolean Conditions and Logical Operators
----------------------------------------

The most fundamental conditions are simply ``True`` and ``False``.  ``True``
represents a rule that *always* applies, while ``False`` represents a rule that
*never* applies.  Therefore, the result of intersecting ``True`` and any other
object, always returns that object, while intersecting ``False`` with any other
object returns ``False``::

    >>> intersect(False, False)
    False
    >>> intersect(False, True)
    False
    >>> intersect(True, False)
    False
    >>> intersect(True, True)
    True

    >>> intersect(object(), True)
    <object object at ...>

    >>> intersect(True, object())
    <object object at ...>

    >>> intersect(object(), False)
    False

    >>> intersect(False, object())
    False

Because ``True`` means "condition that always applies", *everything* implies
``True``, but ``True`` only implies itself::

    >>> implies(object(), True)
    True

    >>> implies(True, object())
    False

    >>> implies(True, True)
    True

On the other hand, because ``False`` means "condition that never applies",
``False`` implies *everything*.  (Because if you start from a false premise,
you can arrive at any conclusion!)::

    >>> implies(False, True)
    True

    >>> implies(False, object())
    True

However, no condition other than ``False`` can ever imply ``False``, because
all other conditions can *sometimes* apply::

    >>> implies(object(), False)
    False

    >>> implies(True, False)
    False

    >>> implies(False, False)
    True

Notice, by the way, a few important differences between ``implies()`` and
``intersect()``.  ``implies()`` *always* returns a boolean value, ``True`` or
``False``, because it's an immediate answer to the question of, "does the
second condition always apply if the first condition applies?"

``intersect()``, on the other hand, returns a *condition* that will always be
true when the original conditions apply.  So, if it returns a boolean value,
that's just an indication that the intersection of the two input conditions
would always apply or never apply.  Also, ``intersect()`` is logically
symmetrical, in that it doesn't matter what order the arguments are in, whereas
the order is critically important for ``implies()``.

However, ``intersect()`` methods must be order *preserving*, because the order
in which logical "and" operations occur is important.  Consider, for example,
the condition ``"y!=0 and z>x/y"``, in which it would be a bad thing to skip
the zero check before the division!

So, as we will see later on, when working with more complex conditions,
``intersect()`` methods must ensure that the subparts of the output condition
are in the same relative order as they were in the input.

(Also, note that in general, when you intersect two conditions, if one condition
implies the other, the result of the intersection is the implying condition.
This general rule greatly simplifies the implementation of most intersect
operations, since as long as there is an implication relationship defined
between conditions, many common cases of intersection can be handled
automatically.)

In contrast to both ``implies()`` and ``intersects()``, the ``disjuncts()``
function takes only a single argument, and returns a list of the "disjuncts"
(or-ed-together conditions) of its argument.  More precisely, it returns a list
of conditions that each imply the original condition.  That is, if any of the
disjuncts were true, then the original condition would also be true.

Thus, the ``disjuncts()`` of an arbitrary object will normally be a list
containing just that object::

    >>> disjuncts(object())
    [<object object at ...>]

    >>> disjuncts(True)
    [True]

But ``False`` is a special case; ``False`` has *no* disjuncts, since no other
condition can ever imply ``False``::

    >>> disjuncts(False)
    []

As a result, "or"-ing ``False`` with other conditions will simply remove the
``False`` from the resulting predicate, and conditions that can never be true
are not used for indexing or dispatching.

Another special case is tuples containing nested tuples::

    >>> disjuncts( (float, (int, str)) )
    [(<... 'float'>, <... 'int'>),
     (<... 'float'>, <... 'str'>)] 

    >>> disjuncts( ((int, str), object) )
    [(<... 'int'>, <... 'object'>),
     (<... 'str'>, <... 'object'>)] 

    >>> disjuncts( (object, (int, str), float) )
    [(<... 'object'>, <... 'int'>, <... 'float'>),
     (<... 'object'>, <... 'str'>, <... 'float'>)] 

    >>> disjuncts( ((int, str), (int, str)) )
    [(<... 'int'>, <... 'int'>),
     (<... 'str'>, <... 'int'>),
     (<... 'int'>, <... 'str'>),
     (<... 'str'>, <... 'str'>)] 

This lets you avoid writing lots of decorators for the cases where you want
more than one type (or ``istype()`` instance) to match in a given argument
position.  (As you can see, it's equivalent to specifying all the individual
combinations of specified types.)

Finally, the ``negate()`` function inverts the truth of a condition, e.g.::

    >>> negate(True)
    False

    >>> negate(False)
    True

Of course, it also applies to criteria other than pure boolean values, as we'll
see in the upcoming sections.


-------------------
"Criterion" Objects
-------------------

A criterion object describes a set of possible values for a dispatch
expression. There are several criterion types supplied with PEAK-Rules, but you
can also add your own, as long as they can be tested for implication with
``implies()``, and intersected with ``intersect()``.  (And if they represent an
"or" of sub-criteria, they should be able to provide their list of
``disjuncts()``.  They'll also need to be indexable, but more on that later in
other documents!)


"And"-ed Criteria
=================

Sometimes, more than one criterion is applied to the same dispatch expression.
For example in the expression ``x is not y and x is not z``, two criteria are
being applied to the identity of ``x``.  To represent this, we need a way to
represent a set of "and-ed" criteria.  ``peak.rules.criteria`` provides a base
class for this, called ``Conjunction``::

    >>> from peak.rules.criteria import Conjunction

This class is a subclass of ``frozenset``, but has a few additional features.
First, a ``Conjunction`` never contains redundant (implied) items.
For example, the conjunction of the classes ``object`` and ``int`` is ``int``,
because ``int`` already implies ``object``::

    >>> Conjunction([int, object])
    <... 'int'>

    >>> Conjunction([object, int])
    <... 'int'>

Notice also that instead of getting back a set with one member, we got back the
item that would have been in the set.  This helps to simplify the expression
structure.  As a further simplification, creating an empty conjunction returns
``True``, because "no conditions required" is the same as "always true"::

    >>> Conjunction([])
    True

A conjunction implies a condition, if any condition in the conjunction
implies the other condition::

    >>> implies(Conjunction([str, int]), str)
    True
    >>> implies(Conjunction([str, int]), int)
    True
    >>> implies(Conjunction([str, int]), object)
    True
    >>> implies(Conjunction([str, int]), float)
    False

A condition implies a conjunction, however, only if the condition implies
every part of the conjunction::

    >>> class a: pass
    >>> class b: pass
    >>> class c(a,b): pass
    >>> class d(a, int): pass

    >>> implies(c, Conjunction([a, b]))
    True
    >>> implies(a, Conjunction([a, b]))
    False

    >>> implies(Conjunction([c,d]), Conjunction([a, int]))
    True
    >>> implies(Conjunction([c, int]), Conjunction([a, int]))
    True
    >>> implies(Conjunction([a, int]), Conjunction([c, int]))
    False

(By the way, on a more sophisticated level of reasoning, you could say that
``Conjunction([str, int])`` should have equalled ``False`` above, since
there's no way for an object to be both an ``int`` and a ``str`` at the same
time.  But that would be an excursion into semantics and outside the bounds of
what PEAK-Rules can "reason" about using only logical implication as defined by
the ``implies()`` generic function.)

``Conjunction`` objects can be intersected with one another, or with
additional conditions, and the result is another ``Conjunction`` of the
same type as the leftmost set.  So, if we use subclasses of our own, the result
of intersecting them will be a conjunction of the correct subclass::

    >>> class MySet(Conjunction): pass

    >>> type(intersect(MySet([int, str]), float))
    <class 'MySet'>
    
    >>> intersect(MySet([int, str]), float) == MySet([int, str, float])
    True

    >>> intersect(float, MySet([int, str])) == MySet([float, int, str])
    True

    >>> intersect(MySet([d, c]), MySet([int, str])) == MySet([d,c,str])
    True

If you want to ensure that all items in a set are of appropriate type or value,
you can override ``__init__`` to do the checking, and raise an appropriate
error.  PEAK-Rules does this for its specialized conjunction classes, but uses
``if __debug__:`` and ``assert`` statements to avoid the extra overhead when
run with ``python -O``.  You may wish to do the same for your subclasses.


"Or"-ed Criteria
================

The ``DisjunctionSet`` and ``OrElse`` classes are used to represent sets and
sequences of "or"-ed criteria::

    >>> from peak.rules.criteria import DisjunctionSet, OrElse

Both types automatically exclude redundant (i.e. more-specific) criteria, and
can never contain less than 2 entries.  For example, "or"-ing ``object`` and
``int`` always returns ``object``, because ``object`` is implied by ``int``::

    >>> DisjunctionSet([int, object])
    <... 'object'>

    >>> DisjunctionSet([object, int])
    <... 'object'>

    >>> OrElse([int, object])
    <... 'object'>

    >>> OrElse([object, int])
    <... 'object'>

Notice that instead of getting back a set or sequence with one member, we got
back the item that would have been in the set.  This helps to simplify the
expression structure.  As a further simplification, creating an empty
disjunction returns ``False``, because "no conditions are sufficient" is the
same as "always false"::

    >>> DisjunctionSet([])
    False

    >>> OrElse([])
    False

In addition to eliminating redundancy, disjunction *sets* also flatten any
nested disjunctions::

    >>> DisjunctionSet([DisjunctionSet([1, 2]), DisjunctionSet([3, 4])])
    DisjunctionSet([1, 2, 3, 4])

This is because it uses the ``disjuncts()`` generic function to determine
whether any of the items it was given are "or"-ed conditions of some kind.  And
the ``disjuncts()`` of a ``DisjunctionSet`` are a list of its contents::

    >>> disjuncts(DisjunctionSet([1, 2, 3, 4]))
    [1, 2, 3, 4]

But ``OrElse`` sequences do not do this flattening, in order to avoid imposing
an arbitrary sequence on their contents::

    >>> OrElse([DisjunctionSet([1, 2]), DisjunctionSet([3, 4])])
    OrElse([DisjunctionSet([1, 2]), DisjunctionSet([3, 4])])

(The ``disjuncts()`` of an ``OrElse`` are much more complicated, as the
disjuncts of a Python expression like ``"a or b or c"`` reduce to ``"a"``,
``"(not a) and b"``, and ``"(not a and not b) and c"``!  We'll talk more about
this later, in the section on `Predicates`_ below.)

A disjunction only implies a condition if *all* conditions in the disjunction
imply the other condition::

    >>> implies(DisjunctionSet([str, int]), str)
    False
    >>> implies(DisjunctionSet([str, int]), int)
    False
    >>> implies(DisjunctionSet([str, int]), float)
    False
    >>> implies(DisjunctionSet([str, int]), object)
    True

    >>> implies(OrElse([str, int]), str)
    False
    >>> implies(OrElse([str, int]), int)
    False
    >>> implies(OrElse([str, int]), float)
    False
    >>> implies(OrElse([str, int]), object)
    True

A condition implies a disjunction, however, if the condition implies any part
of the disjunction::

    >>> class a: pass
    >>> class b: pass
    >>> class c(a,b): pass
    >>> class d(a, int): pass

    >>> implies(c, DisjunctionSet([a, b]))
    True
    >>> implies(a, DisjunctionSet([a, b]))
    True
    >>> implies(a, DisjunctionSet([int, str]))
    False
    >>> implies(DisjunctionSet([c,d]), DisjunctionSet([a, int]))
    True
    >>> implies(DisjunctionSet([c,int]), DisjunctionSet([a, int]))
    True

    >>> implies(DisjunctionSet([c, int]), True)
    True
    >>> implies(False, DisjunctionSet([c, int]))
    True

    >>> implies(c, OrElse([a, b]))
    True
    >>> implies(a, OrElse([a, b]))
    True
    >>> implies(a, OrElse([int, str]))
    False
    >>> implies(OrElse([c,d]), OrElse([a, int]))
    True
    >>> implies(OrElse([c,int]), OrElse([a, int]))
    True

    >>> implies(OrElse([c, int]), True)
    True
    >>> implies(False, OrElse([c, int]))
    True

The intersection of a disjunction and any other object is a disjunction
containing the intersection of that object with the original disjunctions'
contents.  In other words::

    >>> int_or_str = DisjunctionSet([int, str])
    >>> long_or_float = DisjunctionSet([long, float])

    >>> intersect(int_or_str, float) == DisjunctionSet([
    ...     Conjunction([int, float]), Conjunction([str, float])
    ... ])
    True

    >>> intersect(long, int_or_str) == DisjunctionSet([
    ...     Conjunction([long, int]), Conjunction([long, str])
    ... ])
    True

    >>> intersect(int_or_str, long_or_float) == DisjunctionSet([
    ...     Conjunction([int,long]), Conjunction([int, float]),
    ...     Conjunction([str,long]), Conjunction([str, float]),
    ... ])
    True

    >>> intersect(int_or_str, Conjunction([long, float])) == \
    ...     DisjunctionSet(
    ...         [Conjunction([int, long, float]),
    ...          Conjunction([str, long, float])]
    ... )
    True

    >>> intersect(Conjunction([int, str]), long_or_float) == \
    ...     DisjunctionSet(
    ...         [Conjunction([int, str, long]), Conjunction([int, str, float])]
    ...     )
    True

As you can see, this is the heart of the process that allows expressions like
``(A or B) and (C or D)`` to be transformed into their disjunctive normal
form (i.e. ``(A and C) or (A and D) or (B and C) or (B and D)``).

(In other words, by using ``Disjunction()`` as an "or" operator and
``intersect()`` as the "and" operator, we always end up with a DNF result!)


Object Identity
===============

The ``IsObject`` criterion type represents the set of objects which either
are -- or are *not* -- one specific object instance.  ``IsObject(x)`` (or
``IsObject(x, True)``) represents the set of objects ``y`` for which the
``y is x`` condition would be true.  Conversely, ``IsObject(x, False)``
represents the set of objects ``y`` for whom ``y is not x``::

    >>> from peak.rules.criteria import IsObject, Conjunction

    >>> o = object()
    >>> is_o = IsObject(o)
    >>> is_not_o = IsObject(o, False)

    >>> is_o
    IsObject(<object object at ...>, True)

    >>> is_not_o
    IsObject(<object object at ...>, False)

    >>> is_not_o == negate(is_o)
    True

    >>> is_o == negate(is_not_o)
    True

The intersection of two different ``is`` identities is ``False``, since an
object cannot be both itself and another object::

    >>> intersect(is_o, IsObject("foo"))
    False

    >>> implies(is_o, IsObject("foo"))
    False

Similarly, an object can't be both itself, and not itself::

    >>> intersect(is_o, is_not_o)
    False

    >>> intersect(is_not_o, is_o)
    False

    >>> implies(is_o, is_not_o)
    False

But it *can* be itself and itself::

    >>> intersect(is_o, is_o) == is_o
    True

    >>> implies(is_o, is_o)
    True

Or not itself and not itself::

    >>> intersect(is_not_o, is_not_o) == is_not_o
    True

    >>> implies(is_not_o, is_not_o)
    True

And an object can be itself, while not being something else::

    >>> intersect(is_o, IsObject("foo", False)) == is_o
    True

    >>> intersect(IsObject("foo", False), is_o) == is_o
    True

    >>> implies(is_o, IsObject("foo", False))
    True

But just because an object is not something, doesn't mean it's something else::

    >>> implies(is_not_o, IsObject("foo"))
    False

And the intersection of multiple ``is not`` conditions produces a
``Conjunction``::

    >>> not_foo = IsObject("foo", False)
    >>> not_bar = IsObject("bar", False)
    >>> not_foobar = intersect(not_foo, not_bar)
    >>> not_foobar
    Conjunction([IsObject('foo', False), IsObject('bar', False)])

Which of course then implies each of the individual "not" conditions::

    >>> implies(not_foobar, not_bar)
    True
    >>> implies(not_foobar, not_foo)
    True

But not their opposites::

    >>> implies(not_foobar, IsObject("bar"))
    False

Oh, and an ``is`` condition implies any ``Conjunction`` that don't contain its
opposite::

    >>> implies(is_o, not_foobar)
    True

But not the other way around::

    >>> implies(not_foobar, is_o)
    False

Finally, negating a ``Conjunction`` of is-nots returns a disjunction of true
``IsObject`` tests, and vice versa::

    >>> negate(not_foobar)
    DisjunctionSet([IsObject('foo', True), IsObject('bar', True)])

    >>> negate(DisjunctionSet([IsObject('foo'), IsObject('bar')]))
    Conjunction([IsObject('foo', False), IsObject('bar', False)])


Values and Ranges
=================

``Value`` objects are used to represent ``==`` and ``!=`` comparisons.
``Value(x)`` represents ``==x`` and ``Value(x, False)`` represents ``!=x``.

A ``Value`` implies another ``Value`` if the two are identical::

    >>> from peak.rules.criteria import Value, Range, Min, Max

    >>> implies(Value(27), Value(42))
    False
    >>> implies(Value(27, False), Value(42))
    False
    >>> implies(Value(27), Value(27))
    True
    >>> implies(Value(99), Value(99, False))
    False
    >>> implies(Value(99, False), Value(99, False))
    True    

Or, if they have different target values, but the first is an ``==``
comparison, and the second is a ``!=`` comparison::

    >>> implies(Value(27), Value(99, False))
    True

    >>> intersect(Value(27), Value(99, False))
    Value(27, True)

The negation of a ``Value`` is of course another ``Value`` of the same
target but the reverse operator::

    >>> negate(Value(27))
    Value(27, False)

    >>> negate(Value(99, False))
    Value(99, True)

The intersection of two different ``==`` values, or a ``!=`` and ``==`` of the
same value, is ``False`` (i.e., no possible match::

    >>> intersect(Value(27), Value(42))
    False
    >>> intersect(Value(27), Value(27, False))
    False

But the intersection of two different ``!=`` values produces a disjunction of
three ``Range()`` objects::

    >>> one_two = intersect(Value(1, False), Value(2, False))

    >>> one_two == DisjunctionSet([
    ...     Range((Min, -1), (1, -1)),
    ...     Range((1, 1), (2, -1)),
    ...     Range((2, 1), (Max, 1))
    ... ])
    True

    >>> intersect(one_two, Value(3,False)) == DisjunctionSet([
    ...     Range((Min, -1), (1, -1)),
    ...     Range((1, 1), (2, -1)),
    ...     Range((2, 1), (3, -1)),
    ...     Range((3, 1), (Max, 1))
    ... ])
    True
    

The ``Range()`` criterion type represents an inequality such as ``lo < x < hi``
or ``x >= lo``.  The lows and highs given have to be a 2-tuple, consisting of
a value and a "direction".  The direction is an integer (either -1 or 1) that
indicates whether the edge is on the low or high side of the target value.
Thus, a tuple ``(27, -1)`` means "the low edge of 27", while ``(99, 1)``
means "the high edge of 99".  In this way, any simple inequality or range
can be represented by a pair of edges.

Thus, the intersection of two different ``!=`` values produces a disjunction of
three ``Range()`` objects, representing the intervals that "surround" the
original ``!=`` values::

    >>> from peak.rules.criteria import Range

    >>> intersect(Value(27, False), Value(42, False)) == DisjunctionSet([
    ...     Range(hi=(27, -1)),     # below Min ... below 27
    ...     Range((27,1), (42,-1)), # above  27 ... below 42
    ...     Range(lo=(42, 1)),      # above  42 ... above Max
    ... ])
    True

Notice that if we omit the ``hi`` or ``lo``, end of the range, it's replaced
with "below ``Min``" or "above ``Max``", as appropriate.  (The ``Min`` and
``Max`` values are special objects that compare below or above any other
object.)

When creating range and value objects, it can be useful to use the
``Inequality`` constructor, which takes a comparison operator and a value::

    >>> from peak.rules.criteria import Inequality

    >>> Inequality('>=', 27)    # >=27 : below 27 ... above Max
    Range((27, -1), (Max, 1))

    >>> negate(Inequality('<', 27))
    Range((27, -1), (Max, 1))
    
    >>> Inequality('>', 27)     # > 27 : above 27 ... above Max
    Range((27, 1), (Max, 1))

    >>> Inequality('<', 99)     # < 99 : below Min ... below 99
    Range((Min, -1), (99, -1))

    >>> Inequality('<=',99)     # <=99 : below Min ... above 99
    Range((Min, -1), (99, 1))

    >>> negate(Inequality('>', 99))
    Range((Min, -1), (99, 1))

    >>> Inequality('==', 66)
    Value(66, True)

    >>> Inequality('!=', 77)
    Value(77, False)

Intersecting two ranges (or a range and an ``==`` value) produces a smaller
range or value, or ``False`` if there is no overlap::

    >>> intersect(Inequality('<', 27), Inequality(">",19))
    Range((19, 1), (27, -1))

    >>> intersect(Inequality('>=', 27), Inequality("<=",19))
    False

    >>> intersect(Value(27), Inequality('>=', 27))
    Value(27, True)    
    >>> intersect(Inequality('<=', 27), Value(27))
    Value(27, True)

    >>> intersect(Value(27), Inequality('<',27))
    False
    >>> intersect(Inequality('>',27), Value(27))
    False    
    
Last, but not least, a range (or value) implies another range or value if it
lies entirely within it::

    >>> implies(Range((42,-1), (42,1)), Value(42))
    True

    >>> implies(Range((27,-1), (42,1)), Range((15,1),(99,-1)))
    True

    >>> implies(Range((27,-1), (42,1)), Value(99, False))
    True

But not if it overlaps or lies outside of it::

    >>> implies(Range((15,-1),(42,1)), Range((15,1),(99,-1)))
    False

    >>> implies(Range((27,-1), (42,1)), Value(99))
    False


Class/Type Tests
================

``Class`` objects represent ``issubclass()`` or ``isinstance()`` sets.
``Class(x)`` is a instance/subclass match, while ``Class(x, False)`` is a
non-match.  Implication, negation, and intersection are defined accordingly::

    >>> from peak.rules.criteria import Class

    >>> implies(Class(int), Class(object))
    True
    >>> implies(Class(object, False), Class(int, False))
    True

    >>> negate(Class(int))
    Class(<... 'int'>, False)

    >>> negate(Class(object, False))
    Class(<... 'object'>, True)
    
    >>> implies(Class(int), Class(str))
    False
    >>> implies(Class(object), Class(int, False))
    False
    >>> implies(Class(object), Class(int))
    False
    >>> implies(Class(int), Class(int))
    True

    >>> intersect(Class(int), Class(object))
    Class(<... 'int'>, True)

    >>> intersect(Class(object), Class(int))
    Class(<... 'int'>, True)

The intersection of two or more unrelated ``Class`` criteria is represented by
a ``Conjunction``::

    >>> from peak.rules.criteria import Conjunction

    >>> intersect(Class(int, False), Class(str, False)) == Conjunction(
    ...     [Class(int, False), Class(str, False)]
    ... )
    True


Exact-Type and Type-Exclusion Tests
===================================

Exact type tests are expressed using ``istype(x)``, and type exclusion tests
are represented as ``istype(x, False)``::

    >>> from peak.rules import istype

    >>> negate(istype(int))
    istype(<... 'int'>, False)

    >>> negate(istype(object, False))
    istype(<... 'object'>, True)

One ``istype()`` test implies another only if they're equal::

    >>> implies(istype(int), istype(int))
    True

    >>> implies(istype(int, False), istype(int, False))
    True

    >>> implies(istype(int, False), istype(int))
    False

Or if the first is an exact match, and the second is an exclusion test for
a different type::

    >>> implies(istype(int), istype(str, False))
    True

Thus, the intersection of two ``istype()`` tests will be either one of the
input tests, or ``False`` (meaning no overlap)::

    >>> intersect(istype(int), istype(int))
    istype(<... 'int'>, True)

    >>> intersect(istype(int), istype(str, False))
    istype(<... 'int'>, True)

    >>> intersect(istype(int, False), istype(int, False))
    istype(<... 'int'>, False)

    >>> intersect(istype(int), istype(str))
    False

Unless both are exclusion tests on different types, in which case their
intersection is a ``Conjunction`` of the two::

    >>> intersect(istype(str, False), istype(int, False)) == Conjunction([
    ...     istype(int, False), istype(str, False)
    ... ])
    True

An ``istype(x)`` implies ``Class(y)`` only if x is y or a subtype thereof::

    >>> implies(istype(int), Class(str))
    False

    >>> implies(istype(int), Class(object))
    True

And it implies ``Class(y, False)`` only if x is *not* y or a subtype thereof::

    >>> implies(istype(int), Class(str, False))
    True

    >>> implies(istype(int), Class(object, False))
    False

But ``istype(x, False)`` implies nothing about any ``Class`` test, since it
refers to exactly one type, while the ``Class`` may refer to infinitely many
types::

    >>> implies(istype(int, False), Class(int, False))
    False

    >>> implies(istype(int, False), Class(object))
    False

Meanwhile, ``Class(x)`` tests can only imply ``istype(y, False)``, where y
is a *superclass* of x::

    >>> implies(Class(int), istype(int))
    False

    >>> implies(Class(int), istype(object))
    False    

    >>> implies(Class(int), istype(object, False))
    True

And ``Class(x, False)`` cannot imply anything about any ``istype()`` test,
whether true or false::

    >>> implies(Class(int, False), istype(int))
    False

    >>> implies(Class(int, False), istype(int, False))
    False

When ``Class()`` is intersected with an exact type test, the result is either
the exact type test, or ``False``::

    >>> intersect(Class(int), istype(int))
    istype(<... 'int'>, True)

    >>> intersect(istype(int), Class(int))
    istype(<... 'int'>, True)

    >>> intersect(Class(int), istype(object))
    False

    >>> intersect(istype(object), Class(int))
    False

    >>> intersect(Class(int, False), istype(object))
    istype(<... 'object'>, True)

    >>> intersect(istype(object), Class(int, False))
    istype(<... 'object'>, True)

But when it's intersected with a type exclusion test, the result is a
``Conjunction``::

    >>> intersect(istype(int, False), Class(str)) == Conjunction([
    ...     istype(int, False), Class(str, True)
    ... ])
    True

    >>> s = intersect(Class(str), istype(int, False))
    >>> s == Conjunction([istype(int, False), Class(str, True)])
    True

    >>> intersect(s, istype(int))
    False

    >>> intersect(s, istype(int, False)) == s
    True

    >>> intersect(s, istype(str))
    istype(<... 'str'>, True)


--------------------
Tests and Signatures
--------------------


``Test`` Objects
================

A ``Test`` is the combination of a "dispatch expression" and a criterion to
be applied to it::

    >>> from peak.rules.criteria import Test
    >>> x_isa_int = Test("x", Class(int))

(Note that although these examples use strings, actual dispatch expressions
will be AST-like structures.)

Creating a test with disjunct criteria actually returns a set of tests::

    >>> Test("x", DisjunctionSet([int, str])) == \
    ...           DisjunctionSet([Test('x', int), Test('x', str)])
    True

So the ``disjuncts()`` of a test will always just be the test itself::

    >>> disjuncts(x_isa_int)
    [Test('x', Class(<... 'int'>, True))]

Negating a test usually just negates its criterion, leaving the expression
intact::

    >>> negate(x_isa_int)
    Test('x', Class(<... 'int'>, False))

But if the test criterion is a conjunction or range, negating it can produce
a disjunction of tests::

    >>> negate(
    ...     Test('x',
    ...         Conjunction([IsObject('foo',False), IsObject('bar',False)])
    ...     )
    ... ) == DisjunctionSet(
    ...     [Test('x', IsObject('foo', True)),
    ...      Test('x', IsObject('bar', True))])
    True

Intersecting two tests for the same dispatch expression returns a test whose
criterion is the intersection of the original tests' criteria::

    >>> intersect(x_isa_int, Test("x", Class(str))) == Test(
    ...     'x', Conjunction([Class(int), Class(str)])
    ... )
    True

And similarly, a test only implies another test if they have equal dispatch
expressions, and the second test's criterion is implied by the first's::

    >>> implies(x_isa_int, Test("x", Class(str)))
    False
    >>> implies(x_isa_int, Test("x", Class(object)))
    True
    >>> implies(x_isa_int, Test("y", Class(int)))
    False

But the intersection of two tests with *different* dispatch expressions
produces a ``Signature`` object::

    >>> y_isa_str = Test("y", Class(str))
    >>> x_int_y_str = intersect(x_isa_int, y_isa_str)
    >>> x_int_y_str
    Signature([Test('x', Class(...int...)), Test('y', Class(...str...))])


``Signature`` Objects
=====================

``Signature`` objects are similar to ``Conjunction`` objects, except for three
important differences.

First, signatures are sequences, not sets.  They preserve the ordering they
were created with::

    >>> intersect(x_isa_int, y_isa_str)
    Signature([Test('x', Class(...int...)), Test('y', Class(...str...))])

    >>> intersect(y_isa_str, x_isa_int)
    Signature([Test('y', Class(...str...)), Test('x', Class(...int...))])

and their negations preserve the order as well (using ``OrElse`` instances)::

    >>> negate(intersect(x_isa_int, y_isa_str))
    OrElse([Test('x', Class(...int..., False)),
            Test('y', Class(...str..., False))])

    >>> negate(intersect(y_isa_str, x_isa_int))
    OrElse([Test('y', Class(...str..., False)),
            Test('x', Class(...int..., False))])

Second, signatures can only contain ``Test`` instances, and they automatically
``intersect()`` any tests that apply to the same dispatch signatures::

    >>> from peak.rules.criteria import Signature

    >>> intersect(x_int_y_str, Test("y", Class(float))) == Signature([
    ...     Test('x', Class(int)),
    ...     Test('y', Conjunction([Class(str), Class(float)]))
    ... ])
    True

    >>> intersect(x_int_y_str, Test("x", Class(float))) == Signature([
    ...     Test('x', Conjunction([Class(int), Class(float)])),
    ...     Test('y', Class(str))
    ... ])
    True

    >>> intersect(Test("x", Class(float)), x_int_y_str) == Signature([
    ...     Test('x', Conjunction([Class(int), Class(float)])),
    ...     Test('y', Class(str))
    ... ])
    True

But, as with conjunctions, you can't create a signature with less than two
items in it::

    >>> Signature([Test("x",1)])
    Test('x', 1)

    >>> Signature([True])
    True

    >>> Signature([False])
    False   

    >>> Signature([])
    True


----------
Predicates
----------

Now that we've got all the basic pieces in place, we can now operationally
define predicates for the Chambers & Chen dispatch algorithm.

Specifically, a predicate can be any of the following:

* ``True`` (meaning a condition that always applies)

* ``False`` (meaning a condition that *never* applies)

* A ``Test`` or ``Signature`` instance

* A ``DisjunctionSet`` or ``OrElse`` containing two or more ``Test`` or
  ``Signature`` instances

In each case, invoking ``disjuncts()`` on the object in question will return
a list of objects suitable for constructing dispatch "cases" -- i.e., sets of
simple "and-ed" criteria that can easily be indexed.

The ``tests_for()`` function can then be used to yield the component tests of
each case signature.  When called on a ``Test``, it yields the given test::

    >>> from peak.rules.criteria import tests_for

    >>> list(tests_for(Test('y',42)))
    [Test('y', 42)]
    
But called on a ``Signature``, it yields the tests contained within::
    
    >>> list(tests_for(x_int_y_str))
    [Test('x', Class(...int...)), Test('y', Class(...str...))]

And called on ``True``, it yields nothing::

    >>> list(tests_for(True))
    []

``tests_for(False)``, however, is undefined, because ``False`` cannot be
represented as a conjunction of tests.  ``False`` is still a valid predicate,
of course, because it represents an empty disjunction.

In normal predicate processing, one loops over the ``disjuncts()`` of a
predicate, and only then uses ``tests_for()`` to inspect the individual items.
But since ``disjuncts(False)`` is an empty list, it should never be necessary
to invoke ``tests_for(False)``.

There is an important distinction, however, in how ``disjuncts()`` works on
``OrElse`` objects, compared to all other kinds of predicates.  ``disjuncts()``
is used to obtain the *unordered* disjunctions of a logical condition, but
``OrElse`` is *ordered*, because it represents a series of applications of the
Python "or" operator.

In Python, a condition on the right-hand side of an "or" operator is not tested
*unless the condition on the left is false*.  PEAK-Rules, however, tests the
``disjuncts()`` of a predicate *independently*.  Thus, in order to properly
translate "or" conditions in a predicate, the ``disjuncts()`` of an ``OrElse``
must include additional and-ed conditions to force them to be tested in order.

Specifically, the ``disjuncts()`` of ``OrElse([a, b, c])`` will be:

  * ``a``,
  * ``intersect(negate(a), b)``, and
  * ``intersect(intersect(negate(a), negate(b)), c)``!

This expansion ensures that ``b`` will never be tested unless ``a`` is false,
and ``c`` will never be tested unless ``a`` and ``b`` are both false, just like
in a regular Python expression.  Observe::

    >>> DisjunctionSet([OrElse([Class(a), Class(b)])]) == DisjunctionSet([
    ...     Class(a, True),
    ...     Conjunction([Class(a, False), Class(b, True)])
    ... ])
    True
    
Also, because ``OrElse`` objects don't expand their contents' disjuncts at
creation time, they must be expanded as part of the ``disjuncts()`` operation::

    >>> a_or_b = DisjunctionSet([Class(a), Class(b)])

    >>> try:
    ...     set = set
    ... except NameError:
    ...     from sets import Set as set     # 2.3, ugh

    >>> set(disjuncts(OrElse([istype(int), a_or_b]))) == set([
    ...     istype(int),
    ...     Conjunction([istype(int, False), Class(b)]),
    ...     Conjunction([istype(int, False), Class(a)])
    ... ])
    True

This delayed expansion "preserves the unorderedness" of the contents, by not
forcing them to be evaluated in any specific sequence, apart from the
requirements imposed by their position within the ``OrElse``.

We'll do one more test, to show that the disjuncts of the negated portions of
the ``OrElse`` are also expanded::

    >>> a_and_b = Conjunction([Class(a), Class(b)])
    >>> not_a = Class(a, False)
    >>> not_b = Class(b, False)
    >>> int_or_str = DisjunctionSet([Class(int), Class(str)])

    >>> set(disjuncts(OrElse([a_and_b, int_or_str]))) == set([
    ...     a_and_b, Conjunction([not_a, Class(int)]), Conjunction([not_a, Class(str)]),
    ...              Conjunction([not_b, Class(int)]), Conjunction([not_b, Class(str)])
    ... ])
    True

per this expansion logic (using ``|`` for "symmetric or")::

    (a and b) or (int|str) => (a and b) | not (a and b) and (int|str)

    not (a and b) and (int|str) => (not a | not b) and (int|str)

    (not a | not b) and (int|str) => (
        (not a and int) | (not a and str) | (not b and int) | (not b and str)
    )


cvs-admin@eby-sarna.com

Powered by ViewCVS 1.0-dev

ViewCVS and CVS Help