1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355
|
[;1m sofs[0m
Functions for manipulating sets of sets.
This module provides operations on finite sets and relations
represented as sets. Intuitively, a set is a collection of
elements; every element belongs to the set, and the set contains
every element.
The data representing [;;4msofs[0m as used by this module is to be
regarded as opaque by other modules. In abstract terms, the
representation is a composite type of existing Erlang terms. See
note on data types. Any code assuming knowledge of the format is
running on thin ice.
Given a set A and a sentence S(x), where x is a free variable, a
new set B whose elements are exactly those elements of A for which
S(x) holds can be formed, this is denoted B = {x in A : S(x)}.
Sentences are expressed using the logical operators "for some" (or
"there exists"), "for all", "and", "or", "not". If the existence
of a set containing all the specified elements is known (as is
always the case in this module), this is denoted B = {x : S(x)}.
• The unordered set containing the elements a, b, and c is
denoted {a, b, c}. This notation is not to be confused with
tuples.
The ordered pair of a and b, with first coordinate a and
second coordinate b, is denoted (a, b). An ordered pair is
an ordered set of two elements. In this module, ordered
sets can contain one, two, or more elements, and parentheses
are used to enclose the elements.
Unordered sets and ordered sets are orthogonal, again in
this module; there is no unordered set equal to any ordered
set.
• The empty set contains no elements.
Set A is equal to set B if they contain the same elements,
which is denoted A = B. Two ordered sets are equal if they
contain the same number of elements and have equal elements
at each coordinate.
Set B is a subset of set A if A contains all elements that
B contains.
The union of two sets A and B is the smallest set that
contains all elements of A and all elements of B.
The intersection of two sets A and B is the set that
contains all elements of A that belong to B.
Two sets are disjoint if their intersection is the empty
set.
The difference of two sets A and B is the set that
contains all elements of A that do not belong to B.
The symmetric difference of two sets is the set that
contains those element that belong to either of the two
sets, but not both.
The union of a collection of sets is the smallest set that
contains all the elements that belong to at least one set of
the collection.
The intersection of a non-empty collection of sets is the
set that contains all elements that belong to every set of
the collection.
• The Cartesian product of two sets X and Y, denoted X × Y,
is the set {a : a = (x, y) for some x in X and for some
y in Y}.
A relation is a subset of X × Y. Let R be a relation. The
fact that (x, y) belongs to R is written as x R y. As
relations are sets, the definitions of the last item
(subset, union, and so on) apply to relations as well.
The domain of R is the set {x : x R y for some y in Y}.
The range of R is the set {y : x R y for some x in X}.
The converse of R is the set {a : a = (y, x) for some
(x, y) in R}.
If A is a subset of X, the image of A under R is the set
{y : x R y for some x in A}. If B is a subset of Y, the
inverse image of B is the set {x : x R y for some y in B}.
If R is a relation from X to Y, and S is a relation from Y
to Z, the relative product of R and S is the relation T
from X to Z defined so that x T z if and only if there
exists an element y in Y such that x R y and y S z.
The restriction of R to A is the set S defined so that
x S y if and only if there exists an element x in A such
that x R y.
If S is a restriction of R to A, then R is an extension of
S to X.
If X = Y, then R is called a relation in X.
The field of a relation R in X is the union of the domain
of R and the range of R.
If R is a relation in X, and if S is defined so that x S y
if x R y and not x = y, then S is the strict relation
corresponding to
○ Conversely, if S is a relation in X, and if R is
defined so that x R y if x S y or x = y, then R is the
weak relation corresponding to S.
A relation R in X is reflexive if x R x for every element
x of X, it is symmetric if x R y implies that y R x, and
it is transitive if x R y and y R z imply that x R z.
• function F is a relation, a subset of X × Y, such that the
domain of F is equal to X and such that for every x in X
there is a unique element y in Y with (x, y) in F. The
latter condition can be formulated as follows: if x F y and
x F z, then y = z. In this module, it is not required that
the domain of F is equal to X for a relation to be
considered a function.
Instead of writing (x, y) in F or x F y, we write F(x) = y
when F is a function, and say that F maps x onto y, or that
the value of F at x is y.
As functions are relations, the definitions of the last item
(domain, range, and so on) apply to functions as well.
If the converse of a function F is a function F', then F' is
called the inverse of F.
The relative product of two functions F1 and F2 is called
the composite of F1 and F2 if the range of F1 is a subset
of the domain of F2.
• Sometimes, when the range of a function is more important
than the function itself, the function is called a family.
The domain of a family is called the index set, and the
range is called the indexed set.
If x is a family from I to X, then x[i] denotes the value of
the function at index i. The notation "a family in X" is
used for such a family.
When the indexed set is a set of subsets of a set X, we call
x a family of subsets of X.
If x is a family of subsets of X, the union of the range of
x is called the union of the family x.
If x is non-empty (the index set is non-empty), the
intersection of the family x is the intersection of the
range of x.
In this module, the only families that are considered are
families of subsets of some set X; in the following, the
word "family" is used for such families of subsets.
• partition of a set X is a collection S of non-empty subsets
of X whose union is X and whose elements are pairwise
disjoint.
A relation in a set is an equivalence relation if it is
reflexive, symmetric, and transitive.
If R is an equivalence relation in X, and x is an element of
X, the equivalence class of x with respect to R is the set
of all those elements y of X for which x R y holds. The
equivalence classes constitute a partitioning of X.
Conversely, if C is a partition of X, the relation that
holds for any two elements of X if they belong to the same
equivalence class, is an equivalence relation induced by the
partition C.
If R is an equivalence relation in X, the canonical map is
the function that maps every element of X onto its
equivalence class.
• Relations as defined above (as sets of ordered pairs) are
from now on referred to as binary relations.
We call a set of ordered sets (x[1], ..., x[n]) an (n-ary)
relation, and say that the relation is a subset of the
Cartesian product X[1] × ... × X[n], where x[i] is an
element of X[i], 1 <= i <= n.
The projection of an n-ary relation R onto coordinate i is
the set {x[i] : (x[1], ..., x[i], ..., x[n]) in R for some
x[j] in X[j], 1 <= j <= n and not i = j}. The projections of
a binary relation R onto the first and second coordinates
are the domain and the range of R, respectively.
The relative product of binary relations can be generalized
to n-ary relations as follows. Let TR be an ordered set
(R[1], ..., R[n]) of binary relations from X to Y[i] and S a
binary relation from (Y[1] × ... × Y[n]) to Z. The relative
product of TR and S is the binary relation T from X to Z
defined so that x T z if and only if there exists an element
y[i] in Y[i] for each 1 <= i <= n such that x R[i] y[i] and
(y[1], ..., y[n]) S z. Now let TR be a an ordered set
(R[1], ..., R[n]) of binary relations from X[i] to Y[i] and
S a subset of X[1] × ... × X[n]. The multiple relative
product of TR and S is defined to be the set {z : z =
((x[1], ..., x[n]), (y[1],...,y[n])) for some
(x[1], ..., x[n]) in S and for some (x[i], y[i]) in R[i],
1 <= i <= n}.
The natural join of an n-ary relation R and an m-ary
relation S on coordinate i and j is defined to be the set
{z : z = (x[1], ..., x[n],
y[1], ..., y[j-1], y[j+1], ..., y[m]) for some
(x[1], ..., x[n]) in R and for some (y[1], ..., y[m]) in S
such that x[i] = y[j]}.
• The sets recognized by this module are represented by
elements of the relation Sets, which is defined as the
smallest set such that:
○ For every atom T, except '_', and for every term X,
(T, X) belongs to Sets (atomic sets).
○ (['_'], []) belongs to Sets (the untyped empty set).
○ For every tuple T = {T[1], ..., T[n]} and for every
tuple X = {X[1], ..., X[n]}, if (T[i], X[i]) belongs
to Sets for every 1 <= i <= n, then (T, X) belongs to
Sets (ordered sets).
○ For every term T, if X is the empty list or a
non-empty sorted list [X[1], ..., X[n]] without
duplicates such that (T, X[i]) belongs to Sets for
every 1 <= i <= n, then ([T], X) belongs to Sets (
typed unordered sets).
An external set is an element of the range of Sets.
A type is an element of the domain of Sets.
If S is an element (T, X) of Sets, then T is a valid type
of X, T is the type of S, and X is the external set of S. [;;4m[0m
[;;4mfrom_term/2[0m creates a set from a type and an Erlang term
turned into an external set.
The sets represented by Sets are the elements of the range
of function Set from Sets to Erlang terms and sets of Erlang
terms:
○ Set(T,Term) = Term, where T is an atom
○ Set({T[1], ..., T[n]}, {X[1], ..., X[n]}) =
(Set(T[1], X[1]), ..., Set(T[n], X[n]))
○ Set([T], [X[1], ..., X[n]]) =
{Set(T, X[1]), ..., Set(T, X[n])}
○ Set([T], []) = {}
When there is no risk of confusion, elements of Sets are
identified with the sets they represent. For example, if U
is the result of calling [;;4munion/2[0m with S1 and S2 as
arguments, then U is said to be the union of S1 and S2. A
more precise formulation is that Set(U) is the union of
Set(S1) and Set(S2).
The types are used to implement the various conditions that sets
must fulfill. As an example, consider the relative product of two
sets R and S, and recall that the relative product of R and S is
defined if R is a binary relation to Y and S is a binary relation
from Y. The function that implements the relative product, [;;4m[0m
[;;4mrelative_product/2[0m, checks that the arguments represent binary
relations by matching [{A,B}] against the type of the first
argument (Arg1 say), and [{C,D}] against the type of the second
argument (Arg2 say). The fact that [{A,B}] matches the type of
Arg1 is to be interpreted as Arg1 representing a binary relation
from X to Y, where X is defined as all sets Set(x) for some
element x in Sets the type of which is A, and similarly for Y. In
the same way Arg2 is interpreted as representing a binary relation
from W to
• Finally it is checked that B matches C, which is sufficient
to ensure that W is equal to Y. The untyped empty set is
handled separately: its type, ['_'], matches the type of any
unordered set.
A few functions of this module ([;;4mdrestriction/3[0m, [;;4m[0m
[;;4mfamily_projection/2[0m, [;;4mpartition/2[0m, [;;4mpartition_family/2[0m, [;;4m[0m
[;;4mprojection/2[0m, [;;4mrestriction/3[0m, [;;4msubstitution/2[0m) accept an Erlang
function as a means to modify each element of a given unordered
set. Such a function, called SetFun in the following, can be
specified as a functional object (fun), a tuple [;;4m{external, Fun}[0m,
or an integer:
• If SetFun is specified as a fun, the fun is applied to each
element of the given set and the return value is assumed to
be a set.
• If SetFun is specified as a tuple [;;4m{external, Fun}[0m, Fun is
applied to the external set of each element of the given set
and the return value is assumed to be an external set.
Selecting the elements of an unordered set as external sets
and assembling a new unordered set from a list of external
sets is in the present implementation more efficient than
modifying each element as a set. However, this optimization
can only be used when the elements of the unordered set are
atomic or ordered sets. It must also be the case that the
type of the elements matches some clause of Fun (the type of
the created set is the result of applying Fun to the type of
the given set), and that Fun does nothing but selecting,
duplicating, or rearranging parts of the elements.
• Specifying a SetFun as an integer I is equivalent to
specifying [;;4m{external, fun(X) -> element(I, X) end}[0m, but is
to be preferred, as it makes it possible to handle this case
even more efficiently.
Examples of SetFuns:
fun sofs:union/1
fun(S) -> sofs:partition(1, S) end
{external, fun(A) -> A end}
{external, fun({A,_,C}) -> {C,A} end}
{external, fun({_,{_,C}}) -> C end}
{external, fun({_,{_,{_,E}=C}}) -> {E,{E,C}} end}
2
The order in which a SetFun is applied to the elements of an
unordered set is not specified, and can change in future versions
of this module.
The execution time of the functions of this module is dominated by
the time it takes to sort lists. When no sorting is needed, the
execution time is in the worst case proportional to the sum of the
sizes of the input arguments and the returned value. A few
functions execute in constant time: [;;4mfrom_external/2[0m, [;;4m[0m
[;;4mis_empty_set/1[0m, [;;4mis_set/1[0m, [;;4mis_sofs_set/1[0m, [;;4mto_external/1[0m [;;4m[0m
[;;4mtype/1[0m.
The functions of this module exit the process with a [;;4mbadarg[0m, [;;4m[0m
[;;4mbad_function[0m, or [;;4mtype_mismatch[0m message when given badly formed
arguments or sets the types of which are not compatible.
When comparing external sets, operator [;;4m==/2[0m is used.
[;1mSee Also[0m
[;;4mdict[0m, [;;4mdigraph[0m, [;;4morddict[0m, [;;4mordsets[0m, [;;4msets[0m
|