1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141
|
(*
Copyright (c) 2000
Cambridge University Technical Services Limited
This library is free software; you can redistribute it and/or
modify it under the terms of the GNU Lesser General Public
License as published by the Free Software Foundation; either
version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public
License along with this library; if not, write to the Free Software
Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
*)
(*
Title: Generate interpretable code for Poly system from the code tree.
Author: Dave Matthews, Cambridge University Computer Laboratory
Copyright Cambridge University 1985
*)
(* This code-generator is primarily intended as a porting aid from one
machine-code to another. It is modelled on the VAX code-generator and has
a similar structure. Note: It does not use the use-count information on
declarations. *)
functor INTGCODE (
(*****************************************************************************)
(* CODECONS *)
(*****************************************************************************)
structure CODECONS :
sig
type machineWord = Address.machineWord;
type address = Address.address;
type code;
type opcode;
eqtype addrs; (*hacky! *)
type labels;
val noJump: labels;
val jumpFalse : opcode;
val jump : opcode;
val setHandler : opcode;
val delHandler : opcode;
val addrPlus : addrs * int -> addrs;
val addrMinus : addrs * addrs -> int;
val codeCreate: bool * string * Universal.universal list -> code; (* makes the initial segment. *)
(* ic - Address for the next instruction in the segment. *)
val ic: code -> addrs;
(* putBytes : puts "length" bytes of "val" into locations "addr", "addr"+1 *)
val putBytes : int * int * addrs * code -> unit;
(* GEN- routines all put a value at the instruction counter and add
an appropriate amount to it. *)
(* genWord - Puts 2 bytes. *)
val genWord : int * code -> unit;
(* gen... - put instructions and their operands. *)
val genCallClosure : code -> unit;
val genRaiseEx : code -> unit;
val genLock : code -> unit;
val genLdexc : code -> unit;
val genPushHandler : code -> unit;
val genReturn : int * code -> unit;
val genGetStore : int * code -> unit;
val genLocal : int * code -> unit;
val genIndirect : int * code -> unit;
val genMoveToVec : int * code -> unit;
val genSetStackVal : int * code -> unit;
val genCase : int * code -> unit;
val genTuple : int * code -> unit;
val genTailCall : int * int * code -> unit;
val genNonLocal : int * int * int * code -> unit;
(* genEnter instructions are only needed when machine-code routines
can call interpreted routines or vice-versa. The enterInt instruction
causes the interpreter to be entered and the argument indicates the
reason. *)
val genEnterIntCatch : code -> unit;
val genEnterIntProc : code * int -> unit;
val genEnterIntCall : code * int -> unit;
(* pushConst - Generates code to push a constant. *)
val pushConst : machineWord * code -> unit;
(* genRecRef - Recursive reference to a function. *)
val genRecRef: code * code -> unit
(* Create a container on the stack *)
val genContainer : int * code -> unit;
(* Create a tuple from a container. *)
val genTupleFromContainer : int * code -> unit;
(* copyCode - Finish up after compiling a function. *)
val copyCode : code -> address;
(* getBytes - gets "length" bytes from locations "addr", "addr"+1...
Returns a negative number if the first bit was set. *)
val getBytes: int * addrs * code -> int;
(* putBranchInstruction puts in an instruction which involves
a forward reference. *)
val putBranchInstruction: opcode * code -> labels;
(* Instruction to delete a handler and skip round it. *)
val fixup: labels * code -> unit; (* Fix up a forward reference. *)
val linkLabels: labels * labels * code -> labels; (* Link label lists. *)
val jumpback: addrs * code -> unit; (* Backwards jump. *)
val resetStack: int * bool * code -> unit; (* Set a pending reset *)
val alignOffWord: code * int -> unit; (* Add a pad byte if the value would
be word-aligned. *)
end (* CODECONS *);
(*****************************************************************************)
(* MISC *)
(*****************************************************************************)
structure MISC :
sig
exception InternalError of string;
end;
structure BACKENDTREE: BackendIntermediateCodeSig
) :
(*****************************************************************************)
(* GCODE export signature *)
(*****************************************************************************)
sig
type backendIC
type machineWord
val gencode:
backendIC * Universal.universal list * int -> ((unit -> machineWord) * Universal.universal list)
structure Sharing: sig type backendIC = backendIC end
end =
let
(*****************************************************************************)
(* STRETCHARRAY *)
(*****************************************************************************)
structure STRETCHARRAY :
sig
type 'a stretchArray;
val stretchArray : int * '_a -> '_a stretchArray;
val update : '_a stretchArray * int * '_a -> unit;
val sub : 'a stretchArray * int -> 'a;
end = StretchArray;
in
(*****************************************************************************)
(* GCODE functor body *)
(*****************************************************************************)
struct
open CODECONS;
open Address;
open BACKENDTREE;
open MISC;
open RuntimeCalls; (* for POLY_SYS numbers *)
(* gets a value from the run-time system;
usually this is a closure, but sometimes it's an int. *)
val ioOp : int -> machineWord = RunCall.run_call1 POLY_SYS_io_operation;
(* minor HACKS *)
fun forLoop f i n = if i > n then () else (f i; forLoop f (i + 1) n);
fun apply _ [] = () | apply f (h::t) = (f h; apply f t);
val word0 = toMachineWord 0;
val DummyValue : machineWord = word0; (* used as result of "raise e" etc. *)
type caseForm =
{
cases : (backendIC * word) list,
test : backendIC,
caseType: caseType,
default : backendIC
}
(* Where the result, if any, should go *)
datatype whereto =
NoResult (* discard result *)
| ToStack (* Need a result but it can stay on the pseudo-stack *);
(* Are we at the end of the function. *)
datatype tail =
EndOfProc
| NotEnd;
datatype slValue =
Address of int * int (* Address of an entry on the stack. *)
| Recursive of code (* A recursive reference to a closure. *)
(* Code generate a function or global declaration *)
fun codegen
(pt : backendIC,
cvec : code,
loadStaticLink : int * int -> slValue,
numOfArgs: int, parameters) : address =
let
val initTrans = 5; (* Initial size of tables. *)
datatype decEntry =
StackAddr of int
| Empty;
val decVec : decEntry STRETCHARRAY.stretchArray =
STRETCHARRAY.stretchArray (initTrans, Empty);
(* Count of number of items on the stack. *)
val realstackptr = ref 1; (* The static-link/closure ptr is already there *)
(* Exited - set to true if we have jumped out. *)
val exited = ref false;
(* Push a value onto the stack. For the moment this merely involves
adding corresponding values to the real and pseudo-stacks. *)
fun incsp () : unit = realstackptr := !realstackptr + 1;
(* An entry has been removed from the stack. *)
fun decsp () : unit= realstackptr := !realstackptr - 1;
(* Pushes a local or non-local stack value. *)
fun pushStackValue (addr : int, level : int) : unit =
let
val () =
if level > 0
then (* Non-local *)
genNonLocal(!realstackptr, level, addr, cvec)
else (* Locals and references to the closure. *)
genLocal(!realstackptr + addr, cvec);
in
incsp ()
end;
(* Loads a local, argument or closure value; translating local
stack addresses to real stack offsets. *)
fun locaddr(BICLoadArgument locn, _) = pushStackValue (numOfArgs-locn, 0)
| locaddr(BICLoadLocal locn, _) =
(
(* positive address - on the stack. *)
case STRETCHARRAY.sub (decVec, locn) of
StackAddr n => pushStackValue (~ n, 0)
| _ => (* Should be on the stack, not a function. *)
raise InternalError "locaddr: bad stack address"
)
| locaddr(BICLoadClosure locn, _) =
( (* closure-pointer relative *)
case loadStaticLink (locn+1, 1) of
Recursive code =>
( genRecRef (code, cvec); incsp () )
| Address (addr, level) => pushStackValue (addr, level)
)
| locaddr(BICLoadRecursive, _) =
(
case loadStaticLink (0, 1) of
Recursive code =>
( genRecRef (code, cvec); incsp () )
| Address (addr, level) => pushStackValue (addr, level)
)
(* generates code from the tree *)
fun gencde (pt : backendIC, whereto : whereto, tailKind : tail, loopAddr) : unit =
let
(* Save the stack pointer value here. We may want to reset the stack. *)
val oldsp = !realstackptr;
val () =
case pt of
BICEval evl =>
genEval (evl, tailKind)
| BICExtract ext =>
(* This may just be being used to discard a value which isn't
used on this branch. *)
if whereto = NoResult then ()
else locaddr ext
| BICField {base, offset} =>
let
val () = gencde (base, ToStack, NotEnd, loopAddr);
in
genIndirect (offset, cvec)
end
| BICLambda lam =>
genProc (lam, false, fn () => ())
| BICConstnt(w, _) =>
let
val () = pushConst (w, cvec);
in
incsp ()
end
| BICCond (testPart, thenPart, elsePart) =>
genCond (testPart, thenPart, elsePart, whereto, tailKind, loopAddr)
| BICNewenv(decls, exp) =>
let
(* Processes a list of entries. *)
(* Mutually recursive declarations. May be either lambdas or constants. Recurse down
the list pushing the addresses of the closure vectors, then unwind the
recursion and fill them in. *)
fun genMutualDecs [] = ()
| genMutualDecs ({lambda, addr, ...} :: otherDecs) =
genProc (lambda, true,
fn() =>
(
STRETCHARRAY.update (decVec, addr, StackAddr (! realstackptr));
genMutualDecs (otherDecs)
))
fun codeDecls(BICRecDecs dl) = genMutualDecs (dl)
| codeDecls(BICDeclar{value, addr, ...}) =
(
gencde (value, ToStack, NotEnd, loopAddr);
STRETCHARRAY.update (decVec, addr, StackAddr(!realstackptr))
)
| codeDecls(BICNullBinding exp) = gencde (exp, NoResult, NotEnd, loopAddr)
in
List.app codeDecls decls;
gencde (exp, whereto, tailKind, loopAddr)
end
| BICBeginLoop {loop=body, arguments} =>
(* Execute the body which will contain at least one Loop instruction.
There will also be path(s) which don't contain Loops and these
will drop through. *)
let
val args = List.map #1 arguments
(* Evaluate each of the arguments, pushing the result onto the stack. *)
fun genLoopArg ({addr, value, ...}) =
(
gencde (value, ToStack, NotEnd, loopAddr);
STRETCHARRAY.update (decVec, addr, StackAddr (!realstackptr));
!realstackptr (* Return the posn on the stack. *)
)
val argIndexList = map genLoopArg args;
val startSp = ! realstackptr; (* Remember the current top of stack. *)
val startLoop : addrs = ic cvec; (* Start of loop *)
in
(* Process the body, passing the jump-back address down for the Loop instruction(s). *)
gencde (body, whereto, tailKind, SOME(startLoop, startSp, argIndexList))
(* Leave the arguments on the stack. They can be cleared later if needed. *)
end
| BICLoop argList => (* Jump back to the enclosing BeginLoop. *)
let
val (startLoop, startSp, argIndexList) =
case loopAddr of
SOME l => l
| NONE => raise InternalError "No BeginLoop for Loop instr"
(* Evaluate the arguments. First push them to the stack because evaluating
an argument may depend on the current value of others. Only when we've
evaluated all of them can we overwrite the original argument positions. *)
fun loadArgs ([], []) = !realstackptr - startSp (* The offset of all the args. *)
| loadArgs (arg:: argList, _ :: argIndexList) =
let
(* Evaluate all the arguments. *)
val () = gencde (arg, ToStack, NotEnd, NONE);
val argOffset = loadArgs(argList, argIndexList);
in
genSetStackVal(argOffset, cvec); (* Copy the arg over. *)
decsp(); (* The argument has now been popped. *)
argOffset
end
| loadArgs _ = raise InternalError "loadArgs: Mismatched arguments";
val _: int = loadArgs(List.map #1 argList, argIndexList)
in
if !realstackptr <> startSp
then resetStack (!realstackptr - startSp, false, cvec) (* Remove any local variables. *)
else ();
(* Jump back to the start of the loop. *)
jumpback (startLoop, cvec)
end
| BICRaise exp =>
let
val () = gencde (exp, ToStack, NotEnd, loopAddr);
val () = genRaiseEx cvec;
in
exited := true
end
| BICHandle {exp, handler} =>
let
type handler = labels;
(* Save old handler *)
val () = genPushHandler cvec;
val () = incsp ();
val genTag =
putBranchInstruction (setHandler, cvec) before incsp()
val handlerList : handler list = [genTag]
(* Code generate the body; "NotEnd" because we have to come back
to remove the handler; "ToStack" because delHandler needs
a result to carry down. *)
val () = gencde (exp, ToStack, NotEnd, loopAddr);
(* Now get out of the handler and restore the old one. *)
val skipHandler : labels = putBranchInstruction (delHandler, cvec);
(* Now process the handler itself. First we have to reset the stack.
Note that we have to use "ToStack" again to be consistent with
the stack-handling in the body-part. If we actually wanted "NoResult",
the stack adjustment code at the end of gencde will take care
of this. This means that I don't want to do any clever "end-of-function"
optimisation either. SPF 6/1/97
The exception handler must be aligned to an odd word boundary
so that the garbage collector does not get confused when it
finds the address on the stack. Values on full-word
boundaries always point to the start of objects whereas
values off full-word boundaries (either return addresses or
catch-phrases) point into code-segments.
*)
val () = realstackptr := oldsp;
val () = exited := false;
val () = alignOffWord (cvec, 0);
val () = apply (fn handlerLab => fixup (handlerLab, cvec)) handlerList;
(* If we were executing machine code we must re-enter the interpreter. *)
val () = genEnterIntCatch cvec;
val () = gencde (handler, ToStack, NotEnd, loopAddr);
(* Finally fix-up the jump around the handler *)
val () = fixup (skipHandler, cvec);
in
exited := false
end
| BICLdexc =>
let
(* Get the name of the exception. *)
val () = genLdexc cvec
in
incsp ()
end
| BICCase (cas as {cases, caseType, ...}) =>
let
val numberOfCases = List.length cases;
val (isExhaustive, min:int, max:int) =
case caseType of
CaseTag max => (true, 0, Word.toInt max)
| _ =>
let
val (_, aLabel) = hd cases
fun foldCases((_, w), (min, max)) = (Word.min(w, min), Word.max(w, max))
val (min, max) = List.foldl foldCases (aLabel, aLabel) cases
in
(false, Word.toInt min, Word.toInt max)
end
in
if 3 * numberOfCases < max - min + 5
then genSparseCase(cas, whereto, tailKind, loopAddr)
else genDenseCase (cas, whereto, tailKind, loopAddr)
end
| BICTuple recList =>
let
(* Move the fields into the vector. *)
fun loadItems [] = ()
| loadItems (v :: vs) =
let
val () = gencde (v, ToStack, NotEnd, loopAddr);
in
loadItems vs
end;
val size : int = List.length recList;
val () = loadItems recList;
val () = genTuple (size, cvec);
in
realstackptr := !realstackptr - (size - 1)
end
(* Containers are supposed to be implemented on the stack for efficiency.
Unfortunately they cause problems in this code because we assume
that if we generate a value to the stack it only occupies a single
word. Rather than change this code and risk breaking something I'm
simply generating them as mutable tuples on the heap. *)
| BICContainer size =>
(* Reserve a number of words on the stack for use as a tuple on the
stack. The result is the address of this space. *)
(
genContainer(size, cvec); (* Push the address of this container. *)
incsp() (* Pushes a single word. *)
)
| BICSetContainer{container, tuple, filter} =>
(* Copy the contents of a tuple into a container. If the tuple is a
Tuple instruction we can avoid generating the tuple and then
unpacking it and simply copy the fields that make up the tuple
directly into the container. *)
(
case tuple of
BICTuple cl =>
(* Simply set the container from the values. *)
let
(* Load the address of the container. *)
val _ = gencde (container, ToStack, NotEnd, loopAddr);
fun setValues([], _, _) = ()
| setValues(v::tl, sourceOffset, destOffset) =
if sourceOffset < BoolVector.length filter andalso BoolVector.sub(filter, sourceOffset)
then
(
gencde (v, ToStack, NotEnd, loopAddr);
(* Move the entry into the container. This instruction
pops the value to be moved but not the destination. *)
genMoveToVec(destOffset, cvec);
decsp();
setValues(tl, sourceOffset+1, destOffset+1)
)
else setValues(tl, sourceOffset+1, destOffset)
in
setValues(cl, 0, 0)
(* The container address is still on the stack. *)
end
| _ =>
let (* General case. *)
(* First the target tuple, then the container. *)
val () = gencde (tuple, ToStack, NotEnd, loopAddr)
val () = gencde (container, ToStack, NotEnd, loopAddr)
val last = BoolVector.foldli(fn (i, true, _) => i | (_, false, n) => n) ~1 filter
fun copy (sourceOffset, destOffset) =
if BoolVector.sub(filter, sourceOffset)
then
(
(* Duplicate the tuple address . *)
genLocal(1, cvec);
genIndirect(sourceOffset, cvec);
genMoveToVec(destOffset, cvec);
if sourceOffset = last
then ()
else copy (sourceOffset+1, destOffset+1)
)
else copy(sourceOffset+1, destOffset)
in
copy (0, 0)
(* The container and tuple addresses are still on the stack. *)
end
)
| BICTagTest { test, tag, ... } =>
let
(* Convert this into a simple equality function. *)
val code =
BICEval {
function = BICConstnt(ioOp POLY_SYS_word_eq, []),
argList=[(test, GeneralType), (BICConstnt(toMachineWord tag, []), GeneralType)],
resultType=GeneralType }
in
gencde (code, whereto, tailKind, loopAddr)
end
| BICKillItems { expression, ... } =>
(* This is inserted by the higher level code to get the use-counts
correct. Kill entries are BICExtract entries with lastRef true. *)
gencde (expression, whereto, tailKind, loopAddr)
in (* body of gencde *)
(* This ensures that there is precisely one item on the stack if
whereto = ToStack and no items if whereto = NoResult.
There are two points to note carefully here:
(1) Negative stack adjustments are legal if we have exited.
This is because matchFailFn can cut the stack back too
far for its immediately enclosing expression. This is
harmless because the code actually exits that expression.
(2) A stack adjustment of ~1 is legal if we're generating
a declaration in "ToStack" mode, because not all declarations
actually generate the dummy value that we expect. This
used to be handled in resetStack itself, but it's more
transparent to do it here. (In addition, there was a bug in
resetStack - it accumulated the stack resets, but didn't
correctly accumulate these "~1" dummy value pushes.)
It's all much better now.
SPF 9/1/97
*)
case whereto of
ToStack =>
let
val newsp = oldsp + 1;
val adjustment = !realstackptr - newsp;
val () =
if !exited orelse adjustment = 0
then ()
else if adjustment < ~1
then raise InternalError ("gencde: bad adjustment " ^ Int.toString adjustment)
(* Hack for declarations that should push values, but don't *)
else if adjustment = ~1
then pushConst (DummyValue, cvec)
else
resetStack (adjustment, true, cvec);
in
realstackptr := newsp
end
| NoResult =>
let
val adjustment = !realstackptr - oldsp;
val () =
if !exited orelse adjustment = 0
then ()
else if adjustment < 0
then raise InternalError ("gencde: bad adjustment " ^ Int.toString adjustment)
else
resetStack (adjustment, false, cvec);
in
realstackptr := oldsp
end
end (* gencde *)
(* doNext is only used for mutually recursive functions where a
function may not be able to fill in its closure if it does not have
all the remaining declarations. *)
(* TODO: This always creates the closure on the heap even when makeClosure is false. *)
and genProc (lam as { closure=[], ...}, mutualDecs, doNext: unit -> unit) : unit =
let
val newCode : code = codeCreate(false, #name lam, parameters);
(* The only global references are recursive ones (?) *)
fun loadRecLink _ = Recursive newCode
(* Code-gen function. No non-local references. *)
val res : address =
codegen (#body lam, newCode, loadRecLink, List.length (#argLifetimes lam), parameters);
val () = pushConst(toMachineWord res, cvec);
val () = incsp();
in
if mutualDecs then doNext () else ()
end
| genProc (lam, mutualDecs: bool, doNext: unit -> unit) : unit =
let (* Full closure required. *)
(* If there is a closure we must fetch all non-local references
from it. *)
(* Loads the value from the closure. *)
fun loadSl (addr : int, level : int) : slValue =
if addr = 0
then (* recursive - return address of closure. *)
Address (~1, level - 1)
else (* Return an entry in the closure. *)
Address (addr, level);
val newCode : code = codeCreate (true, #name lam, parameters);
(* Code-gen function. *)
val res : address =
codegen (#body lam, newCode, loadSl, List.length (#argLifetimes lam), parameters);
val sizeOfClosure = List.length (#closure lam) + 1;
in
if mutualDecs
then let (* Have to make the closure now and fill it in later. *)
val () = genGetStore (sizeOfClosure, cvec);
val () = incsp ();
(* Put code address into closure *)
val () = pushConst(toMachineWord res, cvec);
val () = genMoveToVec(0, cvec);
val entryAddr : int = !realstackptr;
val () = doNext (); (* Any mutually recursive functions. *)
(* Push the address of the vector - If we have processed other
closures the vector will no longer be on the top of the stack. *)
val () = pushStackValue (~ entryAddr, 0);
(* Load items for the closure. *)
fun loadItems ([], _) = ()
| loadItems (v :: vs, addr : int) =
let
(* Generate an item and move it into the vector *)
val () = gencde (v, ToStack, NotEnd, NONE);
val () = genMoveToVec(addr, cvec);
val () = decsp ();
in
loadItems (vs, addr + 1)
end;
val () = loadItems (#closure lam, 1);
val () = genLock cvec; (* Lock it. *)
(* Remove the extra reference. *)
val () = resetStack (1, false, cvec);
in
realstackptr := !realstackptr - 1
end
else let
(* Put it on the stack. *)
val () = pushConst (toMachineWord res, cvec);
val () = incsp ();
val () =
apply (fn (pt: backendIC) => gencde (pt, ToStack, NotEnd, NONE)) (#closure lam);
val () = genTuple (sizeOfClosure, cvec);
in
realstackptr := !realstackptr - (sizeOfClosure - 1)
end
end
and genCond (first: backendIC,
second: backendIC,
third: backendIC,
whereto: whereto,
tailKind: tail,
loopAddr) : unit =
let
val () = gencde (first, ToStack, NotEnd, loopAddr);
val toElse : labels = putBranchInstruction(jumpFalse, cvec);
val () = decsp();
in
let
val () = gencde (second, whereto, tailKind, loopAddr);
(* Get rid of the result from the stack. If there is a result then the
``else-part'' will push it. *)
val () = case whereto of ToStack => decsp () | NoResult => ();
val thenExited : bool = !exited;
val toExit : labels =
if thenExited then noJump
else putBranchInstruction (jump, cvec);
(* start of "else part" *)
val () = fixup (toElse, cvec);
val () = exited := false;
val () = gencde (third, whereto, tailKind, loopAddr);
val elseExited : bool= !exited;
val () = fixup (toExit, cvec);
in
exited := (thenExited andalso elseExited) (* Only exited if both sides did. *)
end
end (* genCond *)
and genEval (eval, tailKind : tail) : unit =
let
val argList : backendIC list = List.map #1 (#argList eval)
val argsToPass : int = List.length argList;
(* Load arguments *)
fun loadArgs [] = ()
| loadArgs (v :: vs) =
let (* Push each expression onto the stack. *)
val () = gencde(v, ToStack, NotEnd, NONE);
in
loadArgs vs
end;
(* Called after the args and the closure to call have been pushed
onto the stack. *)
fun callClosure () : unit =
case tailKind of
NotEnd => (* Normal call. *)
genCallClosure cvec
| EndOfProc => (* Tail recursive call. *)
let
(* Get the return address onto the top of the stack. *)
val () = pushStackValue (0, 0);
(* Slide the return address, closure and args over the
old closure, return address and args, and reset the
stack. Then jump to the closure. *)
val () =
genTailCall(argsToPass + 2, !realstackptr - 1 + (numOfArgs - argsToPass), cvec);
(* It's "-1" not "-2", because we didn't bump the realstackptr
when we pushed the return address. SPF 3/1/97 *)
in
exited := true
end;
val () =
case #function eval of
(* The function is being loaded from the stack or closure so it
may be a static-link function. *)
BICExtract(ext, _) =>
let
(* Since the function is on the stack there can be no side-effects
in loading it. Can therefore load the arguments now. *)
val () = loadArgs argList;
val staticLinkValue =
case ext of
BICLoadArgument addr => Address (numOfArgs - addr, 0)
| BICLoadLocal addr =>
(
(* Local - is it a function? *)
case STRETCHARRAY.sub (decVec, addr) of
StackAddr n => Address (~ n, 0)
| Empty => raise InternalError "staticLinkValue: missing decVec entry"
)
| BICLoadClosure addr => loadStaticLink (addr+1, 1)(* Non-local or recursive. *)
| BICLoadRecursive => loadStaticLink (0, 1)
in
case staticLinkValue of
Address (addr, level) =>
let
val () = pushStackValue (addr, level);
in
callClosure ()
end
(* recursive reference to a function - not static link. *)
| Recursive code =>
let
val () = genRecRef (code, cvec);
val () = incsp();
in
callClosure ()
end
end (* BICExtract *)
| _ => (* The function is not being found by simply loading a value
from the stack or the closure. *)
let
(* Must load the closure and call. *)
(* Have to guarantee that the expression to return the function
is evaluated before the arguments. *)
(* Returns true if evaluating it later is safe. *)
fun safeToLeave (node: backendIC) : bool =
case node of
BICConstnt _ => true
| BICLambda _ => true
| BICField {base, ...} =>
(* Safe only if the expression (always a type) being indirected
is. This is put in because calling functions in a type is a
common occurence and should be made reasonably efficient. *)
safeToLeave base
| _ => false
val () =
if (case argList of [] => true | _ => safeToLeave (#function eval))
then let
(* Can load the args first. *)
val () = loadArgs argList;
in
gencde (#function eval, ToStack, NotEnd, NONE)
end
else let
(* The expression for the function is too complicated to
risk leaving. It might have a side-effect and we must
ensure that any side-effects it has are done before the
arguments are loaded. *)
val () = gencde(#function eval, ToStack, NotEnd, NONE);
val () = loadArgs(argList);
(* Load the function again. *)
val () = genLocal(argsToPass, cvec);
in
incsp ()
end
in
(* Call the function. *)
callClosure ()
end; (* Not BICExtract *)
(* Make sure we interpret when we return from the call *)
val () = genEnterIntCall (cvec, argsToPass);
in (* body of genEval *)
realstackptr := !realstackptr - argsToPass (* Args popped by caller. *)
end
(* This is used for cases which are not sparse i.e. the values occupy
a narrow enough range to make indexing efficient. *)
and genDenseCase
(pt as { cases, ...}: caseForm,
whereto: whereto,
tailKind: tail,
loopAddr) : unit =
let
val () = gencde (#test pt, ToStack, NotEnd, loopAddr);
(* The exit jumps are chained together. *)
val lastEndJump : labels ref = ref noJump;
val (isExhaustive, min:int, max:int) =
case #caseType pt of
CaseTag max => (true, 0, Word.toInt max)
| _ =>
let
val (_, aLabel) = hd cases
fun foldCases((_, w), (min, max)) = (Word.min(w, min), Word.max(w, max))
val (min, max) = List.foldl foldCases (aLabel, aLabel) cases
in
(false, Word.toInt min, Word.toInt max)
end
val limit : int = max - min
val () =
if min = 0 then ()
else let (* Subtract lower limit. *)
val () = pushConst(toMachineWord min, cvec);
val () = pushConst(ioOp POLY_SYS_aminus, cvec);
val () = genCallClosure cvec;
in
genEnterIntCall (cvec, 2) (* added SPF 28/6/95 *)
end;
val () = genCase (limit, cvec);
val () = decsp ();
(* Addresses are relative to the first entry in the vector. *)
val startVec : addrs = ic cvec;
(* Set each address in the vector to point to the default - they
will be overwritten by the actual address later. *)
val defaultAddr : int = (limit + 1) * 2;
val () =
forLoop (fn (_ : int) => genWord (defaultAddr, cvec)) 0 limit;
(* The default case, if any, follows the case statement. *)
val () = gencde (#default pt, whereto, tailKind, loopAddr);
val () = exited := false;
(* Now generate the code for each of the cases. *)
fun genEachCase ([] : (backendIC * word) list) : unit = ()
| genEachCase ((body : backendIC, matches : word) :: otherCases) : unit =
let
(* First exit from the previous case or the default if
this is the first. *)
val () =
lastEndJump :=
linkLabels
(!lastEndJump,
if !exited then noJump else putBranchInstruction(jump, cvec),
cvec);
(* Remove the result - the last case will leave it. *)
val () = case whereto of ToStack => decsp () | NoResult => ();
(* Now put the address of this code into the table if
an entry has not already been set. If it has the new
entry should be ignored. It is NOT an error (just
redundant). *)
local
val entryAddr : addrs = addrPlus(startVec, (Word.toInt matches - min) * 2)
in
val () =
if getBytes(2, entryAddr, cvec) = defaultAddr
then putBytes(addrMinus(ic cvec, startVec), 2, entryAddr, cvec)
else ()
end
(* Generate code for this case *)
val () = exited := false;
val () = gencde (body, whereto, tailKind, loopAddr);
in
genEachCase otherCases
end; (* genEachCase *)
val () = genEachCase (#cases pt);
(* Finally go down the list of exit labels pointing them to here. *)
val () = fixup (!lastEndJump, cvec);
in
exited := false
end (* genDenseCase *)
(* Generate a sparse case. *)
and genSparseCase
(pt : caseForm,
whereto : whereto,
tailKind : tail,
loopAddr) : unit =
let
(* Have already dealt with cases where there is only one case. *)
(* The exit jumps are chained together. *)
val lastEndJump = ref noJump;
fun caseCode ([], othersExited : bool) : bool = othersExited
| caseCode ((body : backendIC, matches : word) :: otherCases, othersExited : bool) : bool =
let
local
(* Is this really safe? What about multiple side-effects? SPF *)
val () = gencde (#test pt, ToStack, NotEnd, loopAddr);
(* Push the value to be compared. *)
val () = pushConst(toMachineWord matches, cvec);
(* Compare them. *)
val () =
pushConst (ioOp (POLY_SYS_word_eq), cvec);
val () = genCallClosure cvec;
val () = genEnterIntCall(cvec, 2); (* added SPF 28/6/95 *)
in
val lab : labels = putBranchInstruction (jumpFalse, cvec);
val () = decsp (); (* Remove result of test. *)
end
(* Now the expression. *)
val () = gencde(body, whereto, tailKind, loopAddr);
val thisHasExited : bool = !exited;
(* Remove the result - the default case will leave it. *)
val () = case whereto of ToStack => decsp () | NoResult => ();
val () =
lastEndJump :=
linkLabels
(!lastEndJump,
if thisHasExited then noJump else putBranchInstruction(jump, cvec),
cvec);
(* Now the next case. *)
val () = fixup (lab, cvec);
val () = exited := false;
in
caseCode (otherCases, othersExited andalso thisHasExited)
end; (* caseCode *)
(* First the cases. *)
val casesExited : bool = caseCode (#cases pt, true);
val () = exited := false;
val () =
let
val () = gencde(#default pt, whereto, tailKind, loopAddr);
in
exited := (!exited andalso casesExited)
end;
in
(* Finally go down the list of exit labels pointing them to here. *)
fixup (!lastEndJump, cvec)
end
(* Closure functions must start with ``enterIntProc''. This is actually a
break-point instruction in the machine-code instruction set to make sure
that the code is interpreted. It is a no-op if we are already
interpreting. *)
val () = genEnterIntProc (cvec, numOfArgs); (* SPF 23/6/95 *)
(* Generate the function. *)
(* Assume we always want a result. There is otherwise a problem if the
called routine returns a result of type void (i.e. no result) but the
caller wants a result (e.g. the identity function). *)
val () = gencde (pt, ToStack, EndOfProc, NONE)
val () = if !exited then () else genReturn (numOfArgs, cvec);
in (* body of codegen *)
(* Having code-generated the body of the function, it is copied
into a new data segment. *)
copyCode cvec
end (* codegen *);
fun gencode (pt: backendIC, parameters, _) : (unit -> machineWord) * Universal.universal list =
case pt of
BICLambda lam => (* We are compiling a function. *)
let
(* It is not essential to treat this specially, but it saves generating
a piece of code whose only function is to return the address of the
function. *)
(* make the code buffer for the new function. *)
val newCode : code = codeCreate (false, #name lam, parameters);
(* The only global references are recursive ones (?) *)
fun loadRecLink (_ : int, _ : int) : slValue =
Recursive newCode;
(* This function must have no non-local references. *)
val closureAddr : address =
codegen (#body lam, newCode, loadRecLink, List.length (#argLifetimes lam), parameters);
val res : machineWord = toMachineWord closureAddr;
in
(* Result is a function which returns the address of the function. *)
(fn () => res, [])
end
| _ =>
let (* Compile a top-level expression. *)
val newCode : code = codeCreate(false, "<top level>", parameters);
(* There ane *no* global references at all *)
fun loadRecLink (_ : int, _ : int) : slValue =
raise InternalError "top level reached";
val closureAddr : address =
codegen (pt, newCode, loadRecLink, 0, parameters)
in
(* Result is a function to execute the code. *)
(fn () => call (closureAddr, toMachineWord ()), [])
end; (* gencode *)
end (* GCODE functor body *)
end; (* structure-level let *)
|