1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 851 852 853 854 855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032 1033 1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 1216 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250 1251 1252 1253 1254 1255 1256 1257 1258 1259 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 1280 1281 1282 1283 1284 1285 1286 1287 1288 1289 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 1310 1311 1312
|
/*
Title: Machine dependent code for i386 and X64 under Windows and Unix
Copyright (c) 2000-7
Cambridge University Technical Services Limited
Further work copyright David C. J. Matthews 2011-16
This library is free software; you can redistribute it and/or
modify it under the terms of the GNU Lesser General Public
License version 2.1 as published by the Free Software Foundation.
This library is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public
License along with this library; if not, write to the Free Software
Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
*/
#ifdef HAVE_CONFIG_H
#include "config.h"
#elif defined(_WIN32)
#include "winconfig.h"
#else
#error "No configuration file"
#endif
#ifdef HAVE_STDLIB_H
#include <stdlib.h>
#endif
#include <stdio.h>
#ifdef HAVE_SIGNAL_H
#include <signal.h>
#endif
#ifdef HAVE_ASSERT_H
#include <assert.h>
#define ASSERT(x) assert(x)
#else
#define ASSERT(x)
#endif
#ifdef HAVE_STRING_H
#include <string.h>
#endif
#ifdef HAVE_ERRNO_H
#include <errno.h>
#endif
#if (defined(_WIN32))
#include <windows.h>
#include <excpt.h>
#endif
#include "globals.h"
#include "run_time.h"
#include "mpoly.h"
#include "arb.h"
#include "diagnostics.h"
#include "processes.h"
#include "sys.h"
#include "profiling.h"
#include "sighandler.h"
#include "machine_dep.h"
#include "scanaddrs.h"
#include "gc.h"
#include "check_objects.h"
#include "save_vec.h"
#include "memmgr.h"
#include "reals.h"
#include "polystring.h"
#include "xwindows.h"
#include "objsize.h"
#include "foreign.h"
#include "process_env.h"
#include "basicio.h"
#include "network.h"
#include "os_specific.h"
#include "poly_specific.h"
#include "timing.h"
#include "polyffi.h"
#include "rtsentry.h"
/**********************************************************************
*
* Register usage:
*
* %Reax: First argument to function. Result of function call.
* %Rebx: Second argument to function.
* %Recx: General register
* %Redx: Closure pointer in call.
* %Rebp: Points to memory used for extra registers
* %Resi: General register.
* %Redi: General register.
* %Resp: Stack pointer.
* The following apply only on the X64
* %R8: Third argument to function
* %R9: Fourth argument to function
* %R10: Fifth argument to function
* %R11: General register
* %R12: General register
* %R13: General register
* %R14: General register
* %R15: Memory allocation pointer
*
**********************************************************************/
#ifdef HOSTARCHITECTURE_X86_64
struct fpSaveArea {
double fpregister[7]; // Save area for xmm0-6
};
#else
// Structure of floating point save area.
// This is dictated by the hardware.
typedef byte fpregister[10];
struct fpSaveArea {
unsigned short cw;
unsigned short _unused0;
unsigned short sw;
unsigned short _unused1;
unsigned short tw;
unsigned short _unused2;
unsigned fip;
unsigned short fcs0;
unsigned short _unused3;
unsigned foo;
unsigned short fcs1;
unsigned short _unused4;
fpregister registers[8];
};
#endif
/* the amount of ML stack space to reserve for registers,
C exception handling etc. The compiler requires us to
reserve 2 stack-frames worth (2 * 20 words). We actually reserve
slightly more than this.
*/
#if (!defined(_WIN32) && !defined(HAVE_SIGALTSTACK))
// If we can't handle signals on a separate stack make sure there's space
// on the Poly stack.
#define OVERFLOW_STACK_SIZE (50+1024)
#else
#define OVERFLOW_STACK_SIZE 50
#endif
class X86TaskData;
// This is passed as the argument vector to X86AsmSwitchToPoly.
// The offsets are built into the assembly code and the code-generator.
// localMpointer and stackPtr are updated before control returns to C.
typedef struct _AssemblyArgs {
PolyWord *localMpointer; // Allocation ptr + 1 word
PolyWord *handlerRegister; // Current exception handler
PolyWord *localMbottom; // Base of memory + 1 word
PolyWord *stackLimit; // Lower limit of stack
PolyWord exceptionPacket; // Set if there is an exception
byte unusedRequestCode; // No longer used.
byte unusedFlag; // No longer used
byte returnReason; // Reason for returning from ML.
byte unusedRestore; // No longer used.
POLYUNSIGNED saveCStack; // Saved C stack frame.
PolyObject *threadId; // My thread id. Saves having to call into RTS for it.
PolyWord *stackPtr; // Current stack pointer
POLYCODEPTR unusedProgramCtr;
byte *heapOverFlowCall; // These are filled in with the functions.
byte *stackOverFlowCall;
byte *stackOverFlowCallEx;
// Saved registers, where applicable.
PolyWord p_rax;
PolyWord p_rbx;
PolyWord p_rcx;
PolyWord p_rdx;
PolyWord p_rsi;
PolyWord p_rdi;
#ifdef HOSTARCHITECTURE_X86_64
PolyWord p_r8;
PolyWord p_r9;
PolyWord p_r10;
PolyWord p_r11;
PolyWord p_r12;
PolyWord p_r13;
PolyWord p_r14;
#endif
struct fpSaveArea p_fp;
} AssemblyArgs;
class X86TaskData: public TaskData {
public:
X86TaskData();
unsigned allocReg; // The register to take the allocated space.
POLYUNSIGNED allocWords; // The words to allocate.
Handle callBackResult;
AssemblyArgs assemblyInterface;
int saveRegisterMask; // Registers that need to be updated by a GC.
virtual void GarbageCollect(ScanAddress *process);
void ScanStackAddress(ScanAddress *process, PolyWord *pt, StackSpace *stack);
virtual Handle EnterPolyCode(); // Start running ML
virtual void InterruptCode();
virtual bool AddTimeProfileCount(SIGNALCONTEXT *context);
virtual void InitStackFrame(TaskData *parentTask, Handle proc, Handle arg);
virtual void SetException(poly_exn *exc);
// Release a mutex in exactly the same way as compiler code
virtual Handle AtomicIncrement(Handle mutexp);
virtual void AtomicReset(Handle mutexp);
// Return the minimum space occupied by the stack. Used when setting a limit.
virtual POLYUNSIGNED currentStackSpace(void) const { return (this->stack->top - assemblyInterface.stackPtr) + OVERFLOW_STACK_SIZE; }
// Increment the profile count for an allocation. Also now used for mutex contention.
virtual void addProfileCount(POLYUNSIGNED words)
{ add_count(this, assemblyInterface.stackPtr[0].AsCodePtr(), words); }
// PreRTSCall: After calling from ML to the RTS we need to save the current heap pointer
virtual void PreRTSCall(void) { SaveMemRegisters(); }
// PostRTSCall: Before returning we need to restore the heap pointer.
// If there has been a GC in the RTS call we need to create a new heap area.
virtual void PostRTSCall(void) { SetMemRegisters(); }
virtual void CopyStackFrame(StackObject *old_stack, POLYUNSIGNED old_length, StackObject *new_stack, POLYUNSIGNED new_length);
virtual Handle EnterCallbackFunction(Handle func, Handle args);
int SwitchToPoly();
void HeapOverflowTrap(byte *pcPtr);
void SetMemRegisters();
void SaveMemRegisters();
void SetRegisterMask();
PLock interruptLock;
PolyWord *get_reg(int n);
PolyWord *®SP() { return assemblyInterface.stackPtr; }
PolyWord ®AX() { return assemblyInterface.p_rax; }
PolyWord ®BX() { return assemblyInterface.p_rbx; }
PolyWord ®CX() { return assemblyInterface.p_rcx; }
PolyWord ®DX() { return assemblyInterface.p_rdx; }
PolyWord ®SI() { return assemblyInterface.p_rsi; }
PolyWord ®DI() { return assemblyInterface.p_rdi; }
#ifdef HOSTARCHITECTURE_X86_64
PolyWord ®8() { return assemblyInterface.p_r8; }
PolyWord ®9() { return assemblyInterface.p_r9; }
PolyWord ®10() { return assemblyInterface.p_r10; }
PolyWord ®11() { return assemblyInterface.p_r11; }
PolyWord ®12() { return assemblyInterface.p_r12; }
PolyWord ®13() { return assemblyInterface.p_r13; }
PolyWord ®14() { return assemblyInterface.p_r14; }
#endif
#if (defined(_WIN32) && !defined(__CYGWIN__))
DWORD savedErrno;
#else
int savedErrno;
#endif
};
class X86Dependent: public MachineDependent {
public:
X86Dependent() {}
// Create a task data object.
virtual TaskData *CreateTaskData(void) { return new X86TaskData(); }
virtual unsigned InitialStackSize(void) { return 128+OVERFLOW_STACK_SIZE; } // Initial size of a stack
virtual void ScanConstantsWithinCode(PolyObject *addr, PolyObject *oldAddr, POLYUNSIGNED length, ScanAddress *process);
virtual Architectures MachineArchitecture(void)
#ifndef HOSTARCHITECTURE_X86_64
{ return MA_I386; }
#else /* HOSTARCHITECTURE_X86_64 */
{ return MA_X86_64; }
#endif /* HOSTARCHITECTURE_X86_64 */
};
// Values for the returnReason byte
enum RETURN_REASON {
RETURN_IO_CALL_NOW_UNUSED = 0,
RETURN_HEAP_OVERFLOW = 1,
RETURN_STACK_OVERFLOW = 2,
RETURN_STACK_OVERFLOWEX = 3,
RETURN_CALLBACK_RETURN = 6,
RETURN_CALLBACK_EXCEPTION = 7,
RETURN_KILL_SELF = 9
};
extern "C" {
// These are declared in the assembly code segment.
void X86AsmSwitchToPoly(void *);
extern int X86AsmKillSelf(void);
extern int X86AsmCallbackReturn(void);
extern int X86AsmCallbackException(void);
extern int X86AsmPopArgAndClosure(void);
extern int X86AsmRaiseException(void);
extern int X86AsmCallExtraRETURN_HEAP_OVERFLOW(void);
extern int X86AsmCallExtraRETURN_STACK_OVERFLOW(void);
extern int X86AsmCallExtraRETURN_STACK_OVERFLOWEX(void);
POLYUNSIGNED X86AsmAtomicIncrement(PolyObject*);
POLYUNSIGNED X86AsmAtomicDecrement(PolyObject*);
};
extern "C" {
POLYEXTERNALSYMBOL POLYUNSIGNED PolySetCodeConstant(byte *pointer, PolyWord offset, PolyWord c, PolyWord flags);
}
X86TaskData::X86TaskData(): allocReg(0), allocWords(0), saveRegisterMask(0)
{
assemblyInterface.heapOverFlowCall = (byte*)X86AsmCallExtraRETURN_HEAP_OVERFLOW;
assemblyInterface.stackOverFlowCall = (byte*)X86AsmCallExtraRETURN_STACK_OVERFLOW;
assemblyInterface.stackOverFlowCallEx = (byte*)X86AsmCallExtraRETURN_STACK_OVERFLOWEX;
savedErrno = 0;
}
void X86TaskData::GarbageCollect(ScanAddress *process)
{
TaskData::GarbageCollect(process); // Process the parent first
assemblyInterface.threadId = threadObject;
if (stack != 0)
{
// Now the values on the stack.
for (PolyWord *q = assemblyInterface.stackPtr; q < stack->top; q++)
ScanStackAddress(process, q, stack);
}
// Register mask
for (int i = 0; i < 16; i++)
{
if (saveRegisterMask & (1 << i))
ScanStackAddress(process, get_reg(i), stack);
}
}
// Process a value within the stack.
void X86TaskData::ScanStackAddress(ScanAddress *process, PolyWord *pt, StackSpace *stack)
{
// We may have return addresses on the stack which could look like
// tagged values. Check whether the value is in the code area before
// checking whether it is untagged.
// The -1 here is because we may have a zero-sized cell in the last
// word of a space.
MemSpace *space = gMem.SpaceForAddress(pt->AsCodePtr()-1);
if (space == 0) return;
if (space->spaceType == ST_CODE)
{
PolyObject *obj = gMem.FindCodeObject(pt->AsCodePtr());
// If it is actually an integer it might be outside a valid code object.
if (obj == 0)
{
ASSERT(pt->IsTagged()); // It must be an integer
}
else // Process the address of the start. Don't update anything.
process->ScanObjectAddress(obj);
}
else if (space->spaceType == ST_LOCAL && pt->IsDataPtr())
// Local values must be word addresses.
*pt = process->ScanObjectAddress(pt->AsObjPtr());
}
// Copy a stack
void X86TaskData::CopyStackFrame(StackObject *old_stack, POLYUNSIGNED old_length, StackObject *new_stack, POLYUNSIGNED new_length)
{
/* Moves a stack, updating all references within the stack */
PolyWord *old_base = (PolyWord *)old_stack;
PolyWord *new_base = (PolyWord*)new_stack;
PolyWord *old_top = old_base + old_length;
/* Calculate the offset of the new stack from the old. If the frame is
being extended objects in the new frame will be further up the stack
than in the old one. */
POLYSIGNED offset = new_base - old_base + new_length - old_length;
PolyWord *oldStackPtr = assemblyInterface.stackPtr;
// Adjust the stack pointer and handler pointer since these point into the stack.
assemblyInterface.stackPtr = assemblyInterface.stackPtr + offset;
assemblyInterface.handlerRegister = assemblyInterface.handlerRegister + offset;
// We need to adjust any values on the stack that are pointers within the stack.
// Skip the unused part of the stack.
POLYUNSIGNED i = oldStackPtr - old_base;
ASSERT (i <= old_length);
i = old_length - i;
PolyWord *old = oldStackPtr;
PolyWord *newp= assemblyInterface.stackPtr;
while (i--)
{
PolyWord old_word = *old++;
if (old_word.IsTagged() || old_word.AsStackAddr() < old_base || old_word.AsStackAddr() >= old_top)
*newp++ = old_word;
else
*newp++ = PolyWord::FromStackAddr(old_word.AsStackAddr() + offset);
}
ASSERT(old == ((PolyWord*)old_stack)+old_length);
ASSERT(newp == ((PolyWord*)new_stack)+new_length);
// And change any registers that pointed into the old stack
for (int j = 0; j < 16; j++)
{
if (saveRegisterMask & (1 << j))
{
PolyWord *regAddr = get_reg(j);
PolyWord addr = *regAddr;
if (! addr.IsTagged() && addr.AsStackAddr() >= old_base && addr.AsStackAddr() < old_top)
*regAddr = PolyWord::FromStackAddr(addr.AsStackAddr() + offset);
}
}
}
// Set code constant. This can be a fast call. The only reason it is in the RTS is
// to ensure that there is no possibility of a GC while the individual bytes are being
// copied.
// At the moment this assumes we're dealing with a 32-bit constant on a 32-bit machine
// and a 64-bit constant on a 64-bit machine.
POLYUNSIGNED PolySetCodeConstant(byte *pointer, PolyWord offset, PolyWord cWord, PolyWord flags)
{
POLYUNSIGNED c = cWord.AsUnsigned();
// pointer is the start of the code segment.
// c will usually be an address.
// offset is a byte offset
pointer += offset.UnTaggedUnsigned();
if (flags == TAGGED(1))
c -= (POLYUNSIGNED)(pointer + sizeof(PolyWord)); // Relative address. Relative to AFTER the pointer.
// Store the value into the code. It can be on an arbitrary alignment.
for (unsigned i = 0; i < sizeof(PolyWord); i++)
{
pointer[i] = (byte)(c & 255);
c >>= 8;
}
return TAGGED(0).AsUnsigned();
}
Handle X86TaskData::EnterPolyCode()
/* Called from "main" to enter the code. */
{
Handle hOriginal = this->saveVec.mark(); // Set this up for the IO calls.
while (1)
{
this->saveVec.reset(hOriginal); // Remove old RTS arguments and results.
// Run the ML code and return with the function to call.
this->inML = true;
int ioFunction = SwitchToPoly();
this->inML = false;
try {
switch (ioFunction)
{
case -1:
// We've been interrupted. This usually involves simulating a
// stack overflow so we could come here because of a genuine
// stack overflow.
// Previously this code was executed on every RTS call but there
// were problems on Mac OS X at least with contention on schedLock.
// Process any asynchronous events i.e. interrupts or kill
processes->ProcessAsynchRequests(this);
// Release and re-acquire use of the ML memory to allow another thread
// to GC.
processes->ThreadReleaseMLMemory(this);
processes->ThreadUseMLMemory(this);
break;
case -2: // A callback has returned.
return callBackResult; // Return the saved value. Not used in the new interface.
default:
Crash("Unknown io operation %d\n", ioFunction);
}
}
catch (IOException &) {
}
}
}
// Run the current ML process. X86AsmSwitchToPoly saves the C state so that
// whenever the ML requires assistance from the rest of the RTS it simply
// returns to C with the appropriate values set in assemblyInterface.requestCode and
//
int X86TaskData::SwitchToPoly()
// (Re)-enter the Poly code from C. Returns with the io function to call or
// -1 if we are responding to an interrupt.
{
Handle mark = this->saveVec.mark();
do
{
this->saveVec.reset(mark); // Remove old data e.g. from arbitrary precision.
SetMemRegisters();
// We need to save the C stack entry across this call in case
// we're making a callback and the previous C stack entry is
// for the original call.
POLYUNSIGNED savedCStack = this->assemblyInterface.saveCStack;
// Restore the saved error state.
#if (defined(_WIN32) && !defined(__CYGWIN__))
SetLastError(savedErrno);
#else
errno = savedErrno;
#endif
// Enter the ML code.
X86AsmSwitchToPoly(&this->assemblyInterface);
this->assemblyInterface.saveCStack = savedCStack;
// Save the error codes. We may have made an RTS/FFI call that
// has set these and we don't want to do anything to change them.
#if (defined(_WIN32) && !defined(__CYGWIN__))
savedErrno = GetLastError();
#else
savedErrno = errno;
#endif
SaveMemRegisters(); // Update globals from the memory registers.
// Handle any heap/stack overflows or arbitrary precision traps.
switch (this->assemblyInterface.returnReason)
{
case RETURN_HEAP_OVERFLOW:
// The heap has overflowed.
SetRegisterMask();
this->HeapOverflowTrap(assemblyInterface.stackPtr[0].AsCodePtr()); // Computes a value for allocWords only
break;
case RETURN_STACK_OVERFLOW:
case RETURN_STACK_OVERFLOWEX:
{
SetRegisterMask();
POLYUNSIGNED min_size;
if (assemblyInterface.returnReason == RETURN_STACK_OVERFLOW)
{
min_size = this->stack->top - assemblyInterface.stackPtr + OVERFLOW_STACK_SIZE;
}
else
{
// Stack limit overflow. If the required stack space is larger than
// the fixed overflow size the code will calculate the limit in %EDI.
PolyWord *stackP = regDI().AsStackAddr();
min_size = this->stack->top - stackP + OVERFLOW_STACK_SIZE;
}
try {
// The stack check has failed. This may either be because we really have
// overflowed the stack or because the stack limit value has been adjusted
// to result in a call here.
CheckAndGrowStack(this, min_size);
}
catch (IOException &) {
// We may get an exception while handling this if we run out of store
}
{
PLocker l(&interruptLock);
// Set the stack limit. This clears any interrupt and also sets the
// correct value if we've grown the stack.
this->assemblyInterface.stackLimit = this->stack->bottom + OVERFLOW_STACK_SIZE;
}
return -1; // We're in a safe state to handle any interrupts.
}
case RETURN_CALLBACK_RETURN:
// Remove the extra exception handler we created in EnterCallbackFunction
ASSERT(assemblyInterface.handlerRegister == regSP());
regSP() += 1;
assemblyInterface.handlerRegister = (*(regSP()++)).AsStackAddr(); // Restore the previous handler.
this->callBackResult = this->saveVec.push(regAX()); // Argument to return is in RAX.
return -2;
case RETURN_CALLBACK_EXCEPTION:
// An ML callback has raised an exception.
// It isn't possible to do anything here except abort.
Crash("An ML function called from foreign code raised an exception. Unable to continue.");
case RETURN_KILL_SELF:
exitThread(this);
default:
Crash("Unknown return reason code %u", this->assemblyInterface.returnReason);
}
} while (1);
}
void X86TaskData::InitStackFrame(TaskData *parentTaskData, Handle proc, Handle arg)
/* Initialise stack frame. */
{
StackSpace *space = this->stack;
StackObject * newStack = space->stack();
POLYUNSIGNED stack_size = space->spaceSize();
POLYUNSIGNED topStack = stack_size-6;
assemblyInterface.stackPtr = (PolyWord*)newStack+topStack;
assemblyInterface.stackLimit = space->bottom + OVERFLOW_STACK_SIZE;
assemblyInterface.handlerRegister = (PolyWord*)newStack+topStack+4;
// Floating point save area.
memset(&assemblyInterface.p_fp, 0, sizeof(struct fpSaveArea));
#ifndef HOSTARCHITECTURE_X86_64
// Set the control word for 64-bit precision otherwise we get inconsistent results.
assemblyInterface.p_fp.cw = 0x027f ; // Control word
assemblyInterface.p_fp.tw = 0xffff; // Tag registers - all unused
#endif
// Initial entry point - on the stack.
((PolyWord*)newStack)[topStack] = PolyWord::FromCodePtr((byte*)&X86AsmPopArgAndClosure);
// Push the argument and the closure on the stack. We can't put them into the registers
// yet because we might get a GC before we actually start the code.
((PolyWord*)newStack)[topStack+1] = DEREFWORDHANDLE(proc); // Closure
((PolyWord*)newStack)[topStack+2] = (arg == 0) ? TAGGED(0) : DEREFWORD(arg); // Argument
/* We initialise the end of the stack with a sequence that will jump to
kill_self whether the process ends with a normal return or by raising an
exception. A bit of this was added to fix a bug when stacks were objects
on the heap and could be scanned by the GC. */
((PolyWord*)newStack)[topStack+5] = TAGGED(0); // Probably no longer needed
// Set the default handler and return address to point to this code.
PolyWord killJump(PolyWord::FromCodePtr((byte*)&X86AsmKillSelf));
// Exception handler.
((PolyWord*)newStack)[topStack+4] = killJump;
// Normal return address. We need a separate entry on the stack from
// the exception handler because it is possible that the code we are entering
// may replace this entry with an argument. The code-generator optimises tail-recursive
// calls to functions with more args than the called function.
((PolyWord*)newStack)[topStack+3] = killJump;
}
// In Solaris-x86 the registers are named EIP and ESP.
#if (!defined(REG_EIP) && defined(EIP))
#define REG_EIP EIP
#endif
#if (!defined(REG_ESP) && defined(ESP))
#define REG_ESP ESP
#endif
// Get the PC and SP(stack) from a signal context. This is needed for profiling.
// This version gets the actual sp and pc if we are in ML.
bool X86TaskData::AddTimeProfileCount(SIGNALCONTEXT *context)
{
PolyWord * sp = 0;
POLYCODEPTR pc = 0;
if (context != 0)
{
// The tests for HAVE_UCONTEXT_T, HAVE_STRUCT_SIGCONTEXT and HAVE_WINDOWS_H need
// to follow the tests in processes.h.
#if defined(HAVE_WINDOWS_H)
#ifdef _WIN64
sp = (PolyWord *)context->Rsp;
pc = (POLYCODEPTR)context->Rip;
#else
// Windows 32 including cygwin.
sp = (PolyWord *)context->Esp;
pc = (POLYCODEPTR)context->Eip;
#endif
#elif defined(HAVE_UCONTEXT_T)
#ifdef HAVE_MCONTEXT_T_GREGS
// Linux
#ifndef HOSTARCHITECTURE_X86_64
pc = (byte*)context->uc_mcontext.gregs[REG_EIP];
sp = (PolyWord*)context->uc_mcontext.gregs[REG_ESP];
#else /* HOSTARCHITECTURE_X86_64 */
pc = (byte*)context->uc_mcontext.gregs[REG_RIP];
sp = (PolyWord*)context->uc_mcontext.gregs[REG_RSP];
#endif /* HOSTARCHITECTURE_X86_64 */
#elif defined(HAVE_MCONTEXT_T_MC_ESP)
// FreeBSD
#ifndef HOSTARCHITECTURE_X86_64
pc = (byte*)context->uc_mcontext.mc_eip;
sp = (PolyWord*)context->uc_mcontext.mc_esp;
#else /* HOSTARCHITECTURE_X86_64 */
pc = (byte*)context->uc_mcontext.mc_rip;
sp = (PolyWord*)context->uc_mcontext.mc_rsp;
#endif /* HOSTARCHITECTURE_X86_64 */
#else
// Mac OS X
#ifndef HOSTARCHITECTURE_X86_64
#if(defined(HAVE_STRUCT_MCONTEXT_SS)||defined(HAVE_STRUCT___DARWIN_MCONTEXT32_SS))
pc = (byte*)context->uc_mcontext->ss.eip;
sp = (PolyWord*)context->uc_mcontext->ss.esp;
#elif(defined(HAVE_STRUCT___DARWIN_MCONTEXT32___SS))
pc = (byte*)context->uc_mcontext->__ss.__eip;
sp = (PolyWord*)context->uc_mcontext->__ss.__esp;
#endif
#else /* HOSTARCHITECTURE_X86_64 */
#if(defined(HAVE_STRUCT_MCONTEXT_SS)||defined(HAVE_STRUCT___DARWIN_MCONTEXT64_SS))
pc = (byte*)context->uc_mcontext->ss.rip;
sp = (PolyWord*)context->uc_mcontext->ss.rsp;
#elif(defined(HAVE_STRUCT___DARWIN_MCONTEXT64___SS))
pc = (byte*)context->uc_mcontext->__ss.__rip;
sp = (PolyWord*)context->uc_mcontext->__ss.__rsp;
#endif
#endif /* HOSTARCHITECTURE_X86_64 */
#endif
#elif defined(HAVE_STRUCT_SIGCONTEXT)
#if defined(HOSTARCHITECTURE_X86_64) && defined(__OpenBSD__)
// CPP defines missing in amd64/signal.h in OpenBSD
pc = (byte*)context->sc_rip;
sp = (PolyWord*)context->sc_rsp;
#else // !HOSTARCHITEXTURE_X86_64 || !defined(__OpenBSD__)
pc = (byte*)context->sc_pc;
sp = (PolyWord*)context->sc_sp;
#endif
#endif
}
if (pc != 0)
{
// See if the PC we've got is an ML code address.
MemSpace *space = gMem.SpaceForAddress(pc);
if (space != 0 && (space->spaceType == ST_CODE || space->spaceType == ST_PERMANENT))
{
add_count(this, pc, 1);
return true;
}
}
// See if the sp value is in the current stack.
if (sp >= this->stack->bottom && sp < this->stack->top)
{
// We may be in the assembly code. The top of the stack will be a return address.
pc = sp[0].AsCodePtr();
MemSpace *space = gMem.SpaceForAddress(pc);
if (space != 0 && (space->spaceType == ST_CODE || space->spaceType == ST_PERMANENT))
{
add_count(this, pc, 1);
return true;
}
}
// See if the value of regSP is a valid stack pointer.
// This works if we happen to be in an RTS call using a "Full" call.
// It doesn't work if we've used a "Fast" call because that doesn't save the SP.
sp = assemblyInterface.stackPtr;
if (sp >= this->stack->bottom && sp < this->stack->top)
{
// We may be in the run-time system.
pc = sp[0].AsCodePtr();
MemSpace *space = gMem.SpaceForAddress(pc);
if (space != 0 && (space->spaceType == ST_CODE || space->spaceType == ST_PERMANENT))
{
add_count(this, pc, 1);
return true;
}
}
// None of those worked
return false;
}
// This is called from a different thread so we have to be careful.
void X86TaskData::InterruptCode()
{
PLocker l(&interruptLock);
// Set the stack limit pointer to the top of the stack to cause
// a trap when we next check for stack overflow.
// We use a lock here to ensure that we always use the current value of the
// stack. The thread we're interrupting could be growing the stack at this point.
if (this->stack != 0)
this->assemblyInterface.stackLimit = this->stack->top-1;
}
// This is called from SwitchToPoly before we enter the ML code.
void X86TaskData::SetMemRegisters()
{
// Copy the current store limits into variables before we go into the assembly code.
// If we haven't yet set the allocation area or we don't have enough we need
// to create one (or a new one).
if (this->allocPointer <= this->allocLimit + this->allocWords)
{
if (this->allocPointer < this->allocLimit)
Crash ("Bad length in heap overflow trap");
// Find some space to allocate in. Updates taskData->allocPointer and
// returns a pointer to the newly allocated space (if allocWords != 0)
PolyWord *space =
processes->FindAllocationSpace(this, this->allocWords, true);
if (space == 0)
{
// We will now raise an exception instead of returning.
// Set allocWords to zero so we don't set the allocation register
// since that could be holding the exception packet.
this->allocWords = 0;
}
// Undo the allocation just now.
this->allocPointer += this->allocWords;
}
if (this->allocWords != 0)
{
// If we have had a heap trap we actually do the allocation here.
// We will have already garbage collected and recovered sufficient space.
// This also happens if we have just trapped because of store profiling.
this->allocPointer -= this->allocWords; // Now allocate
// Set the allocation register to this area.
if (this->allocReg < 15)
*(get_reg(this->allocReg)) =
PolyWord::FromStackAddr(this->allocPointer + 1); /* remember: it's off-by-one */
this->allocWords = 0;
}
// If we have run out of store, either just above or while allocating in the RTS,
// allocPointer and allocLimit will have been set to zero as part of the GC. We will
// now be raising an exception which may free some store but we need to come back here
// before we allocate anything. The compiled code uses unsigned arithmetic to check for
// heap overflow but only after subtracting the space required. We need to make sure
// that the values are still non-negative after substracting any object size.
if (this->allocPointer == 0) this->allocPointer += MAX_OBJECT_SIZE;
if (this->allocLimit == 0) this->allocLimit += MAX_OBJECT_SIZE;
this->assemblyInterface.localMbottom = this->allocLimit + 1;
this->assemblyInterface.localMpointer = this->allocPointer + 1;
// If we are profiling store allocation we set mem_hl so that a trap
// will be generated.
if (profileMode == kProfileStoreAllocation)
this->assemblyInterface.localMbottom = this->assemblyInterface.localMpointer;
this->assemblyInterface.returnReason = RETURN_IO_CALL_NOW_UNUSED;
this->assemblyInterface.threadId = this->threadObject;
}
// This is called whenever we have returned from ML to C.
void X86TaskData::SaveMemRegisters()
{
this->allocPointer = this->assemblyInterface.localMpointer - 1;
this->allocWords = 0;
this->assemblyInterface.exceptionPacket = TAGGED(0);
this->saveRegisterMask = 0;
}
// Called on a GC or stack overflow trap. The register mask
// is in the bytes after the trap call.
void X86TaskData::SetRegisterMask()
{
byte *pc = assemblyInterface.stackPtr[0].AsCodePtr();
if (*pc == 0xcd) // CD - INT n is used for a single byte
{
pc++;
saveRegisterMask = *pc++;
}
else if (*pc == 0xca) // CA - FAR RETURN is used for a two byte mask
{
pc++;
saveRegisterMask = pc[0] | (pc[1] << 8);
pc += 2;
}
assemblyInterface.stackPtr[0] = PolyWord::FromCodePtr(pc);
}
PolyWord *X86TaskData::get_reg(int n)
/* Returns a pointer to the register given by n. */
{
switch (n)
{
case 0: return &assemblyInterface.p_rax;
case 1: return &assemblyInterface.p_rcx;
case 2: return &assemblyInterface.p_rdx;
case 3: return &assemblyInterface.p_rbx;
// Should not have rsp or rbp.
case 6: return &assemblyInterface.p_rsi;
case 7: return &assemblyInterface.p_rdi;
#ifdef HOSTARCHITECTURE_X86_64
case 8: return &assemblyInterface.p_r8;
case 9: return &assemblyInterface.p_r9;
case 10: return &assemblyInterface.p_r10;
case 11: return &assemblyInterface.p_r11;
case 12: return &assemblyInterface.p_r12;
case 13: return &assemblyInterface.p_r13;
case 14: return &assemblyInterface.p_r14;
// R15 is the heap pointer so shouldn't occur here.
#endif /* HOSTARCHITECTURE_X86_64 */
default: Crash("Unknown register %d\n", n);
}
}
// Called as a result of a heap overflow trap
void X86TaskData::HeapOverflowTrap(byte *pcPtr)
{
X86TaskData *mdTask = this;
POLYUNSIGNED wordsNeeded = 0;
// The next instruction, after any branches round forwarding pointers or pop
// instructions, will be a store of register containing the adjusted heap pointer.
// We need to find that register and the value in it in order to find out how big
// the area we actually wanted is. N.B. The code-generator and assembly code
// must generate the correct instruction sequence.
// byte *pcPtr = assemblyInterface.programCtr;
while (true)
{
if (pcPtr[0] == 0xeb)
{
// Forwarding pointer
if (pcPtr[1] >= 128) pcPtr += 256 - pcPtr[1] + 2;
else pcPtr += pcPtr[1] + 2;
}
else if ((pcPtr[0] & 0xf8) == 0x58) // Pop instruction.
pcPtr++;
else if (pcPtr[0] == 0x41 && ((pcPtr[1] & 0xf8) == 0x58)) // Pop with Rex prefix
pcPtr += 2;
else break;
}
#ifndef HOSTARCHITECTURE_X86_64
// This should be movl REG,0[%ebp].
ASSERT(pcPtr[0] == 0x89);
mdTask->allocReg = (pcPtr[1] >> 3) & 7; // Remember this until we allocate the memory
PolyWord *reg = get_reg(mdTask->allocReg);
PolyWord reg_val = *reg;
// The space we need is the difference between this register
// and the current value of newptr.
// The +1 here is because assemblyInterface.localMpointer is A.M.pointer +1. The reason
// is that after the allocation we have the register pointing at the address we will
// actually use.
wordsNeeded = (this->allocPointer - (PolyWord*)reg_val.AsAddress()) + 1;
*reg = TAGGED(0); // Clear this - it's not a valid address.
/* length in words, including length word */
ASSERT (wordsNeeded <= (1<<24)); /* Max object size including length/flag word is 2^24 words. */
#else /* HOSTARCHITECTURE_X86_64 */
ASSERT(pcPtr[1] == 0x89 || pcPtr[1] == 0x8b);
if (pcPtr[1] == 0x89)
{
// New (5.4) format. This should be movq REG,%r15
ASSERT(pcPtr[0] == 0x49 || pcPtr[0] == 0x4d);
mdTask->allocReg = (pcPtr[2] >> 3) & 7; // Remember this until we allocate the memory
if (pcPtr[0] & 0x4) mdTask->allocReg += 8;
}
else
{
// Alternative form of movq REG,%r15
ASSERT(pcPtr[0] == 0x4c || pcPtr[0] == 0x4d);
mdTask->allocReg = pcPtr[2] & 7; // Remember this until we allocate the memory
if (pcPtr[0] & 0x1) mdTask->allocReg += 8;
}
PolyWord *reg = get_reg(this->allocReg);
PolyWord reg_val = *reg;
wordsNeeded = (this->allocPointer - (PolyWord*)reg_val.AsAddress()) + 1;
*reg = TAGGED(0); // Clear this - it's not a valid address.
#endif /* HOSTARCHITECTURE_X86_64 */
if (profileMode == kProfileStoreAllocation)
addProfileCount(wordsNeeded);
mdTask->allocWords = wordsNeeded; // The actual allocation is done in SetMemRegisters.
}
void X86TaskData::SetException(poly_exn *exc)
// Set up the stack to raise an exception.
{
// Do we need to set the PC value any longer? It may be necessary if
// we have taken a trap because another thread has sent a broadcast interrupt.
*(--assemblyInterface.stackPtr) = PolyWord::FromCodePtr((byte*)X86AsmRaiseException);
regAX() = exc; /* put exception data into eax */
assemblyInterface.exceptionPacket = exc; // Set for direct calls.
}
// Sets up a callback function on the current stack. The present state is that
// the ML code has made a call in to foreign_dispatch. We need to set the stack
// up so that we will enter the callback (as with CallCodeTupled) but when we return
// the result we enter callback_return.
Handle X86TaskData::EnterCallbackFunction(Handle func, Handle args)
{
// If we ever implement a light version of the FFI that allows a call to C
// code without saving enough to allow allocation in C code we need to ensure
// that this code doesn't do any allocation. Essentially we need the values
// in localMpointer and localMbottom to be valid across a call to C. If we do
// a callback the ML callback function would pick up the values saved in the
// originating call.
// Set up an exception handler so we will enter callBackException if there is an exception.
*(--regSP()) = PolyWord::FromStackAddr(assemblyInterface.handlerRegister); // Create a special handler entry
*(--regSP()) = PolyWord::FromCodePtr((byte*)&X86AsmCallbackException);
assemblyInterface.handlerRegister = regSP();
// Push the call to callBackReturn onto the stack as the return address.
*(--regSP()) = PolyWord::FromCodePtr((byte*)&X86AsmCallbackReturn);
// Set up the entry point of the callback.
PolyObject *functToCall = func->WordP();
regDX() = functToCall; // Closure address
regAX() = args->Word();
// Push entry point address
*(--regSP()) = functToCall->Get(0); // First word of closure is entry pt.
return EnterPolyCode();
}
// Decode and process an effective address. There may
// be a constant address in here but in any case we need
// to decode it to work out where the next instruction starts.
// If this is an lea instruction any addresses are just constants
// so must not be treated as addresses.
static void skipea(PolyObject *base, byte **pt, ScanAddress *process, bool lea)
{
unsigned int modrm = *((*pt)++);
unsigned int md = modrm >> 6;
unsigned int rm = modrm & 7;
if (md == 3) { } /* Register. */
else if (rm == 4)
{
/* s-i-b present. */
unsigned int sib = *((*pt)++);
if (md == 0)
{
if ((sib & 7) == 5)
{
if (! lea) {
#ifndef HOSTARCHITECTURE_X86_64
process->ScanConstant(base, *pt, PROCESS_RELOC_DIRECT);
#endif /* HOSTARCHITECTURE_X86_64 */
}
(*pt) += 4;
}
}
else if (md == 1) (*pt)++;
else if (md == 2) (*pt) += 4;
}
else if (md == 0 && rm == 5)
{
if (!lea) {
#ifndef HOSTARCHITECTURE_X86_64
/* Absolute address. */
process->ScanConstant(base, *pt, PROCESS_RELOC_DIRECT);
#endif /* HOSTARCHITECTURE_X86_64 */
}
*pt += 4;
}
else
{
if (md == 1) *pt += 1;
else if (md == 2) *pt += 4;
}
}
/* Added to deal with constants within the
code rather than in the constant area. The constant
area is still needed for the function name.
DCJM 2/1/2001
*/
void X86Dependent::ScanConstantsWithinCode(PolyObject *addr, PolyObject *old, POLYUNSIGNED length, ScanAddress *process)
{
byte *pt = (byte*)addr;
PolyWord *end = addr->Offset(length - 1);
while (true)
{
// Escape prefixes come before any Rex byte
if (*pt == 0xf2 || *pt == 0xf3 || *pt == 0x66)
pt++;
#ifdef HOSTARCHITECTURE_X86_64
// REX prefixes. Set this first.
byte lastRex;
if (*pt >= 0x40 && *pt <= 0x4f)
lastRex = *pt++;
else
lastRex = 0;
//printf("pt=%p *pt=%x\n", pt, *pt);
#endif /* HOSTARCHITECTURE_X86_64 */
switch (*pt)
{
case 0x00: return; // This is actually the first byte of the old "marker" word.
case 0xf4: return; // Halt - now used as a marker.
case 0x50: case 0x51: case 0x52: case 0x53:
case 0x54: case 0x55: case 0x56: case 0x57: /* Push */
case 0x58: case 0x59: case 0x5a: case 0x5b:
case 0x5c: case 0x5d: case 0x5e: case 0x5f: /* Pop */
case 0x90: /* nop */ case 0xc3: /* ret */
case 0xf9: /* stc */ case 0xce: /* into */
case 0xf0: /* lock. */ case 0xf3: /* rep/repe */
case 0xa4: case 0xa5: case 0xaa: case 0xab: /* movs/stos */
case 0xa6: /* cmpsb */ case 0x9e: /* sahf */ case 0x99: /* cqo/cdq */
pt++; break;
case 0x70: case 0x71: case 0x72: case 0x73: case 0x74: case 0x75: case 0x76: case 0x77:
case 0x78: case 0x79: case 0x7a: case 0x7b: case 0x7c: case 0x7d: case 0x7e: case 0x7f:
case 0xeb:
/* short jumps. */
case 0xcd: /* INT - now used for a register mask */
case 0xa8: /* TEST_ACC8 */
case 0x6a: /* PUSH_8 */
pt += 2; break;
case 0xc2: /* RET_16 */
case 0xca: /* FAR RET 16 - used for a register mask */
pt += 3; break;
case 0x8d: /* leal. */
pt++; skipea(addr, &pt, process, true); break;
case 0x03: case 0x0b: case 0x13: case 0x1b:
case 0x23: case 0x2b: case 0x33: case 0x3b: /* Add r,ea etc. */
case 0x88: /* MOVB_R_A */ case 0x89: /* MOVL_R_A */
case 0x8b: /* MOVL_A_R */
case 0x62: /* BOUNDL */
case 0xff: /* Group5 */
case 0xd1: /* Group2_1_A */
case 0x8f: /* POP_A */
case 0xd3: /* Group2_CL_A */
case 0x87: // XCHNG
pt++; skipea(addr, &pt, process, false); break;
case 0xf6: /* Group3_a */
{
int isTest = 0;
pt++;
/* The test instruction has an immediate operand. */
if ((*pt & 0x38) == 0) isTest = 1;
skipea(addr, &pt, process, false);
if (isTest) pt++;
break;
}
case 0xf7: /* Group3_A */
{
int isTest = 0;
pt++;
/* The test instruction has an immediate operand. */
if ((*pt & 0x38) == 0) isTest = 1;
skipea(addr, &pt, process, false);
if (isTest) pt += 4;
break;
}
case 0xc1: /* Group2_8_A */
case 0xc6: /* MOVB_8_A */
case 0x83: /* Group1_8_A */
case 0x80: /* Group1_8_a */
case 0x6b: // IMUL Ev,Ib
pt++; skipea(addr, &pt, process, false); pt++; break;
case 0x69: // IMUL Ev,Iv
pt++; skipea(addr, &pt, process, false); pt += 4; break;
case 0x81: /* Group1_32_A */
{
pt ++;
#ifndef HOSTARCHITECTURE_X86_64
unsigned opCode = *pt;
#endif /* HOSTARCHITECTURE_X86_64 */
skipea(addr, &pt, process, false);
// Only check the 32 bit constant if this is a comparison.
// For other operations this may be untagged and shouldn't be an address.
#ifndef HOSTARCHITECTURE_X86_64
if ((opCode & 0x38) == 0x38)
process->ScanConstant(addr, pt, PROCESS_RELOC_DIRECT);
#endif /* HOSTARCHITECTURE_X86_64 */
pt += 4;
break;
}
case 0xe8: case 0xe9:
// Long jump and call. These are used to call constant (known) functions
// and also long jumps within the function.
{
pt++;
POLYSIGNED disp = (pt[3] & 0x80) ? -1 : 0; // Set the sign just in case.
for(unsigned i = 4; i > 0; i--)
disp = (disp << 8) | pt[i-1];
byte *absAddr = pt + disp + 4; // The address is relative to AFTER the constant
// If the new address is within the current piece of code we don't do anything
if (absAddr >= (byte*)addr && absAddr < (byte*)end) {}
else {
#ifdef HOSTARCHITECTURE_X86_64
ASSERT(sizeof(PolyWord) == 4); // Should only be used internally on x64
#endif /* HOSTARCHITECTURE_X86_64 */
if (addr != old)
{
// The old value of the displacement was relative to the old address before
// we copied this code segment.
// We have to correct it back to the original address.
absAddr = absAddr - (byte*)addr + (byte*)old;
// We have to correct the displacement for the new location and store
// that away before we call ScanConstant.
POLYSIGNED newDisp = absAddr - pt - 4;
for (unsigned i = 0; i < 4; i++)
{
pt[i] = (byte)(newDisp & 0xff);
newDisp >>= 8;
}
}
process->ScanConstant(addr, pt, PROCESS_RELOC_I386RELATIVE);
}
pt += 4;
break;
}
case 0xc7:/* MOVL_32_A */
{
pt++;
if ((*pt & 0xc0) == 0x40 /* Byte offset or sib present */ &&
((*pt & 7) != 4) /* But not sib present */ && pt[1] == 256-sizeof(PolyWord))
{
/* We may use a move instruction to set the length
word on a new segment. We mustn't try to treat this as a constant. */
pt += 6; /* Skip the modrm byte, the offset and the constant. */
}
else
{
skipea(addr, &pt, process, false);
#ifndef HOSTARCHITECTURE_X86_64
process->ScanConstant(addr, pt, PROCESS_RELOC_DIRECT);
#endif /* HOSTARCHITECTURE_X86_64 */
pt += 4;
}
break;
}
case 0xb8: case 0xb9: case 0xba: case 0xbb:
case 0xbc: case 0xbd: case 0xbe: case 0xbf: /* MOVL_32_64_R */
pt ++;
#ifdef HOSTARCHITECTURE_X86_64
if ((lastRex & 8) == 0)
pt += 4; // 32-bit mode on 64-bits. Can this occur?
else
#endif /* HOSTARCHITECTURE_X86_64 */
{
// 32 bits in 32-bit mode, 64-bits in 64-bit mode.
process->ScanConstant(addr, pt, PROCESS_RELOC_DIRECT);
pt += sizeof(PolyWord);
}
break;
case 0x68: /* PUSH_32 */
pt ++;
#ifndef HOSTARCHITECTURE_X86_64
process->ScanConstant(addr, pt, PROCESS_RELOC_DIRECT);
#endif /* HOSTARCHITECTURE_X86_64 */
pt += 4;
break;
case 0x0f: /* ESCAPE */
{
pt++;
switch (*pt)
{
case 0xb6: /* movzl */
case 0xb7: // movzw
case 0xc1: /* xaddl */
case 0xaf: // imul
pt++; skipea(addr, &pt, process, false); break;
case 0x80: case 0x81: case 0x82: case 0x83:
case 0x84: case 0x85: case 0x86: case 0x87:
case 0x88: case 0x89: case 0x8a: case 0x8b:
case 0x8c: case 0x8d: case 0x8e: case 0x8f:
/* Conditional branches with 32-bit displacement. */
pt += 5; break;
// These are SSE2 instructions
case 0x10: case 0x11: case 0x58: case 0x5c: case 0x59: case 0x5e:
case 0x2e: case 0x2a: case 0x54: case 0x57: case 0x5a:
pt++; skipea(addr, &pt, process, false); break;
default: Crash("Unknown opcode %d at %p\n", *pt, pt);
}
break;
}
case 0xd8: case 0xd9: case 0xda: case 0xdb:
case 0xdc: case 0xdd: case 0xde: case 0xdf: // Floating point escape instructions
{
pt++;
if ((*pt & 0xe0) == 0xe0) pt++;
else skipea(addr, &pt, process, false);
break;
}
default: Crash("Unknown opcode %d at %p\n", *pt, pt);
}
}
}
// Increment the value contained in the first word of the mutex.
Handle X86TaskData::AtomicIncrement(Handle mutexp)
{
PolyObject *p = DEREFHANDLE(mutexp);
POLYUNSIGNED result = X86AsmAtomicIncrement(p);
return this->saveVec.push(PolyWord::FromUnsigned(result));
}
// Release a mutex. Because the atomic increment and decrement
// use the hardware LOCK prefix we can simply set this to one.
void X86TaskData::AtomicReset(Handle mutexp)
{
DEREFHANDLE(mutexp)->Set(0, TAGGED(1));
}
static X86Dependent x86Dependent;
MachineDependent *machineDependent = &x86Dependent;
struct _entrypts machineSpecificEPT[] =
{
{ "PolySetCodeConstant", (polyRTSFunction)&PolySetCodeConstant},
{ NULL, NULL} // End of list.
};
|