1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720
|
.. _pgscript:
**********************************************
`pgScript Scripting Language Reference`:index:
**********************************************
.. _pgscript-overview:
Overview
========
pgScript is composed of pgScript commands::
pgScript command
: Regular PostgreSQL SQL Command (SELECT INSERT CREATE ...)
| Variable declaration or assignment (DECLARE SET)
| Control-of-flow structure (IF WHILE)
| Procedure (ASSERT PRINT LOG RMLINE)
Command names (**SELECT**, **IF**, **SET**, ...) are case-insensitive
and must be ended with a semi-column ``;``. Identifiers are case-sensitive.
.. _examples:
Examples
========
.. _example1:
Batch table creations
---------------------
::
DECLARE @I, @T; -- Variable names begin with a @
SET @I = 0; -- @I is an integer
WHILE @I < 20
BEGIN
SET @T = 'table' + CAST (@I AS STRING); -- Casts @I
CREATE TABLE @T (id integer primary key, data text);
SET @I = @I + 1;
END
.. _example2:
Insert random data
------------------
::
DECLARE @I, @J, @T, @G;
SET @I = 0;
SET @G1 = INTEGER(10, 29, 1); /* Random integer generator
Unique numbers between 10 and 29 */
SET @G2 = STRING(10, 20, 3); /* Random string generator
3 words between 10 and 20 characters */
WHILE @I < 20
BEGIN
SET @T = 'table' + CAST (@I AS STRING);
SET @J = 0;
WHILE @J < 20
BEGIN
INSERT INTO @T VALUES (@G1, '@G2');
SET @J = @J + 1;
END
SET @I = @I + 1;
END
.. _example3:
Batch table deletions
---------------------
::
DECLARE @I, @T; -- Declaring is optional
SET @I = 0;
WHILE 1 -- Always true
BEGIN
IF @I >= 20
BREAK; -- Exit the loop if @I > 20
SET @T = 'table' + CAST (@I AS STRING);
DROP TABLE @T;
SET @I = @I + 1;
END
.. _example4:
Print information on screen
---------------------------
::
SET @PROGR@M#TITLE = 'pgScript';
PRINT '';
PRINT @PROGR@M#TITLE + ' features:';
PRINT '';
PRINT ' * Regular PostgreSQL commands';
PRINT ' * Control-of-flow language';
PRINT ' * Local variables';
PRINT ' * Random data generators';</pre>
.. _commands:
SQL Commands
============
You can run ANY PostgreSQL query from a pgScript EXCEPT those ones::
BEGIN;
END;
This is because **BEGIN** and **END** are used for delimiting blocks.
Instead use::
BEGIN TRANSACTION;
END TRANSACTION;
For a list of PostgreSQL commands: `http://www.postgresql.org/docs/8.3/interactive/sql-commands.html <http://www.postgresql.org/docs/8.3/interactive/sql-commands.html>`_
.. _variables:
Variables
=========
There are two main types of variables : simple variables and records
(result sets composed of lines and columns).
Variable names begin with a ``@`` and can be composed of
letters, digits, ``_``, ``#``, ``@``.
Variable type is guessed automatically according to the kind of value it
contains. This can be one of: number (real or integer), string, record.
.. _variable1:
Simple variables
----------------
Simple variable declaration
***************************
Declaring simple variable is optional::
DECLARE @A, @B;
DECLARE @VAR1;
Simple variable affectation
***************************
This is done with the **SET** command. The variable type depends on the
value assigned to this variable::
SET @A = 1000, @B = 2000; -- @A and @B are <strong>integer numbers**
SET @C = 10e1, @D = 1.5; -- @C and @D are <strong>real numbers**
SET @E = 'ab', @F = 'a''b'; -- @E and @F are <strong>strings**
SET @G = "ab", @H = "a\"b"; -- @G and @H are <strong>strings**
An uninitialized variable defaults to an empty string. It is possible to
override variables as many times as wanted::
PRINT @A; -- Prints an empty string
SET @A = 1000; -- @A is initialized an integer
PRINT @A; -- Prints 1000
SET @A = 'ab'; -- @A becomes a string
PRINT @A; -- Prints ab
Data generators
***************
Data generators allows users to generate random values. There are
various types of generators, each one producing different type of data.
A variable initialized with a data generator behaves like a regular
simple variable except that it has a different value each time it is
used::
SET @A = INTEGER(100, 200);
PRINT @A; -- Prints an integer between 100 and 200
PRINT @A; -- Prints another integer between 100 and 200
A variable can contain a generator but its type is one of: number (real
or integer), string. For a list of available generators and their
associated type, see `generators`_.
.. _variable2:
Records
-------
Record declaration
******************
Declaring a record is **required**. A name for each column must be
specified even if they will not be used anymore afterwards::
DECLARE @R1 { @A, @B }, @R2 { @A, @C }; -- Two records with two columns
DECLARE @R3 { @A, @B, @C, @D }; -- One record with four columns
The number of lines is dynamic: see the next section.
Record affectation
******************
To access a specific location in a record, one must use the line number
(starts at 0) and can use either the column name (between quotes) or the
column number (starts at 0). This specific location behaves like a
simple variable. Note that a record cannot contain a record::
SET @R1[0]['@A'] = 1; -- First line & first column
SET @R1[0][0] = 1; -- Same location
SET @R1[4]['@B'] = 1; -- Fifth line & second column
SET @R1[0][1] = 1; -- Same location
In the above example, three empty lines are automatically inserted
between the first and the fifth. Using an invalid column number or name
results in an exception.
Specific location can be used as right values as well. A specific line
can also be used as right value::
SET @R1[0][0] = @R3[0][1], @A = @R2[0][0]; -- Behaves like simple variables
SET @A = @R1[1]; -- @A becomes a record which is the first line of @R1
Remember that ``SET @R1[0][0] = @R2`` is impossible because a record
cannot contain a record.
It is possible to assign a record to a variable, in this case the
variable does not need to be declared::
SET @A = @R3; -- @A becomes a record because it is assigned a record
SQL queries
***********
Any SQL query executed returns a record. If the query is a ``SELECT``
query then it returns the results of the query. If it is something else
then it returns a one-line record (``true``) if this is a success
otherwise a zero-line record (``false``)::
SET @A = SELECT * FROM table; -- @A is a record with the results of the query
SET @B = INSERT INTO table ...; -- @B is a one-line record if the query succeeds
Record functions
****************
See `function2`_.
.. _variable3:
Cast
----
It is possible to convert a variable from one type to another with the
cast function::
SET @A = CAST (@B AS STRING);
SET @A = CAST (@B AS REAL);
SET @A = CAST (@B AS INTEGER);
SET @A = CAST (@B AS RECORD);
When a record is converted to a string, it is converted to its flat
representation. When converted to a number, the record is first converted
to a string and then to a number (see string conversion for more
details).
When a number is converted to a string, it is converted to its string
representation. When converted to a record, it is converted to a
one-line-one-column record whose value is the number.
When a string is converted to a number, if the string represents a
number then this number is returned else an exception is thrown. When
converted to a record, either the program can find a **record pattern**
in the string or it converts it to a one-line-one-column record whose
value is the string. A record pattern is::
SET @B = '(1, "abc", "ab\\"")(1, "abc", "ab\\"")'; -- @B is a string
SET @B = CAST (@B AS RECORD); @B becomes a two-line-three-column record
Remember a string is surrounded by simple quotes. Strings composing a
record must be surrounded by double quotes which are escaped with ``\\``
(we double the slash because it is already a special character for the
enclosing simple quotes).
.. _variable4:
Operations
----------
Operations can only be performed between operands of the same type. Cast
values in order to conform to this criterion.
Comparisons result in a number which is 0 or 1.
Strings
*******
Comparisons: ``= <> > < <= >= AND OR``
Concatenation: ``+``
::
SET @B = @A + 'abcdef'; -- @A must be a string and @B will be a string
Boolean value: non-empty string is ``true``, empty string is ``false``
Inverse boolean value: ``NOT``
Case-insensitive comparison: ``~=``
Numbers
*******
Comparisons: ``= <> > < <= >= AND OR``
Arithmetic: ``+ - * / %``
::
SET @A = CAST ('10' AS INTEGER) + 5; -- '10' string is converted to a number
Boolean value: 0 is ``false``, anything else is ``true``
Inverse boolean value: ``NOT`` (note that ``NOT NOT 10 = 1``)
An arithmetic operation involving at least one real number gives a real
number as a result::
SET @A = 10 / 4.; -- 4. is a real so real division: @A = 2.5
SET @A = 10 / 4; -- 4 is an integer so integer division: @A = 2
Records
*******
Comparisons: ``= <> > < <= >= AND OR``
Boolean value: zero-line record is ``false``, anything else is ``true``
Inverse boolean value: ``NOT``
Comparisons for records are about inclusion and exclusion. Order of
lines does not matter. ``<=`` means that each row in the left operand
has a match in the right operand. ``>=`` means the opposite. ``=`` means
that ``<=`` and ``>=`` are both true at the same time...
Comparisons are performed on strings: even if a record contains numbers
like ``10`` and ``1e1`` we will have ``'10' <> '1e1'``.
.. _control:
Control-of-flow structures
==========================
.. _control1:
Conditional structure
---------------------
::
IF condition
BEGIN
pgScript commands
END
ELSE
BEGIN
pgScript commands
END
pgScript commands are optional. **BEGIN** and **END** keywords are
optional if there is only one pgScript command.
.. _control2:
Loop structure
--------------
::
WHILE condition
BEGIN
pgScript commands
END
pgScript commands are optional. **BEGIN** and **END** keywords are
optional if there is only one pgScript command.
**BREAK** ends the enclosing **WHILE** loop, while **CONTINUE** causes
the next iteration of the loop to execute. **RETURN** behaves like
**BREAK**::
WHILE condition1
BEGIN
IF condition2
BEGIN
BREAK;
END
END
.. _control3:
Conditions
----------
Conditions are in fact results of operations. For example the string
comparison ``'ab' = 'ac'`` will result in a number which is ``false``
(the equality is not true)::
IF 'ab' ~= 'AB' -- Case-insensitive comparison which result in 1 (true) which is true
BEGIN
-- This happens
END
IF 0 -- false
BEGIN
-- This does **not** happen
END
ELSE
BEGIN
-- This happens
END
WHILE 1
BEGIN
-- Infinite loop: use BREAK for exiting
END
It is possible to the result of a SQL SELECT query directly as a
condition. The query needs to be surrounded by parenthesis::
IF (SELECT 1 FROM table)
BEGIN
-- This means that table exists otherwise the condition would be false
END
.. _functions:
Additional functions and procedures
===================================
.. _function1:
Procedures
----------
Procedures do not return a result. They must be used alone on a line and
cannot be assigned to a variable.
Print
*****
Prints an expression on the screen::
PRINT 'The value of @A is' + CAST (@A AS STRING);
Assert
******
Throws an exception if the expression evaluated is false::
ASSERT 5 > 3 AND 'a' = 'a';
Remove line
***********
Removes the specified line of a record::
RMLINE(@R[1]); -- Removes @R second line
.. _function2:
Functions
---------
Functions do return a result. Their return value can be assigned to a
variable, like the ``CAST`` operation.
Trim
****
Removes extra spaces surrounding a string::
SET @A = TRIM(' a '); -- @A = 'a'</pre>
Lines
*****
Gives the number of lines in a record::
IF LINES(@R) > 0
BEGIN
-- Process
END
Columns
*******
Gives the number of columns in a record::
IF COLUMNS(@R) > 0
BEGIN
-- Process
END
.. _generators:
Random data generators
======================
.. _generator1:
Overview of the generators
--------------------------
One can assign a variable (**SET**) with a random data generators. This
means each time the variable will be used it will have a different
value.
However the variable is still used as usual::
SET @G = STRING(10, 20, 2);
SET @A = @G; -- @A will hold a random string
SET @B = @G; -- @B will hold another random string
PRINT @G, -- This will print another third random string
.. _generator2:
Sequence and seeding
--------------------
Common parameters for data generators are *sequence* and *seed*.
*sequence* means that a sequence of values is generated in a random
order, in other words each value appears only once before the sequence
starts again: this is useful for columns with a ``UNIQUE`` constraint.
For example, this generator::
SET @G = INTEGER(10, 15, 1); -- 1 means generate a sequence
It can generate such values::
14 12 10 13 11 15 14 12 10 13 11
Where each number appears once before the sequence starts repeating.
*sequence* parameter must be an integer: if it is 0 then no sequence is
generated (default) and if something other than 0 then generate a
sequence.
*seed* is an integer value for initializing a generator: two generators
with the same parameters and the same seed will generate **exactly** the
same values.
*seed* must be an integer: it is used directly to initialize the random
data generator.
.. _generator3:
Data generators
---------------
Optional parameters are put into brackets::
Generator
: INTEGER ( min, max, [sequence], [seed] );
| REAL ( min, max, precision, [sequence], [seed] );
| DATE ( min, max, [sequence], [seed] );
| TIME ( min, max, [sequence], [seed] );
| DATETIME ( min, max, [sequence], [seed] );
| STRING ( min, max, [nb], [seed] );
| REGEX ( regex, [seed] );
| FILE ( path, [sequence], [seed], [encoding] );
| REFERENCE ( table, column, [sequence], [seed] );
Integer numbers
***************
::
INTEGER ( min, max, [sequence], [seed] );
INTEGER ( -10, 10, 1, 123456 );
``min`` is an integer, ``max`` is an integer, ``sequence`` is an integer
and ``seed`` is an integer.
Real numbers
************
::
REAL ( min, max, precision, [sequence], [seed] );
REAL ( 1.5, 1.8, 2, 1 );
``min`` is a number, ``max`` is a number, ``precision`` is an integer
that indicates the number of decimals (should be less than 30),
``sequence`` is an integer and ``seed`` is an integer.
Dates
*****
::
DATE ( min, max, [sequence], [seed] );
DATE ( '2008-05-01', '2008-05-05', 0 );
``min`` is a string representing a date, ``max`` is a string representing
a date, ``sequence`` is an integer and ``seed`` is an integer.
Times
*****
::
TIME ( min, max, [sequence], [seed] );
TIME ( '00:30:00', '00:30:15', 0 );
``min`` is a string representing a time, ``max`` is a string representing
a time, ``sequence`` is an integer and ``seed`` is an integer.
Timestamps (date/times)
***********************
::
DATETIME ( min, max, [sequence], [seed] );
DATETIME ( '2008-05-01 14:00:00', '2008-05-05 15:00:00', 1 );
``min`` is a string representing a timestamp, ``max`` is a string
representing a timestamp, ``sequence`` is an integer and ``seed`` is an
integer.
Strings
*******
::
STRING ( min, max, [nb], [seed] );
STRING ( 10, 20, 5 );
``min`` is an integer representing the minimum length of a word, ``max``
is an integer representing the maximum length of a word, ``nb`` is an
integer representing the number of words (default: ``1``) and ``seed``
is an integer.
In the above example we generate 5 words (separated with a space) whose
size is between 10 and 20 characters.
Strings from regular expressions
********************************
::
REGEX ( regex, [seed] );
REGEX ( '[a-z]{1,3}@[0-9]{3}' );
``regex`` is a string representing a simplified regular expressions and
``seed`` is an integer.
Simplified regular expressions are composed of:
* Sets of possible characters like ``[a-z_.]`` for characters between
``a`` and ``z`` + ``_`` and ``.``
* Single characters
It is possible to specify the minimum and maximum length of the
preceding set or single character:
* ``{min, max}`` like ``{1,3}`` which stands for length between ``1``
and ``3``
* ``{min}`` like ``{3}`` which stands for length of ``3``
* Default (when nothing is specified) is length of ``1``
Note: be careful with spaces because ``'a {3}'`` means one ``a`` followed
by three spaces because the ``3`` is about the last character or set of
characters which is a space in this example.
If you need to use ``[`` ``]`` ``\`` ``{`` or ``}``, they must be escaped
because they are special characters. Remember to use **double
backslash**: ``'\\[{3}'`` for three ``[``.
Strings from dictionary files
*****************************
::
FILE ( path, [sequence], [seed], [encoding] );
FILE ( 'file.txt', 0, 54321, 'utf-8' );
``path`` is a string representing the path to a text file, ``sequence``
is an integer, ``seed`` is an integer and ``encoding`` is a string
representing the file character set (default is system encoding).
This generates a random integer between 1 and the number of lines in the
file and then returns that line. If the file does not exist then an
exception is thrown.
``encoding`` supports the most known encoding like utf-8, utf-16le,
utf-16be, iso-8859-1, ...
Reference to another field
**************************
::
REFERENCE ( table, column, [sequence], [seed] );
REFERENCE ( 'tab', 'col', 1 );
``table`` is a string representing a table, ``column`` is a string
representing a column of the table, ``sequence`` is an integer and
``seed`` is an integer.
This is useful for generating data to put into foreign-key-constrained
columns.
|