#!@path_perl@ -w
# tarcust -- A Tar Customizer
# Copyright (C) 1999-2001
# Denis Barbier <barbier@debian.org>
#
# New versions of this program are on the tarcust home page
# http://www.engelschall.com/sw/tarcust/
#
# This program is free software released under the GNU GPL License;
# see the LICENSE file for conditions.
require 5.000;
use strict;
use vars (qw($opt_a $opt_d $opt_e $opt_g $opt_h $opt_p $opt_u $opt_v
$opt_B $opt_D $opt_E $opt_G $opt_U $opt_V
@opt_s @opt_x
$filein $fileout $dir_mode $longnames $header $header_line
$total_size $new_values $contents $long_filename
%list_dirs
));
# The tar file format is quite simple. For each file, the archive
# contains a 512-byte header block holding all the information
# about this file, followed by the contents of the file. Null chars
# are then appended so that each file occupies a multiple of 512
# bytes.
# For entries with a null size (e.g. directories, links, devices,...),
# only the header is put in the archive.
# Default size of header line is 512.
my $blocksize = 512;
# Structure of the header
#
# offset length unpack pack octal
#
my $structure_header = {
name => [ 0, 100, "A100", "a100"],
mode => [ 100, 8, "A8", "a8", "%07o"],
uid => [ 108, 8, "A8", "a8", "%07o"],
gid => [ 116, 8, "A8", "a8", "%07o"],
size => [ 124, 12, "A12", "a12", "%011o"],
mtime => [ 136, 12, "A12", "a12", "%011o"],
chksum => [ 148, 8, "A8", "A8", "%06o\0 "],
typeflag => [ 156, 1, "A1", "a1"],
linkname => [ 157, 100, "A100", "a100"],
magic => [ 257, 6, "A6", "A6"],
version => [ 263, 2, "A2", "a2"],
uname => [ 265, 32, "A32", "a32"],
gname => [ 297, 32, "A32", "a32"],
devmajor => [ 329, 8, "a8", "a8", "%07o"],
devminor => [ 337, 8, "a8", "a8", "%07o"],
prefix => [ 345, 155, "A155", "a155"],
null => [ 500, 12, "x12", "x12"],
};
# Initializes the structure and returns a reference to it.
# Apart from the structure above, the header line is stored under the
# ``input'' key of the hash reference and the list of keys in the
# structure is stored under ``keys''
sub init_block {
my ($input) = shift;
my $block = {};
my @keys = ();
my $unpack_format = '';
my $pack_format = '';
# keys are sorted by offset; this order drives the pack/unpack formats
# and the user messages
foreach (sort { $structure_header->{$a}->[0] <=> $structure_header->{$b}->[0] } keys %$structure_header) {
$block->{$_} = {
'offset' => $structure_header->{$_}->[0],
'length' => $structure_header->{$_}->[1],
'unpack' => $structure_header->{$_}->[2],
'pack' => $structure_header->{$_}->[3],
'octal' => defined($structure_header->{$_}->[4]) ?
$structure_header->{$_}->[4] : "",
};
push(@keys, $_);
$unpack_format .= $block->{$_}->{unpack}." ";
$pack_format .= $block->{$_}->{pack}." ";
}
$block->{input} = $input;
$block->{keys} = \@keys;
$block->{unpack} = $unpack_format;
$block->{pack} = $pack_format;
my (@entries) = unpack $unpack_format, $input;
foreach (@keys) {
$block->{$_}->{value} = shift @entries || "";
$block->{$_}->{value} =~ s/\0+$//;
$block->{$_}->{value} = oct($block->{$_}->{value})
if $block->{$_}->{octal} and $block->{$_}->{value} ne "";
}
# Verify the checksum: it is computed with the chksum field filled
# with spaces
substr ($input, $block->{chksum}->{offset}, $block->{chksum}->{length})
= " " x $block->{chksum}->{length};
warn "Warning: checksum error with entry: $block->{name}->{value}\n"
if unpack ("%16C*", $input) != $block->{chksum}->{value}
and $block->{typeflag}->{value} !~ m/[KL]/;
# Fix some special cases
$block->{typeflag}->{value} = 0
if $block->{typeflag}->{value} eq "";
$block->{version}->{value} ||= " ";
if ($block->{name}->{value} eq "././\@LongLink") {
read($filein, $long_filename, $blocksize) == $blocksize
or die "error occurred when reading";
read($filein, $header_line, $blocksize) == $blocksize
or die "error occurred when reading";
msgdebug("Long filename: $long_filename");
$block = init_block($header_line);
$block->{name}->{value} = $long_filename;
$block->{realname} = $long_filename;
$long_filename = "";
}
return $block;
}
# Copy block
sub copy_block {
my $block = shift;
my $newblock = {};
foreach (@{$block->{keys}}) {
$newblock->{$_} = {};
foreach my $field (qw(offset length unpack pack octal value)) {
$newblock->{$_}->{$field} = $block->{$_}->{$field};
}
}
$newblock->{input} = $block->{input};
$newblock->{keys} = $block->{keys};
$newblock->{unpack} = $block->{unpack};
$newblock->{pack} = $block->{pack};
return $newblock;
}
# build the header line from the header structure
sub hash2string {
my ($block) = shift;
my ($name, $input, $chksum, $prefix);
$prefix = '';
$block->{realname} = $block->{name}->{value};
$block->{realname} =~ s/\0+$//;
# Fix file name
if (length($block->{name}->{value}) > $block->{name}->{length}) {
my $newblock = copy_block($block);
$block->{name}->{value} =~ s/\0+$//;
$newblock->{size}->{value} = length($block->{name}->{value})+1;
$newblock->{name}->{value} = "././\@LongLink";
$newblock->{mode}->{value} = 0;
$newblock->{uid}->{value} = 0;
$newblock->{gid}->{value} = 0;
$newblock->{mtime}->{value} = 0;
$newblock->{uname}->{value} = "root";
$newblock->{gname}->{value} = "root";
if ($block->{typeflag}->{value} == 0 ||
$block->{typeflag}->{value} == 5) {
$newblock->{typeflag}->{value} = "L";
} else {
die "Fatal error: unknown format found with long filenames\n";
}
$prefix = hash2string($newblock) .
pack("a$blocksize", $block->{name}->{value});
substr ($block->{name}->{value}, $block->{name}->{length}-1) = "";
$total_size += 2 * $blocksize;
}
$input = ' ' x $blocksize;
my @entries = ();
foreach (@{$block->{keys}}) {
if ($_ eq "chksum") {
push (@entries, "");
} else {
if ($block->{$_}->{octal} && $block->{$_}->{value} ne "") {
push (@entries, sprintf("$block->{$_}->{octal}",
$block->{$_}->{value}));
} else {
push (@entries, $block->{$_}->{value});
}
}
}
# calculate the checksum for this new string
$input = pack($block->{pack}, @entries);
$block->{chksum}->{value} = unpack("%16C*", $input);
# and store this checksum and the string
substr ($input, $block->{chksum}->{offset}, $block->{chksum}->{length})
= sprintf("$block->{chksum}->{octal}", $block->{chksum}->{value});
$block->{input} = $input;
return $prefix.$input;
}
# print the information stored in the header
sub print_header {
my ($block) = shift;
foreach (@{$block->{keys}}) {
msgdebug("$_: ".pack($block->{$_}->{pack},
$block->{$_}->{value}));
}
}
# read contents of the file. In the archive, a file fills
# entire blocks
sub read_contents {
my ($header) = shift;
my ($contents, $size);
$size = $header->{size}->{value};
if ($size % $blocksize != 0) {
$size += $blocksize - ($size % $blocksize);
}
read($filein, $contents, $size) == $size
|| die "error occurred when reading";
return $contents;
}
# adds directory entries to the archive when a file is inserted but
# its parent directories have not been written yet.
sub add_directory {
my $header = shift;
my ($output, $filename, $dir, $header_dir);
# copy the header to a new block
$header_dir = copy_block($header);
# change the type flag to tell this is a directory
$header_dir->{typeflag}->{value} = 5;
$header_dir->{size}->{value} = 0;
$header_dir->{mode}->{value} = $dir_mode;
$dir = $output = '';
$filename = $header->{realname};
$filename =~ s|/+[^/]*$||;
foreach (split('/', $filename)) {
$dir .= $_ . '/';
$header_dir->{name}->{value} = $dir;
if (not exists($list_dirs{$dir}) and
$dir ne $filename."/") {
$output .= hash2string($header_dir);
msgdebug("Adding directory $dir");
}
$list_dirs{$dir} = 1;
}
return $output;
}
sub msgdebug {
my ($string) = @_;
print STDERR "tarcust:debug: " . $string . "\n" if $opt_d == 1;
}
# Apply changes requested by command-line arguments
# Options are kept in a hash table.
sub change_values {
my ($values, $block) = @_;
return if $block->{name}->{value} !~ m{^$opt_a$};
if ($values->{uname} ne '') {
$block->{uname}->{value} = $values->{uname};
$block->{uid}->{value} = 0;
}
$block->{uid}->{value} = $values->{uid}
if $values->{uid} ne '';
if ($values->{gname} ne '') {
$block->{gname}->{value} = $values->{gname};
$block->{gid}->{value} = 0;
}
$block->{gid}->{value} = $values->{gid}
if $values->{gid} ne '';
}
sub msgverbose {
my ($string) = @_;
print STDERR "tarcust:verbose: " . $string . "\n" if $opt_v == 1;
}
sub showversion {
print STDERR <<EOT;
tarcust @version@
Copyright (C) 1999-2001 Denis Barbier <barbier\@engelschall.com>
This program is free software released under the GNU GPL License; see
the source for copying conditions. There is NO warranty; not even for
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
EOT
exit(0);
}
sub usage {
print STDERR <<EOT;
Usage: tarcust [options] < input.tar > output.tar
Options:
-h, --help show this help and exit
-V, --version show the version of tarcust and exit
-v, --verbose processes more verbosely
-D, --force-directory always create directory entry (deprecated,
this option is now always On)
-u, --user-name=NAME change owner of all files in the archive
--owner=NAME same as -u
-U, --user-number=NUMBER numerical id of user
-g, --group-name=NAME set the group name of all files in the archive
--group=NAME same as -g
-G, --group-number=NUMBER numerical id of group
-p, --prefix=STRING[:MODE] add a prefix directory name to every file
-s, --transform-names=EXPR apply substitutions on file and directory names
-x, --exclude=EXPR delete file from the archive
-a, --applies-to=EXPR apply -u, -U, -g and -G to specific files only
-e, --eval=EXPR perform any Perl action on attributes
-B, --begin-eval=EXPR Perl expression evalled before reading input
-E, --end-eval=EXPR Perl expression evalled after writing output
EOT
exit(1);
}
# parse arguments
$opt_h = $opt_v = $opt_d = $opt_V = 0;
$opt_g = $opt_G = $opt_u = $opt_U = $opt_p = $opt_e = $opt_B = $opt_E = '';
@opt_x = @opt_s = ();
$opt_a = '.*';
$opt_D = 1;
$longnames = {
user_name => 'u', owner => 'u',
group_name => 'g', group => 'g',
user_number => 'U', group_number => 'G',
exclude => 'x', transform_names => 's',
prefix => 'p', force_directory => 'D',
applies_to => 'a', eval => 'e',
begin_eval => 'B', end_eval => 'E',
before_eval => 'B', after_eval => 'E',
};
# I do not use getopt.pl or Getopt::Long because I want option
# names to be compatible with tar and tardy. For instance,
# -u <=> --owner <=> --user-name <=> --user_name
#
for (;;) {
my ($option, $original_option, $argument);
last if $#ARGV < 0;
last if $ARGV[0] eq '-' or $ARGV[0] !~ m/^-/;
if ($ARGV[0] eq '-h' || $ARGV[0] eq '-?' || $ARGV[0] eq '--help') {
$opt_h = 1;
last;
}
elsif ($ARGV[0] eq '-v' || $ARGV[0] =~ m/^--verb/ ) {
$opt_v = 1;
}
elsif ($ARGV[0] eq '-d' || $ARGV[0] =~ m/^--deb/ ) {
$opt_d = 1;
}
elsif ($ARGV[0] eq '-V' || $ARGV[0] =~ m/^--vers/ ) {
$opt_V = 1;
last;
}
elsif ($ARGV[0] eq '-D' || $ARGV[0] =~ m/^--force-dir/ ) {
$opt_D = 1;
}
elsif ($ARGV[0] =~ m/^-([gGuUxspaeBE])$/ ) {
$option = $1;
if ($#ARGV < 1) {
print STDERR "Error:tarcust: missing value to option: $ARGV[0]\n";
$opt_h = 1;
last;
}
shift(@ARGV);
if ($option =~ m/^[xs]$/) {
eval "push(\@opt_$option, \$ARGV[0])";
}
else {
eval "\$opt_$option = \$ARGV[0]";
}
}
elsif ($ARGV[0] =~ m/^--([^=]*)=(.*)$/) {
$option = $original_option = $1;
$argument = $2;
$option =~ s/[^a-zA-Z0-9]/_/g;
if (not defined($longnames->{$option})) {
print STDERR "Error:tarcust: unknown option: $original_option\n";
$opt_h = 1;
last;
}
if ($option =~ m/^(?:exclude|transform_names)$/) {
eval "push(\@opt_$longnames->{$option}, \$argument)";
}
else {
eval "\$opt_$longnames->{$option} = \$argument";
}
}
else {
print STDERR "Error:tarcust: unknown option: $ARGV[0]\n";
$opt_h = 1;
last;
}
shift(@ARGV);
}
usage() if $opt_h == 1;
showversion() if $opt_V == 1;
$filein = *STDIN{IO};
$fileout = *STDOUT{IO};
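if ($#ARGV == 0) {
if ($ARGV[0] ne '-') {
open($filein, "< $ARGV[0]")
or die "Error:tarcust: cannot open $ARGV[0] for reading: $!\n";
}
}
elsif ($#ARGV == 1) {
if ($ARGV[0] ne '-') {
open($filein, "< $ARGV[0]")
or die "Error:tarcust: cannot open $ARGV[0] for reading: $!\n";
}
if ($ARGV[1] ne '-') {
open($fileout, "> $ARGV[1]")
or die "Error:tarcust: cannot open $ARGV[1] for writing: $!\n";
}
}
elsif ($#ARGV > 1) {
# more than two file names is an error
usage();
}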
# keep these values in a hash table
$new_values = {
uname => $opt_u,
uid => $opt_U,
gname => $opt_g,
gid => $opt_G,
};
# --prefix is a synonym for --transform-names=s,^,PATH/,
# A directory will be inserted at the beginning of the archive,
# its mode is $dir_mode
$dir_mode = 040755;
if ($opt_p =~ m|^(.*):([0-9]+)$|) {
$opt_p = $1;
$dir_mode = $2;
$dir_mode += 40000 if $dir_mode =~ m|^\d{1,4}$|;
$dir_mode = oct($dir_mode);
}
push(@opt_s, "s,^,$opt_p/,") if $opt_p ne '';
# Trap syntax error in -s flag
my $eval_opt_s = '1';
foreach (@opt_s) {
$eval_opt_s .= "; $_";
}
my $sub_opt_s = eval "sub { my \$ref = shift; \$_ = \$\$ref; $eval_opt_s; die \$@ if \$@; \$\$ref = \$_; return 0}";
die "Invalid -s option" if $@;
# Build a subroutine to remove files specified with -x flag
my $eval_opt_x = '';
foreach (@opt_x) {
$eval_opt_x .= "return 1 if m|^$_\$|;";
}
my $sub_opt_x = eval "sub {\$_ = shift; $eval_opt_x; return 0}";
die "Invalid -x option" if $@;
# main loop
$total_size = 0;
if ($opt_B) {
eval "{$opt_B}; 1" or die "Expression \`".$opt_B."' invalid in -B flag:\n$@Exiting.\n";
}
while ( read($filein, $header_line, $blocksize) == $blocksize) {
if ($opt_p ne '') {
# A prefix is required, so a directory must be added in
# the archive.
# Do not forget to apply changes
$header = init_block($header_line);
change_values($new_values, $header);
my $dirs = '';
foreach (split('/', $opt_p)) {
$dirs .= $_ . "/";
if (not exists($list_dirs{$dirs})) {
# typeflag=5 for a directory
$header->{typeflag}->{value} = 5;
$header->{name}->{value} = $dirs;
$header->{size}->{value} = 0;
$header->{mode}->{value} = $dir_mode;
print_header($header);
print $fileout hash2string($header);
$list_dirs{$header->{realname}} = 1;
$total_size += $blocksize;
msgverbose(" Write: $header->{'name'}->{'value'}");
}
}
# clear $opt_p so that this test is false next time
$opt_p = '';
}
last if $header_line eq "\0" x $blocksize;
$header = init_block($header_line);
last if $header->{name}->{value} eq '';
if ($header->{typeflag}->{value} !~ m/^[0-6]?$/) {
# this entry is not a standard file type; copy it unchanged
$contents = read_contents($header);
print STDERR "Warning:tarcust: Don't know how to handle GNU extensions\n";
print STDERR " $header->{'name'}->{'value'} unchanged\n";
print $fileout hash2string($header);
print $fileout $contents;
# count these blocks too, so that the final padding is computed correctly
$total_size += $blocksize + length($contents);
next;
}
msgverbose("File " . $header->{name}->{value});
print_header($header);
change_values($new_values, $header);
$contents = read_contents($header);
# check if this file has been excluded by the -x flag
if (&$sub_opt_x($header->{'name'}->{'value'})) {
msgverbose(" Skipped");
next;
}
&$sub_opt_s(\$header->{'name'}->{'value'});
if ($opt_e) {
my %F = ();
foreach (keys %$structure_header) {
$F{$_} = $header->{$_}->{value};
}
eval "{$opt_e}; 1" or die "Expression \`".$opt_e."' invalid in -e flag:\n$@Exiting.\n";
foreach (keys %$structure_header) {
$header->{$_}->{value} = $F{$_};
}
}
print_header($header);
$header->{input} = hash2string($header);
$header_line = add_directory($header);
print $fileout $header_line if $header_line ne '';
print $fileout $header->{input};
print $fileout $contents;
$list_dirs{$header->{realname}} = 1
if $header->{typeflag}->{value} == 5;
$total_size += $blocksize + length($contents);
msgverbose(" Write: $header->{'name'}->{'value'}");
}
# Flushes input to avoid the ``broken pipe'' message
undef $/;
$_ = <$filein>;
# And writes null chars.
print $fileout "\0" x ($blocksize);
$total_size += $blocksize;
# In GNU tar, the total size is a multiple of 20 blocks of 512 bytes
if ($total_size % (20*$blocksize) != 0 ) {
print $fileout "\0" x (20*$blocksize - ($total_size % (20*$blocksize)));
}
if ($opt_E) {
eval "{$opt_E}; 1" or die "Expression \`".$opt_E."' invalid in -E flag:\n$@Exiting.\n";
}
exit(0);
##EOF##
__END__
=head1 NAME
tarcust -- a tar customizer
=head1 SYNOPSIS
B<tarcust>
[B<-h>, B<--help>]
[B<-V>, B<--version>]
[B<-v>, B<--verbose>]
[B<-D>, B<--force-directory>]
[B<-u>, B<--owner=NAME>, B<--user-name=NAME>]
[B<-U>, B<--user-number=NUMBER>]
[B<-g>, B<--group=NAME>, B<--group-name=NAME>]
[B<-G>, B<--group-number=NUMBER>]
[B<-p>, B<--prefix=DIR[:MODE]>]
[B<-s>, B<--transform-names=EXPR>]
[B<-x>, B<--exclude=EXPR>]
[B<-a>, B<--applies-to=EXPR>]
[B<-e>, B<--eval=EXPR>]
[B<-B>, B<--begin-eval=EXPR>]
[B<-E>, B<--end-eval=EXPR>]
[I<input.tar> [I<output.tar>]]
=head1 DESCRIPTION
The B<tarcust> program was written to help developers build
distributions of their programs. The GNU B<automake> tool provides
an easy way: just type `C<make dist>' and your package is ready to
distribute. But if for some reason you do not use this tool, building
a distribution may be a pain.
The B<tarcust> script is designed to provide a portable way to
implement useful features missing from standard UNIX B<tar>, such as
deleting files, changing the owner or group of files, and renaming files.
GNU B<tar> has such features, so B<tarcust> may seem useless in that
case, but remember that it is designed to work on any UNIX system.
=head1 OPTIONS
=over 4
=item B<-h>, B<-?>, B<--help>
Prints a help message and exits.
=item B<-V>, B<--version>
Prints the version number and exits.
=item B<-v>, B<--verbose>
Processes verbosely.
=item B<-D>, B<--force-directory>
With this option, before inserting a file into the archive, B<tarcust>
checks whether the directory in which this file resides exists in the
archive and creates it if it does not. Uid and gid are those of the first
file found in this directory. Note that since 0.8.2, this option is
deprecated because it is always set.
=item B<-u>, B<--owner=NAME>, B<--user-name=NAME>
Sets the name of the owner of all files in the archive
=item B<-U>, B<--user-number=NUMBER>
Sets the numerical ID of the owner of all files
=item B<-g>, B<--group=NAME>, B<--group-name=NAME>
Sets the name of the group
=item B<-G>, B<--group-number=NUMBER>
Sets the numerical ID of the group
=item B<-p>, B<--prefix=DIR[:MODE]>
Inserts a prefix directory name so that all files are extracted under
the I<DIR> directory; by default, this new directory has its mode
set to 755. This can be changed with the optional B<MODE>
attribute (only numeric modes are supported).
=item B<-s>, B<--transform-names=EXPR>
Transforms file and directory names; I<EXPR> is a Perl command that
manipulates strings, see the examples below.
=item B<-x>, B<--exclude=EXPR>
Exclude files matching this Perl regular expression.
=item B<-a>, B<--applies-to=EXPR>
Apply B<-u>, B<-U>, B<-g> and B<-G> flags only to the files and
directories matching this pattern.
=item B<-e>, B<--eval=EXPR>
Apply any Perl statement to file and directory attributes. Each entry
of the archive has the following attributes (see the GNU tar info
documentation for a detailed description)
name mode uid gid
size mtime chksum typeflag
linkname magic version uname
gname devmajor devminor prefix
Each attribute is accessible via the C<%F> hash array, e.g. C<$F{name}>.
=item B<-B>, B<--begin-eval=EXPR>
The argument of this flag is a sequence of Perl statements which are
evaluated before the archive is read.
=item B<-E>, B<--end-eval=EXPR>
The argument of this flag is a sequence of Perl statements which are
evaluated after the archive is written.
=back
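As a rough illustration of the B<-e> flag described above, the following
standalone sketch builds a C<%F> hash by hand and applies a statement to
it. The entry values are made up for the example; in B<tarcust> they come
from the header of the entry being processed.

```perl
use strict;
use warnings;

# Hypothetical stand-in for one archive entry; tarcust fills %F from
# the tar header of the entry being processed.
my %F = (
    name     => 'bin/report.pl',
    mode     => oct('644'),
    typeflag => 0,
);

# The same kind of statement one could pass to -e:
# add the execute bits to files with a .pl suffix.
$F{name} =~ /\.pl$/ && ($F{mode} |= oct 111);

printf "%s mode=%o\n", $F{name}, $F{mode};   # prints: bin/report.pl mode=755
```

Changes made to C<%F> are copied back into the entry before it is
written, so the new mode ends up in the output archive.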
=head1 ARGUMENTS
You may specify input and output filenames on the command line. By default,
data is read from standard input and sent to standard output. As with
most Unix commands, a hyphen is equivalent to standard input or
standard output.
=head1 EXAMPLES
Prepend a prefix to each file name; the new directory has its mode
set to 775
cat MANIFEST | xargs tar cf - \
| tarcust --prefix=myprog-0.3:775 \
| gzip -c > myprog-0.3.tar.gz
This rule in your F<Makefile> will build a distribution without the need
for a F<MANIFEST> file, ignoring all CVS files
dist: distclean
find . -type f -a ! -name myprog-0.3.tar.gz \
| xargs tar cf - \
| tarcust -x '.*CVS.*' -x '.*\.cvsignore' \
| gzip -9 -c > myprog-0.3.tar.gz
Or with GNU tar
dist: distclean
find * -depth -print | sort \
| xargs tar --no-recursion -cf - \
| tarcust -x '.*CVS.*' -x '.*\.cvsignore' \
| gzip -9 -c > myprog-0.3.tar.gz
To satisfy your egocentric feelings, try this one
dist:
cat MANIFEST | xargs tar cf - \
| tarcust -u Denis -g Barbier | gzip -9 -c > myprog-0.3.tar.gz
Convert every filename to uppercase letters and strip any suffix
dist:
cat MANIFEST | xargs tar cf - \
| tarcust -s 'tr[a-z][A-Z]' -s 's,\.[^/]*$$,,' \
| gzip -9 -c > myprog-0.3.tar.gz
Make all files with a C<.pl> suffix executable:
cat MANIFEST | xargs tar cf - \
| tarcust -e '$F{name}=~/\.pl$/ && ($F{mode} |= oct 111)' \
| gzip -9 -c > myprog-0.3.tar.gz
Print how many directories have been written into the archive
cat MANIFEST | xargs tar cf - \
| tarcust -B '$::cnt=0' -e '$::cnt++ if $F{typeflag} == 5' \
-E 'printf STDERR "# directories: %d\n", $::cnt' \
| gzip -9 -c > myprog-0.3.tar.gz
=head1 NOTES
Unlike B<tar> and B<tardy>, B<tarcust> does not perform checks on user
and group information. And it never will, unless some kind soul explains to
me the benefits of taking care of this information when distributing a
tarball worldwide.
String manipulations may look strange at first glance. To understand
how they work, remember that:
=over 2
=item *
the B<tarcust> program is a simple Perl script, so it is natural to use
Perl instead of shell regular expressions;
=item *
it has been designed for developers, and not end-users, so this syntax
should be quite familiar.
=back
One point is worth mentioning: when excluding files, the regular
expression must match the whole file name; in other words, the regexp you
type is surrounded by C<^> and C<$>.
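A minimal sketch of this anchoring, assuming a hypothetical
C<is_excluded> helper that is not part of B<tarcust> (the script itself
builds an equivalent test with a string C<eval>, and does not group the
pattern as done here):

```perl
use strict;
use warnings;

# Hypothetical helper: returns 1 when $name matches one of the patterns
# in full, mimicking the ^...$ wrapping applied to each -x argument.
sub is_excluded {
    my ($name, @patterns) = @_;
    foreach my $pat (@patterns) {
        return 1 if $name =~ m/^(?:$pat)$/;
    }
    return 0;
}

# '.*CVS.*' covers the whole name, so the entry is excluded.
print is_excluded('src/CVS/Entries', '.*CVS.*') ? "excluded\n" : "kept\n";
# 'CVS' alone only matches an entry named exactly "CVS".
print is_excluded('src/CVS/Entries', 'CVS') ? "excluded\n" : "kept\n";
```

The first call prints C<excluded> and the second prints C<kept>, which is
why the examples above use C<-x '.*CVS.*'> rather than C<-x CVS>.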
=head1 LIMITATIONS
The GNU implementation does not follow the POSIX specification for file
names longer than 100 characters. The current C<tarcust> version
knows how to deal with file names E<lt> 512 chars in the GNU format.
Future releases should support both GNU and POSIX standards.
For portability reasons, directories are put in the archive to make sure
they exist when files are extracted. So when you apply a transformation
to file names that moves them to another directory, you have to ensure
that directories are updated too. The B<--prefix> option will
automatically insert directories, but not B<--transform-names>.
=head1 HISTORY
The B<tarcust> program is strongly inspired by B<tardy>, a tar
post-processor written in C by Peter Miller.
I realized that a tarfile is just formatted text, so Perl should
be a nice tool to manipulate it. On CPAN, I found the Archive::Tar
module by Calle Dybedahl, Gurusamy Sarathy and Stephen Zander.
But this module is more a Perl replacement for B<tar> than a
post-processor. So I wrote my own Perl script with these goals:
=over 2
=item o
Keep it simple and portable.
=item o
Do not try to emulate unnecessary features already provided by B<tar>
or other standard UNIX tools (e.g. there is no plan to handle compressed
tarfiles or to add files to an archive).
=item o
Concentrate on the building of software packages.
=back
When writing B<tarcust>, I tried to be compatible with existing
software; that is why option names are taken from B<tardy> or B<tar>.
=head1 SEE ALSO
tar(1)
tardy(1)
Archive::Tar(3)
=head1 AUTHOR
Denis Barbier
barbier@engelschall.com
http://www.engelschall.com/sw/tarcust/
=head1 CREDITS
John Midgley C<E<lt>jmidgley@labyrinth.net.auE<gt>> suggested the
B<-s> and B<-e> flags which make B<tarcust> much more powerful.
=cut