File: mbfw1.awk

gawk 1:4.0.1+dfsg-2.1
  • area: main
  • in suites: wheezy
  • size: 10,944 kB
  • sloc: ansic: 38,137; awk: 6,654; yacc: 6,066; sh: 4,807; makefile: 1,802; sed: 112
file content (40 lines) | stat: -rw-r--r-- 1,062 bytes
# Date: Sun, 16 Mar 2008 18:51:14 +0100
# From: Hermann Peifer <peifer@gmx.eu>
# Subject: [Fwd: Gawk FIELDWIDTHS and multibyte characters]
# To: bug-gawk@gnu.org
# Message-id: <47DD5E12.2010403@gmx.eu>
# 
# See below. Regards, Hermann
# 
# --- Original Message ---
# 
# Newsgroups: comp.lang.awk
# From: Hermann Peifer <peifer@gmx.eu>
# Date: Sun, 16 Mar 2008 01:23:38 -0700 (PDT)
# Subject: Gawk FIELDWIDTHS and multibyte characters
# 
# Hi,
# 
# It looks to me that Gawk's FIELDWIDTHS extension is not aware of
# multibyte characters, see my example below.
# 
# $ cat testdata
# CDRegion              Commune             Site
# SEVästsverige         Hallands län        Kungsbacka
# SESmåland med öarna   Västra Götalands länGöteborg
# SEKronoberg           Alvesta             Stenungsund
# 
# $ file testdata
# testdata: UTF-8 Unicode text
# 
# $ awk 'BEGIN{FIELDWIDTHS = "2 20 20 20"}{print $4}' testdata
# Site
#    Kungsbacka
#   länGöteborg
# Stenungsund
# 
# Can someone confirm?
# 
# Hermann
BEGIN { FIELDWIDTHS = "2 20 20 20" }
{ print $4 }
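
# A minimal sketch, not part of the original test: it emulates what
# FIELDWIDTHS = "2 20 20 20" does by slicing each record with substr().
# The helper name fw_split and the wrapper fw_demo are hypothetical.
# Whether substr() counts bytes or characters depends on the awk
# implementation and the locale, which is the mismatch the report above
# describes; the sample line here is ASCII-only, so both readings agree.

```sh
# fw_demo: read fixed-width records on stdin and print the fourth field.
fw_demo() {
    awk '
    # Hypothetical helper: cut rec into pieces whose widths are listed in spec.
    # Extra parameters after "out" are local variables.
    function fw_split(rec, spec, out,    n, w, i, pos) {
        n = split(spec, w, " ")
        pos = 1
        for (i = 1; i <= n; i++) {
            out[i] = substr(rec, pos, w[i])   # field i starts at pos
            pos += w[i]                       # advance past this field
        }
        return n
    }
    { fw_split($0, "2 20 20 20", f); print f[4] }
    '
}

# Build an ASCII-only record laid out like the rows in testdata.
printf '%-2s%-20s%-20s%s\n' SE Kronoberg Alvesta Stenungsund | fw_demo
# prints: Stenungsund
```

# With a record containing multibyte characters, a byte-counting substr()
# would be expected to shift later fields in the way the report shows.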