- 02 Jun, 2012 6 commits
-
Jan Ziak authored
R=rsc CC=golang-dev https://golang.org/cl/6255074
-
Charles L. Dorian authored
Ceil  to 4.81 from 20.6 ns/op
Floor to 4.37 from 13.5 ns/op
Trunc to 3.97 from 14.3 ns/op

Also changed three MOVSDs to MOVAPDs in log_amd64.s.

R=rsc, golang-dev CC=golang-dev https://golang.org/cl/6262048
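For context, ns/op figures like these come from the standard testing.B benchmark loop; a minimal, self-contained sketch (the input value .5 is an assumption, not necessarily what the math package's own benchmarks use):

    package math_test

    import (
        "math"
        "testing"
    )

    // BenchmarkFloor reports the average time per math.Floor call.
    // Run with: go test -bench=Floor
    func BenchmarkFloor(b *testing.B) {
        for i := 0; i < b.N; i++ {
            math.Floor(.5)
        }
    }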
-
Jan Mercl authored
Currently walk() doesn't check for err == SkipDir when iterating a directory list, but such a promise is made in the docs for WalkFunc. Fixes #3486. R=rsc, r CC=golang-dev https://golang.org/cl/6257059
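For illustration, a minimal sketch of the documented promise this change honors: returning filepath.SkipDir from a WalkFunc makes Walk skip that directory's contents (the skipped name "testdata" is only an example):

    package main

    import (
        "fmt"
        "os"
        "path/filepath"
    )

    func main() {
        filepath.Walk(".", func(path string, info os.FileInfo, err error) error {
            if err != nil {
                return err
            }
            // Skip a whole subtree by returning SkipDir from the callback.
            if info.IsDir() && info.Name() == "testdata" {
                return filepath.SkipDir
            }
            fmt.Println(path)
            return nil
        })
    }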
-
Shenghou Ma authored
R=dave, rsc CC=golang-dev https://golang.org/cl/6248070
-
Brad Fitzpatrick authored
Now that gri has made go/parser 15% faster, I offer this change to slow back down cmd/api ~proportionately, adding FreeBSD to the go1-checked set of platforms. Really we should have done this earlier. This will prevent us from breaking FreeBSD compatibility accidentally in the future. R=golang-dev, r CC=golang-dev https://golang.org/cl/6279044
-
Rob Pike authored
To avoid goroutines during init, the nextItem function was a clever workaround. Now that init goroutines are permitted, restore the original, simpler design. R=golang-dev, bradfitz CC=golang-dev https://golang.org/cl/6282043
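The "original, simpler design" referred to is the channel-fed lexer; a toy sketch of the pattern, with illustrative names rather than the actual text/template internals:

    package main

    import "fmt"

    type item string

    // lex runs the scanner in its own goroutine and delivers items over a
    // channel, so the parser simply receives instead of calling nextItem.
    func lex(input string) chan item {
        items := make(chan item)
        go func() {
            for _, r := range input { // stand-in for the real state machine
                items <- item(string(r))
            }
            close(items)
        }()
        return items
    }

    func main() {
        for it := range lex("ab") {
            fmt.Println(it)
        }
    }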
-
- 01 Jun, 2012 5 commits
-
Robert Griesemer authored
- only compute current line position if needed (i.e., if a comment is present)
- added benchmark

benchmark       old ns/op  new ns/op  delta
BenchmarkParse  10902990   9313330    -14.58%

benchmark       old MB/s  new MB/s  speedup
BenchmarkParse  5.31      6.22      1.17x

R=golang-dev, r CC=golang-dev https://golang.org/cl/6270043
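The first bullet is the classic lazy-evaluation trick: skip the relatively expensive position computation on the comment-free fast path. A generic sketch of the pattern (names are illustrative, not go/parser's actual code):

    package main

    import "fmt"

    type tok struct {
        offset  int
        comment string
    }

    // lineOf stands in for a relatively expensive position lookup.
    func lineOf(offset int) int { return offset/40 + 1 }

    func show(t tok) {
        // Only pay for the line computation when a comment forces it.
        if t.comment != "" {
            fmt.Printf("line %d: %s\n", lineOf(t.offset), t.comment)
        }
    }

    func main() {
        show(tok{offset: 80, comment: "// NOTE"})
        show(tok{offset: 120}) // common path: no position computed
    }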
-
Ryan Barrett authored
R=golang-dev, sameer, bradfitz CC=golang-dev, jba https://golang.org/cl/6213056
-
Russ Cox authored
Saving the code in case we improve things enough that it matters later, but at least right now it is not worth doing. R=ken2 CC=golang-dev https://golang.org/cl/6248071
-
Russ Cox authored
Dreg from https://golang.org/cl/4629042 R=ken2 CC=golang-dev https://golang.org/cl/6259057
-
David Symonds authored
R=golang-dev, r CC=golang-dev https://golang.org/cl/6257082
-
- 31 May, 2012 8 commits
-
Nigel Tao authored
exp/html/atom benchmark:

benchmark        old ns/op  new ns/op  delta
BenchmarkLookup  199226     80770      -59.46%

exp/html benchmark:

benchmark                    old ns/op  new ns/op  delta
BenchmarkParser              4864890    4510834    -7.28%
BenchmarkHighLevelTokenizer  2209192    1969684    -10.84%

benchmark                    old MB/s  new MB/s  speedup
BenchmarkParser              16.07     17.33     1.08x
BenchmarkHighLevelTokenizer  35.38     39.68     1.12x

R=r CC=golang-dev https://golang.org/cl/6261054
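The speedup rests on interning: map each common HTML name to a small integer once, so every later comparison is integer equality rather than string equality. A toy sketch of the idea (the table and constants are illustrative; the real package uses a generated lookup table):

    package main

    import "fmt"

    // Atom is a small integer code for a common HTML name.
    type Atom int

    const (
        A Atom = iota + 1
        Body
        Div
    )

    var table = map[string]Atom{"a": A, "body": Body, "div": Div}

    // Lookup returns the atom for s, or 0 if s is not a known name.
    func Lookup(s []byte) Atom { return table[string(s)] }

    func main() {
        if Lookup([]byte("div")) == Div { // integer comparison from here on
            fmt.Println("matched <div>")
        }
    }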
-
Rémy Oudompheng authored
The previous code was preparing arrays of entries that would be filled if there was one entry every 128 bytes. Moving to a 4096-byte interval reduces the overhead per megabyte of address space to 2kB from 64kB (on 64-bit systems). The performance impact will be negative for very small MemProfileRate.

test/bench/garbage/tree2 -heapsize 800000000 (default memprofilerate)
Before: mprof 65993056 bytes (1664 bucketmem + 65991392 addrmem)
After:  mprof  1989984 bytes (1680 bucketmem +  1988304 addrmem)

R=golang-dev, rsc CC=golang-dev, remy https://golang.org/cl/6257069
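The 64kB and 2kB figures follow from one 8-byte entry per interval on a 64-bit system; a quick arithmetic check (a sketch of the calculation only, not the runtime's actual data structures):

    package main

    import "fmt"

    func main() {
        const mb = 1 << 20 // one megabyte of address space
        const entry = 8    // bytes per entry on a 64-bit system
        fmt.Println(mb / 128 * entry)  // 65536 bytes (~64kB) at a 128-byte interval
        fmt.Println(mb / 4096 * entry) // 2048 bytes (2kB) at a 4096-byte interval
    }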
-
Sameer Ajmani authored
R=golang-dev, r CC=golang-dev https://golang.org/cl/6244071
-
Rémy Oudompheng authored
The previous heap profile format did not include buckets with zero used bytes. Also add several missing MemStats fields in debug mode. R=golang-dev, rsc CC=golang-dev, remy https://golang.org/cl/6249068
-
Nigel Tao authored
50% fewer mallocs in HTML tokenization, resulting in 25% fewer mallocs in parsing go1.html. Making the parser use integer comparisons instead of string comparisons will be a follow-up CL, to be co-ordinated with Andy Balholm's work.

exp/html benchmarks before/after:

Before:
BenchmarkParser               500  4754294 ns/op  16.44 MB/s
  parse_test.go:409: 500 iterations, 14651 mallocs per iteration
BenchmarkRawLevelTokenizer   2000   903481 ns/op  86.51 MB/s
  token_test.go:678: 2000 iterations, 28 mallocs per iteration
BenchmarkLowLevelTokenizer   2000  1260485 ns/op  62.01 MB/s
  token_test.go:678: 2000 iterations, 41 mallocs per iteration
BenchmarkHighLevelTokenizer  1000  2165964 ns/op  36.09 MB/s
  token_test.go:678: 1000 iterations, 6616 mallocs per iteration

After:
BenchmarkParser               500  4664912 ns/op  16.76 MB/s
  parse_test.go:409: 500 iterations, 11266 mallocs per iteration
BenchmarkRawLevelTokenizer   2000   903065 ns/op  86.55 MB/s
  token_test.go:678: 2000 iterations, 28 mallocs per iteration
BenchmarkLowLevelTokenizer   2000  1260032 ns/op  62.03 MB/s
  token_test.go:678: 2000 iterations, 41 mallocs per iteration
BenchmarkHighLevelTokenizer  1000  2143356 ns/op  36.47 MB/s
  token_test.go:678: 1000 iterations, 3231 mallocs per iteration

R=r, rsc, rogpeppe CC=andybalholm, golang-dev https://golang.org/cl/6255062
-
Rob Pike authored
Byte slices are not strings. Fixes #3687. R=golang-dev, dsymonds CC=golang-dev https://golang.org/cl/6257074
-
Andrew Gerrand authored
R=golang-dev, dsymonds CC=golang-dev https://golang.org/cl/6258064
-
Andrew Gerrand authored
R=golang-dev, dsymonds CC=golang-dev https://golang.org/cl/6244069
-
- 30 May, 2012 21 commits
-
Dave Cheney authored
Add -ccflags to pass arguments to {5,6,8}c similar to -gcflags for {5,6,8}g. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/6260047
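A hypothetical invocation, for illustration only: go build -ccflags '-DDEBUG' mypkg would forward -DDEBUG to the C compiler for any C files in the package, just as -gcflags forwards options to the Go compiler.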
-
Russ Cox authored
Drop expecttaken function in favor of extra argument to gbranch and bgen. Mark loop condition as likely to be true, so that loops are generated inline.

The main benefit here is contiguous code when trying to read the generated assembly. It has only minor effects on the timing, and they mostly cancel the minor effects that aligning function entry points had. One exception: both changes made Fannkuch faster.

Compared to before CL 6244066 (before aligned functions):

benchmark               old ns/op   new ns/op   delta
BenchmarkBinaryTree17   4222117400  4201958800  -0.48%
BenchmarkFannkuch11     3462631800  3215908600  -7.13%
BenchmarkGobDecode      20887622    20899164    +0.06%
BenchmarkGobEncode      9548772     9439083     -1.15%
BenchmarkGzip           151687      152060      +0.25%
BenchmarkGunzip         8742        8711        -0.35%
BenchmarkJSONEncode     62730560    62686700    -0.07%
BenchmarkJSONDecode     252569180   252368960   -0.08%
BenchmarkMandelbrot200  5267599     5252531     -0.29%
BenchmarkRevcomp25M     980813500   985248400   +0.45%
BenchmarkTemplate       361259100   357414680   -1.06%

Compared to tip (aligned functions):

benchmark               old ns/op   new ns/op   delta
BenchmarkBinaryTree17   4140739800  4201958800  +1.48%
BenchmarkFannkuch11     3259914400  3215908600  -1.35%
BenchmarkGobDecode      20620222    20899164    +1.35%
BenchmarkGobEncode      9384886     9439083     +0.58%
BenchmarkGzip           150333      152060      +1.15%
BenchmarkGunzip         8741        8711        -0.34%
BenchmarkJSONEncode     65210990    62686700    -3.87%
BenchmarkJSONDecode     249394860   252368960   +1.19%
BenchmarkMandelbrot200  5273394     5252531     -0.40%
BenchmarkRevcomp25M     996013800   985248400   -1.08%
BenchmarkTemplate       360620840   357414680   -0.89%

R=ken2 CC=golang-dev https://golang.org/cl/6245069
-
Mikio Hara authored
R=golang-dev, dave, rsc CC=golang-dev https://golang.org/cl/6248065
-
Russ Cox authored
Was missing break. R=ken2 CC=golang-dev https://golang.org/cl/6250078
-
Russ Cox authored
On 6l and 8l, this is a real instruction, guaranteed to cause an 'undefined instruction' exception. On 5l, we simulate it as BL to address 0. The plan is to use it as a signal to the linker that this point in the instruction stream cannot be reached (hence the changes to nofollow). This will help the compiler explain that panicindex and friends do not return without having to put a list of these functions in the linker. R=ken2 CC=golang-dev https://golang.org/cl/6255064
-
Russ Cox authored
16 seems pretty standard on x86 for function entry. I don't know if ARM would benefit, so I used just 4 (single instruction alignment).

This has a minor absolute effect on the current timings. The main hope is that it will make them more consistent from run to run.

benchmark               old ns/op   new ns/op   delta
BenchmarkBinaryTree17   4222117400  4140739800  -1.93%
BenchmarkFannkuch11     3462631800  3259914400  -5.85%
BenchmarkGobDecode      20887622    20620222    -1.28%
BenchmarkGobEncode      9548772     9384886     -1.72%
BenchmarkGzip           151687      150333      -0.89%
BenchmarkGunzip         8742        8741        -0.01%
BenchmarkJSONEncode     62730560    65210990    +3.95%
BenchmarkJSONDecode     252569180   249394860   -1.26%
BenchmarkMandelbrot200  5267599     5273394     +0.11%
BenchmarkRevcomp25M     980813500   996013800   +1.55%
BenchmarkTemplate       361259100   360620840   -0.18%

R=ken2 CC=golang-dev https://golang.org/cl/6244066
-
Russ Cox authored
The code was inconsistent about when it used brchain(x) and when it used x directly, with the result that you could end up emitting code for brchain(x) but leave the jump pointing at an unemitted x. R=ken2 CC=golang-dev https://golang.org/cl/6250077
-
Ivan Krasin authored
This bug was introduced in the following revision:

changeset: 11404:26dceba5c610
user:      Ivan Krasin <krasin@golang.org>
date:      Mon Jan 23 09:19:39 2012 -0500
summary:   compress/flate: reduce memory pressure at cost of additional arithmetic operation.

This is the review page for that CL: https://golang.org/cl/5555070/

R=rsc, imkrasin CC=golang-dev https://golang.org/cl/6249067
-
Mats Lidell authored
Fixes some portability issues between the Emacsen. R=golang-dev, sameer, bradfitz, ryanb CC=golang-dev https://golang.org/cl/6206043
-
Rob Pike authored
Most significant in mandelbrot, from avoiding MOVSD between registers, but there are others. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/6258063
-
Russ Cox authored
MOVSD only copies the low half of the packed register pair, while MOVAPD copies both halves. I assume the internal register renaming works better with the latter, since it makes our code run 25% faster.

Before:
mandelbrot 16000
gcc -O2 mandelbrot.c  28.44u 0.00s 28.45r
gc mandelbrot         44.12u 0.00s 44.13r
gc_B mandelbrot       44.17u 0.01s 44.19r

After:
mandelbrot 16000
gcc -O2 mandelbrot.c  28.22u 0.00s 28.23r
gc mandelbrot         32.81u 0.00s 32.82r
gc_B mandelbrot       32.82u 0.00s 32.83r

R=ken2 CC=golang-dev https://golang.org/cl/6248068
-
Russ Cox authored
Surprise! The C code is using floating point values for its counters. It's off the critical path, but the Go code and C code are supposed to be as similar as possible to make comparisons meaningful. It doesn't have a significant effect. R=golang-dev, r CC=golang-dev https://golang.org/cl/6260058
-
Sameer Ajmani authored
address, but his changelist is under the Gmail address. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/6248069
-
Jean-Marc Eurin authored
This uses the patch output of gofmt (-d option) and applies each chunk to the buffer, instead of replacing the whole buffer. The main advantage is that the undo history is kept across gofmt'ings, so it can really be used as a before-save-hook. R=sameer, sameer CC=golang-dev https://golang.org/cl/6198047
-
Rob Pike authored
R=golang-dev, bradfitz, rsc CC=golang-dev https://golang.org/cl/6259054
-
Joel Sing authored
The correct procid is needed for unparking LWPs on NetBSD - always initialise procid in minit() so that cgo works correctly. The non-cgo case already works correctly since procid is initialised via lwp_create(). R=golang-dev, rsc CC=golang-dev https://golang.org/cl/6257071
-
Jan Ziak authored
R=rsc, remyoudompheng, minux.ma, ality CC=golang-dev https://golang.org/cl/6242061
-
Joel Sing authored
On NetBSD a cgo enabled binary has more than 32 sections - bump NSECTS so that we can actually link them successfully. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/6261052
-
Jan Ziak authored
R=rsc CC=golang-dev https://golang.org/cl/6243059
-
Marcel van Lohuizen authored
R=r CC=golang-dev https://golang.org/cl/6202063
-
Russ Cox authored
R=golang-dev, bradfitz CC=golang-dev https://golang.org/cl/6244063
-