1. 25 Apr, 2018 1 commit
  2. 24 Apr, 2018 20 commits
    • Keith Randall's avatar
      cmd/compile: update SSA TODO file · ae26d57f
      Keith Randall authored
      Get rid of a bunch of stuff we've already done.
      
      Change-Id: Ibae4be7535ddb58590a072a2390c5f3e948c2fd7
      Reviewed-on: https://go-review.googlesource.com/109136Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      ae26d57f
    • Matthew Dempsky's avatar
      cmd/compile/internal/types: replace Type.Val with Type.Elem · 2083b5d6
      Matthew Dempsky authored
      This reduces the API surface of Type slightly (for #25056), but also
      makes it more consistent with the reflect and go/types APIs.
      
      Passes toolstash-check.
      
      Change-Id: Ief9a8eb461ae6e88895f347e2a1b7b8a62423222
      Reviewed-on: https://go-review.googlesource.com/109138
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      2083b5d6
    • Matthew Dempsky's avatar
      cmd/compile/internal/types: remove ElemType wrapper · e10ee798
      Matthew Dempsky authored
      This was an artifact from when we had a separate ssa.Type interface to
      break circular dependency between packages ssa and gc. It's no longer
      needed now that package ssa directly uses package types.
      
      Change-Id: I6a93e5d79082815f7f0eb89507381969cc6cb403
      Reviewed-on: https://go-review.googlesource.com/109137
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarJosh Bleecher Snyder <josharian@gmail.com>
      e10ee798
    • Hana Kim's avatar
      cmd/trace: distinguish task endTimestamp and lastTimestamp · 011f6c5f
      Hana Kim authored
      A task may have other user annotation events after the task ends.
      So far, task.lastTimestamp returned the task end event if the
      event available. This change introduces task.endTimestamp for that
      and makes task.lastTimestamp returns the "last" seen event's timestamp
      if the task is ended.
      
      If the task is not ended, both returns the last timestamp of the entire
      trace assuming the task is still active.
      
      This fixes the task-oriented trace view mode not to drop user
      annotation instances when they appear outside a task's lifespan.
      Adds a test.
      
      Change-Id: Iba1062914f224edd521b9ee55c6cd5e180e55359
      Reviewed-on: https://go-review.googlesource.com/109175Reviewed-by: 's avatarHeschi Kreinick <heschi@google.com>
      011f6c5f
    • erifan01's avatar
      internal/bytealg: optimize IndexString on arm64 · d4e936cf
      erifan01 authored
      This CL adjusts the order of the branch instructions of the
      code to make it easier for the LIKELY branch to happen.
      
      Benchmarks:
      name                            old time/op    new time/op    delta
      pkg:strings goos:linux goarch:arm64
      IndexHard2-8                      2.17ms ± 1%    1.23ms ± 0%  -43.34%  (p=0.008 n=5+5)
      CountHard2-8                      2.13ms ± 1%    1.21ms ± 2%  -43.31%  (p=0.008 n=5+5)
      
      pkg:bytes goos:linux goarch:arm64
      IndexRune/4M-8                     661µs ±22%     513µs ± 0%  -22.32%  (p=0.008 n=5+5)
      IndexEasy/4M-8                     672µs ±23%     513µs ± 0%  -23.71%  (p=0.016 n=5+4)
      
      Change-Id: Ib96f095edf77747edc8a971e79f5c1428e5808ce
      Reviewed-on: https://go-review.googlesource.com/109015Reviewed-by: 's avatarCherry Zhang <cherryyz@google.com>
      Run-TryBot: Cherry Zhang <cherryyz@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      d4e936cf
    • Heschi Kreinick's avatar
      cmd/link: fix TestRuntimeTypeAttr on ppc64,solaris · 5d4267e4
      Heschi Kreinick authored
      For ppc64, skip -linkmode=external per
      https://go-review.googlesource.com/c/go/+/106775#message-f95b9bd716e3d9ebb3f47a50492cde9f2972e859
      
      For Solaris, apparently type.* isn't the same as runtime.types. I don't
      know why, but runtime.types is what goes into moduledata, and so it's
      definitely the more correct thing to use.
      
      Fixes: #24983
      
      Change-Id: I6b465ac7b8f91ce55a63acbd7fe76e4a2dbb6f22
      Reviewed-on: https://go-review.googlesource.com/108955
      Run-TryBot: Heschi Kreinick <heschi@google.com>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      5d4267e4
    • Josh Bleecher Snyder's avatar
      cmd/compile: improve regalloc live values debug printing · 3d6647d6
      Josh Bleecher Snyder authored
      Before:
      
      live values at end of each block
        b1: v3 v2 v7 avoid=0
        b2: v3 v13 avoid=81
        b3: v19[AX] v3 avoid=81
        b6: avoid=0
        b7: avoid=0
        b5: avoid=0
        b4: v3 v18 avoid=81
      
      After:
      
      live values at end of each block
        b1: v3 v2 v7
        b2: v3 v13 avoid=AX DI
        b3: v19[AX] v3 avoid=AX DI
        b6:
        b7:
        b5:
        b4: v3 v18 avoid=AX DI
      
      Change-Id: Ibec5c76a16151832b8d49a21c640699fdc9a9d28
      Reviewed-on: https://go-review.googlesource.com/109000
      Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      3d6647d6
    • Hana Kim's avatar
      runtime/trace: add simple benchmarks for user annotation · 3bf16444
      Hana Kim authored
      Also, avoid Region creation when tracing is disabled.
      Unfortunate side-effect of this change is that we no longer trace
      pre-existing regions in tracing, but we can add the feature in
      the future when we find it useful and justifiable. Until then,
      let's avoid the overhead from this low-level api use as much as
      possible.
      
      goos: linux
      goarch: amd64
      pkg: runtime/trace
      
      // Trace disabled
      BenchmarkStartRegion-12 2000000000	         0.66 ns/op	       0 B/op	       0 allocs/op
      BenchmarkNewTask-12    	30000000	        40.4 ns/op	      56 B/op	       2 allocs/op
      
      // Trace enabled, -trace=/dev/null
      BenchmarkStartRegion-12  5000000	       287 ns/op	      32 B/op	       1 allocs/op
      BenchmarkNewTask-12    	 5000000	       283 ns/op	      56 B/op	       2 allocs/op
      
      Also, skip other tests if tracing is already enabled.
      
      Change-Id: Id3028d60b5642fcab4b09a74fd7d79361a3861e5
      Reviewed-on: https://go-review.googlesource.com/109115Reviewed-by: 's avatarPeter Weinberger <pjw@google.com>
      3bf16444
    • Hana Kim's avatar
      runtime/trace: rename "Span" with "Region" · c2d10243
      Hana Kim authored
      "Span" is a commonly used term in many distributed tracing systems
      (Dapper, OpenCensus, OpenTracing, ...). They use it to refer to a
      period of time, not necessarily tied into execution of underlying
      processor, thread, or goroutine, unlike the "Span" of runtime/trace
      package.
      
      Since distributed tracing and go runtime execution tracing are
      already similar enough to cause confusion, this CL attempts to avoid
      using the same word if possible.
      
      "Region" is being used in a certain tracing system to refer to a code
      region which is pretty close to what runtime/trace.Span currently
      refers to. So, replace that.
      https://software.intel.com/en-us/itc-user-and-reference-guide-defining-and-recording-functions-or-regions
      
      This CL also tweaks APIs a bit based on jbd and heschi's comments:
      
        NewContext -> NewTask
          and it now returns a Task object that exports End method.
      
        StartSpan -> StartRegion
          and it now returns a Region object that exports End method.
      
      Also, changed WithSpan to WithRegion and it now takes func() with no
      context. Another thought is to get rid of WithRegion. It is a nice
      concept but in practice, it seems problematic (a lot of code churn,
      and polluting stack trace). Already, the tracing concept is very low
      level, and we hope this API to be used with great care.
      
      Recommended usage will be
         defer trace.StartRegion(ctx, "someRegion").End()
      
      Left old APIs untouched in this CL. Once the usage of them are cleaned
      up, they will be removed in a separate CL.
      
      Change-Id: I73880635e437f3aad51314331a035dd1459b9f3a
      Reviewed-on: https://go-review.googlesource.com/108296
      Run-TryBot: Hyang-Ah Hana Kim <hyangah@gmail.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarJBD <jbd@google.com>
      c2d10243
    • Ilya Tocar's avatar
      cmd/compile/internal/ssa: fix endless compile loop on AMD64 · fb017c60
      Ilya Tocar authored
      We currently rewrite
      (TESTQ (MOVQconst [c] x)) into (TESTQconst [c] x)
      and (TESTQconst [-1] x) into (TESTQ x x)
      if x is a (MOVQconst [-1]) we will be stuck in the endless rewrite loop.
      Don't perform the rewrite in such cases.
      
      Fixes #25006
      
      Change-Id: I77f561ba2605fc104f1e5d5c57f32e9d67a2c000
      Reviewed-on: https://go-review.googlesource.com/108879
      Run-TryBot: Ilya Tocar <ilya.tocar@intel.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarKeith Randall <khr@golang.org>
      fb017c60
    • Hana (Hyang-Ah) Kim's avatar
      runtime/pprof: introduce "allocs" profile · cd037bce
      Hana (Hyang-Ah) Kim authored
      The Go's heap profile contains four kinds of samples
      (inuse_space, inuse_objects, alloc_space, and alloc_objects).
      The pprof tool by default chooses the inuse_space (the bytes
      of live, in-use objects). When analyzing the current memory
      usage the choice of inuse_space as the default may be useful,
      but in some cases, users are more interested in analyzing the
      total allocation statistics throughout the program execution.
      For example, when we analyze the memory profile from benchmark
      or program test run, we are more likely interested in the whole
      allocation history than the live heap snapshot at the end of
      the test or benchmark.
      
      The pprof tool provides flags to control which sample type
      to be used for analysis. However, it is one of the less-known
      features of pprof and we believe it's better to choose the
      right type of samples as the default when producing the profile.
      
      This CL introduces a new type of profile, "allocs", which is
      the same as the "heap" profile but marks the alloc_space
      as the default type unlike heap profiles that use inuse_space
      as the default type.
      
      'go test -memprofile=...' command is changed to use the new
      "allocs" profile type instead of the traditional "heap" profile.
      
      Fixes #24443
      
      Change-Id: I012dd4b6dcacd45644d7345509936b8380b6fbd9
      Reviewed-on: https://go-review.googlesource.com/102696
      Run-TryBot: Hyang-Ah Hana Kim <hyangah@gmail.com>
      Reviewed-by: 's avatarRuss Cox <rsc@golang.org>
      cd037bce
    • quasilyte's avatar
      cmd/internal/obj/x86: forbid mem args for MOV_DR and MOV_CR · 70c5839f
      quasilyte authored
      Memory arguments for debug/control register moves are a
      minefield for programmer: not useful, but can lead to errors.
      
      See referenced issue for detailed explanation.
      
      Fixes #24981
      
      Change-Id: I918e81cd4a8b1dfcfc9023cdfc3de45abe29e749
      Reviewed-on: https://go-review.googlesource.com/107075
      Run-TryBot: Iskander Sharipov <iskander.sharipov@intel.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarCherry Zhang <cherryyz@google.com>
      70c5839f
    • isharipo's avatar
      cmd/compile/internal/ssa: add Op{SP,SB} type checks to check.go · cb44c8de
      isharipo authored
      gc/ssa.go initilizes SP and SB values with TUINTPTR type.
      Assign same type in SSA tests and modify check.go to catch
      mismatching types for those ops.
      
      This makes SSA tests more consistent.
      
      Change-Id: I798440d57d00fb949d1a0cd796759c9b82a934bd
      Reviewed-on: https://go-review.googlesource.com/106658
      Run-TryBot: Iskander Sharipov <iskander.sharipov@intel.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarKeith Randall <khr@golang.org>
      cb44c8de
    • ludweeg's avatar
      crypto/md5: unnecessary conversion · d18f186b
      ludweeg authored
      Fixes go lint warning.
      
      Change-Id: I5a7485a4c8316b81e6aa50b95fe75e424f2fcedc
      Reviewed-on: https://go-review.googlesource.com/109055Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      d18f186b
    • Andrei Tudor Călin's avatar
      net: add support for splice(2) in (*TCPConn).ReadFrom on Linux · f2316c27
      Andrei Tudor Călin authored
      This change adds support for the splice system call on Linux,
      for the purpose of optimizing (*TCPConn).ReadFrom by reducing
      copies of data from and to userspace. It does so by creating a
      temporary pipe and splicing data from the source connection to the
      pipe, then from the pipe to the destination connection. The pipe
      serves as an in-kernel buffer for the data transfer.
      
      No new API is added to package net, but a new Splice function is
      added to package internal/poll, because using splice requires help
      from the network poller. Users of the net package should benefit
      from the change transparently.
      
      This change only enables the optimization if the Reader in ReadFrom
      is a TCP connection. Since splice is a more general interface, it
      could, in theory, also be enabled if the Reader were a unix socket,
      or the read half of a pipe.
      
      However, benchmarks show that enabling it for unix sockets is most
      likely not a net performance gain. The tcp <- unix case is also
      fairly unlikely to be used very much by users of package net.
      
      Enabling the optimization for pipes is also problematic from an
      implementation perspective, since package net cannot easily get at
      the *poll.FD of an *os.File. A possible solution to this would be
      to dup the pipe file descriptor, register the duped descriptor with
      the network poller, and work on that *poll.FD instead of the original.
      However, this seems too intrusive, so it has not been done. If there
      was a clean way to do it, it would probably be worth doing, since
      splicing from a pipe to a socket can be done directly.
      
      Therefore, this patch only enables the optimization for what is likely
      the most common use case: tcp <- tcp.
      
      The following benchmark compares the performance of the previous
      userspace genericReadFrom code path to the new optimized code path.
      The sub-benchmarks represent chunk sizes used by the writer on the
      other end of the Reader passed to ReadFrom.
      
      benchmark                          old ns/op     new ns/op     delta
      BenchmarkTCPReadFrom/1024-4        4727          4954          +4.80%
      BenchmarkTCPReadFrom/2048-4        4389          4301          -2.01%
      BenchmarkTCPReadFrom/4096-4        4606          4534          -1.56%
      BenchmarkTCPReadFrom/8192-4        5219          4779          -8.43%
      BenchmarkTCPReadFrom/16384-4       8708          8008          -8.04%
      BenchmarkTCPReadFrom/32768-4       16349         14973         -8.42%
      BenchmarkTCPReadFrom/65536-4       35246         27406         -22.24%
      BenchmarkTCPReadFrom/131072-4      72920         52382         -28.17%
      BenchmarkTCPReadFrom/262144-4      149311        95094         -36.31%
      BenchmarkTCPReadFrom/524288-4      306704        181856        -40.71%
      BenchmarkTCPReadFrom/1048576-4     674174        357406        -46.99%
      
      benchmark                          old MB/s     new MB/s     speedup
      BenchmarkTCPReadFrom/1024-4        216.62       206.69       0.95x
      BenchmarkTCPReadFrom/2048-4        466.61       476.08       1.02x
      BenchmarkTCPReadFrom/4096-4        889.09       903.31       1.02x
      BenchmarkTCPReadFrom/8192-4        1569.40      1714.06      1.09x
      BenchmarkTCPReadFrom/16384-4       1881.42      2045.84      1.09x
      BenchmarkTCPReadFrom/32768-4       2004.18      2188.41      1.09x
      BenchmarkTCPReadFrom/65536-4       1859.38      2391.25      1.29x
      BenchmarkTCPReadFrom/131072-4      1797.46      2502.21      1.39x
      BenchmarkTCPReadFrom/262144-4      1755.69      2756.68      1.57x
      BenchmarkTCPReadFrom/524288-4      1709.42      2882.98      1.69x
      BenchmarkTCPReadFrom/1048576-4     1555.35      2933.84      1.89x
      
      Fixes #10948
      
      Change-Id: I3ce27f21f7adda8b696afdc48a91149998ae16a5
      Reviewed-on: https://go-review.googlesource.com/107715
      Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarIan Lance Taylor <iant@golang.org>
      f2316c27
    • Wèi Cōngruì's avatar
      runtime: fix errno sign for epollctl on mips, mips64 and ppc64 · cc880923
      Wèi Cōngruì authored
      The caller of epollctl expects it to return a negative errno value,
      but it returns a positive errno value on mips, mips64 and ppc64.
      The change fixes this.
      
      Updates #23446
      
      Change-Id: Ie6372eca6c23de21964caaaa433c9a45ef93531e
      Reviewed-on: https://go-review.googlesource.com/89235Reviewed-by: 's avatarCarlos Eduardo Seo <cseo@linux.vnet.ibm.com>
      Reviewed-by: 's avatarIan Lance Taylor <iant@golang.org>
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      cc880923
    • Ian Lance Taylor's avatar
      runtime: change GNU/Linux usleep to use nanosleep · 665b9b34
      Ian Lance Taylor authored
      Ever since we added sleep to the runtime back in 2008, we've
      implemented it on GNU/Linux with the select (or pselect or pselect6)
      system call. But the Linux kernel has a nanosleep system call,
      which should be a tiny bit more efficient since it doesn't have to
      check to see whether there are any file descriptors. So use it.
      
      Change-Id: Icc3430baca46b082a4d33f97c6c47e25fa91cb9a
      Reviewed-on: https://go-review.googlesource.com/108538
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      665b9b34
    • Matthew Dempsky's avatar
      cmd/compile: enable indexed export format by default · a3c75d9b
      Matthew Dempsky authored
      Change-Id: Id018eeb79afbe2c695a583b3845cfbc1aab08388
      Reviewed-on: https://go-review.googlesource.com/106797
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarRobert Griesemer <gri@golang.org>
      a3c75d9b
    • Matthew Dempsky's avatar
      cmd/compile: add indexed export format · ca2f85fd
      Matthew Dempsky authored
      This CL introduces a new indexed data format for package export
      data. This improves on the previous (sequential) binary format by
      allowing the compiler to selectively (and lazily) load only the data
      that's actually needed for compilation.
      
      In large Go projects, the package export data can become very large
      due to transitive type declaration dependencies and inline
      function/method bodies. By lazily loading these declarations and
      bodies as needed, we avoid wasting time and memory processing
      unnecessary and/or redundant data.
      
      In the benchmarks below, "old" is -iexport=false and "new" is
      -iexport=true. The suffixes indicate the compiler concurrency (-c) and
      inlining (-l) settings used for the build (using -gcflags=all=-foo).
      Benchmarks were run on an HP Z620.
      
      Juju is "go build -a github.com/juju/juju/cmd/...":
      
      name          old real-time/op  new real-time/op  delta
      Juju/c=1/l=0        44.0s ± 1%        38.7s ± 9%  -11.97%  (p=0.001 n=7+7)
      Juju/c=1/l=4        53.7s ± 3%        45.3s ± 4%  -15.53%  (p=0.001 n=7+7)
      Juju/c=4/l=0        39.7s ± 8%        32.0s ± 4%  -19.38%  (p=0.001 n=7+7)
      Juju/c=4/l=4        46.3s ± 4%        38.0s ± 4%  -18.06%  (p=0.001 n=7+7)
      
      name          old user-time/op  new user-time/op  delta
      Juju/c=1/l=0         371s ± 1%         300s ± 0%  -19.07%  (p=0.001 n=7+6)
      Juju/c=1/l=4         482s ± 0%         374s ± 1%  -22.37%  (p=0.001 n=7+7)
      Juju/c=4/l=0         410s ± 1%         340s ± 1%  -17.19%  (p=0.001 n=7+7)
      Juju/c=4/l=4         532s ± 1%         424s ± 1%  -20.26%  (p=0.001 n=7+7)
      
      name          old sys-time/op   new sys-time/op   delta
      Juju/c=1/l=0        33.4s ± 1%        28.4s ± 2%  -15.02%  (p=0.001 n=7+7)
      Juju/c=1/l=4        40.7s ± 2%        32.8s ± 3%  -19.51%  (p=0.001 n=7+7)
      Juju/c=4/l=0        39.8s ± 2%        34.4s ± 2%  -13.74%  (p=0.001 n=7+7)
      Juju/c=4/l=4        48.4s ± 2%        40.4s ± 2%  -16.50%  (p=0.001 n=7+7)
      
      Kubelet is "go build -a k8s.io/kubernetes/cmd/kubelet":
      
      name             old real-time/op  new real-time/op  delta
      Kubelet/c=1/l=0        42.0s ± 1%        34.8s ± 1%  -17.27%  (p=0.008 n=5+5)
      Kubelet/c=1/l=4        55.4s ± 3%        45.4s ± 3%  -18.06%  (p=0.002 n=6+6)
      Kubelet/c=4/l=0        37.4s ± 3%        29.9s ± 1%  -20.25%  (p=0.004 n=6+5)
      Kubelet/c=4/l=4        48.1s ± 2%        39.0s ± 5%  -18.93%  (p=0.002 n=6+6)
      
      name             old user-time/op  new user-time/op  delta
      Kubelet/c=1/l=0         291s ± 1%         233s ± 1%  -19.96%  (p=0.002 n=6+6)
      Kubelet/c=1/l=4         385s ± 1%         298s ± 1%  -22.51%  (p=0.002 n=6+6)
      Kubelet/c=4/l=0         325s ± 0%         268s ± 1%  -17.48%  (p=0.004 n=5+6)
      Kubelet/c=4/l=4         429s ± 1%         343s ± 1%  -20.08%  (p=0.002 n=6+6)
      
      name             old sys-time/op   new sys-time/op   delta
      Kubelet/c=1/l=0        25.1s ± 2%        20.9s ± 4%  -16.69%  (p=0.002 n=6+6)
      Kubelet/c=1/l=4        31.2s ± 3%        24.4s ± 0%  -21.67%  (p=0.010 n=6+4)
      Kubelet/c=4/l=0        30.2s ± 2%        25.6s ± 1%  -15.34%  (p=0.002 n=6+6)
      Kubelet/c=4/l=4        37.3s ± 1%        30.9s ± 2%  -17.11%  (p=0.002 n=6+6)
      
      Change-Id: Ie43eb3bbe1392cbb61c86792a17a57b33b9561f0
      Reviewed-on: https://go-review.googlesource.com/106796
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarRobert Griesemer <gri@golang.org>
      ca2f85fd
    • Matthew Dempsky's avatar
      cmd/compile/internal/types: add Pkg and SetPkg methods to Type · 03f546eb
      Matthew Dempsky authored
      The go/types API exposes what package objects were declared in, which
      includes struct fields, interface methods, and function parameters.
      
      The compiler implicitly tracks these for non-exported identifiers
      (through the Sym's associated Pkg), but exported identifiers always
      use localpkg. To simplify identifying this, add an explicit package
      field to struct, interface, and function types.
      
      Change-Id: I6adc5dc653e78f058714259845fb3077066eec82
      Reviewed-on: https://go-review.googlesource.com/107622Reviewed-by: 's avatarRobert Griesemer <gri@golang.org>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      03f546eb
  3. 23 Apr, 2018 10 commits
    • Josh Bleecher Snyder's avatar
      cmd/compile: rewrite 2*x+c into LEAx1 on amd64 · d292f77e
      Josh Bleecher Snyder authored
      Rewrite x<<1+c into x+x+c, which can be expressed as a single LEAQ/LEAL.
      
      Bit of a special case, but the single-instruction
      LEA is both shorter and faster than SHL then ADD.
      
      Triggers 293 times during make.bash.
      
      Change-Id: I3f09c8e9a8f3859d1eeed336f095fc3ada79c2c1
      Reviewed-on: https://go-review.googlesource.com/108938
      Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarKeith Randall <khr@golang.org>
      d292f77e
    • Rodolfo Carvalho's avatar
      doc: correct statement in go1.10 release notes · c0769741
      Rodolfo Carvalho authored
      The language spec requires the RHS operand of shift expressions to be unsigned integers.
      
      The changes in CL 60230 and the related CL 81277 refer to a variable s of type uint.
      The "untyped constant" here refers to 1.0, not s.
      
      Change-Id: Id2b884816af7f79f453afcb8c34ade2d34e18bc2
      GitHub-Last-Rev: b26c853cae2adea7235a51ad726308e337494385
      GitHub-Pull-Request: golang/go#24989
      Reviewed-on: https://go-review.googlesource.com/108676Reviewed-by: 's avatarRobert Griesemer <gri@golang.org>
      c0769741
    • Josh Bleecher Snyder's avatar
      cmd/compile: add amd64 LEAL{1,2,4,8} ops · 22115859
      Josh Bleecher Snyder authored
      For future use in rewrite rules.
      
      Change-Id: Ic9875beb0dea6e0bbcbd4b75d99a53f4a9a7c3fd
      Reviewed-on: https://go-review.googlesource.com/101275
      Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      22115859
    • isharipo's avatar
      cmd/internal/obj/x86: faster Assemble for non-NaCl hosts · 8e3dd8ab
      isharipo authored
      Make span6 function (used as LinkArch.Assemble) faster
      by avoiding redundant re-assemble rounds on hosts
      that are not NaCl.
      
      NaCl is excluded because it needs Prog.Isize to fix alignment.
      
      For make.bash, there are around 50% of functions that can
      be encoded in a single trip. With this change, those function
      will be assembled with 1 round instead of 2.
      
      compilebench results:
      
          name        old time/op       new time/op       delta
          Template          305ms ± 2%        299ms ± 2%  -1.99%  (p=0.001 n=10+10)
          Unicode           139ms ± 3%        138ms ± 4%    ~     (p=0.222 n=9+9)
          GoTypes           1.05s ± 1%        1.04s ± 1%  -1.34%  (p=0.000 n=10+9)
          Compiler          4.78s ± 1%        4.71s ± 1%  -1.45%  (p=0.000 n=9+9)
          SSA               12.2s ± 1%        12.0s ± 1%  -1.90%  (p=0.000 n=9+10)
          Flate             204ms ± 3%        202ms ± 3%    ~     (p=0.052 n=10+10)
          GoParser          248ms ± 1%        244ms ± 2%  -1.79%  (p=0.000 n=10+9)
          Reflect           671ms ± 1%        664ms ± 1%  -0.96%  (p=0.001 n=9+9)
          Tar               287ms ± 2%        285ms ± 3%    ~     (p=0.393 n=10+10)
          XML               362ms ± 1%        353ms ± 2%  -2.60%  (p=0.000 n=10+9)
          StdCmd            29.2s ± 1%        29.0s ± 1%  -0.63%  (p=0.021 n=10+8)
          [Geo mean]        888ms             875ms       -1.40%
      
          name        old user-time/op  new user-time/op  delta
          Template          393ms ± 5%        373ms ± 8%  -5.12%  (p=0.013 n=9+10)
          Unicode           185ms ± 6%        184ms ± 5%    ~     (p=0.825 n=10+10)
          GoTypes           1.33s ± 1%        1.31s ± 3%  -1.60%  (p=0.004 n=10+10)
          Compiler          5.98s ± 3%        5.92s ± 1%    ~     (p=0.050 n=10+10)
          SSA               15.5s ± 2%        15.3s ± 0%    ~     (p=0.156 n=10+9)
          Flate             255ms ± 5%        252ms ± 5%    ~     (p=0.362 n=10+10)
          GoParser          309ms ± 1%        304ms ± 3%  -1.79%  (p=0.021 n=7+10)
          Reflect           839ms ± 2%        833ms ± 1%    ~     (p=0.160 n=10+9)
          Tar               363ms ± 3%        358ms ± 4%    ~     (p=0.194 n=8+10)
          XML               446ms ± 3%        442ms ± 3%    ~     (p=0.503 n=10+10)
          [Geo mean]        791ms             779ms       -1.55%
      
      Passes toolstash-check.
      
      Change-Id: Ibcdb09f2c28907932581b7566f46d34be292594b
      Reviewed-on: https://go-review.googlesource.com/108895
      Run-TryBot: Iskander Sharipov <iskander.sharipov@intel.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarCherry Zhang <cherryyz@google.com>
      8e3dd8ab
    • ludweeg's avatar
      crypto: make receiver name consistent · 0cd0dc96
      ludweeg authored
      Fixes go lint warning.
      
      Change-Id: I63950e7c70bf431e88a04f32befd50be9beacadf
      Reviewed-on: https://go-review.googlesource.com/108815Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      0cd0dc96
    • Matthew Dempsky's avatar
      cmd/compile: remove toolstash workaround in bexport.go · 545ef110
      Matthew Dempsky authored
      Change-Id: Ie4facdcab4b35cf7d350c4b8fa06a3c5a0c6caeb
      Reviewed-on: https://go-review.googlesource.com/108875
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      545ef110
    • Matthew Dempsky's avatar
      cmd/compile: replace Field.Nname.Pos with Field.Pos · 7759b32a
      Matthew Dempsky authored
      For struct fields and methods, Field.Nname was only used to store
      position information, which means we're allocating an entire ONAME
      Node+Name+Param structure just for one field. We can optimize away
      these ONAME allocations by instead adding a Field.Pos field.
      
      Unfortunately, we can't get rid of Field.Nname, because it's needed
      for function parameters, so Field grows a little bit and now has more
      redundant information in those cases. However, that was already the
      case (e.g., Field.Sym and Field.Nname.Sym), and it's still a net win
      for allocations as demonstrated by the benchmarks below.
      
      Additionally, by moving the ONAME allocation for function parameters
      to funcargs, we can avoid allocating them for function parameters that
      aren't used in corresponding function bodies (e.g., interface methods,
      function-typed variables, and imported functions/methods without
      inline bodies).
      
      name       old time/op       new time/op       delta
      Template         254ms ± 6%        251ms ± 6%  -1.04%  (p=0.000 n=487+488)
      Unicode          128ms ± 7%        128ms ± 7%    ~     (p=0.294 n=482+467)
      GoTypes          862ms ± 5%        860ms ± 4%    ~     (p=0.075 n=488+471)
      Compiler         3.91s ± 4%        3.90s ± 4%  -0.39%  (p=0.000 n=468+473)
      
      name       old user-time/op  new user-time/op  delta
      Template         339ms ±14%        336ms ±14%  -1.02%  (p=0.001 n=498+494)
      Unicode          176ms ±18%        176ms ±25%    ~     (p=0.940 n=491+499)
      GoTypes          1.13s ± 8%        1.13s ± 9%    ~     (p=0.157 n=496+493)
      Compiler         5.24s ± 6%        5.21s ± 6%  -0.57%  (p=0.000 n=485+489)
      
      name       old alloc/op      new alloc/op      delta
      Template        38.3MB ± 0%       37.3MB ± 0%  -2.58%  (p=0.000 n=499+497)
      Unicode         29.1MB ± 0%       29.1MB ± 0%  -0.03%  (p=0.000 n=500+493)
      GoTypes          116MB ± 0%        115MB ± 0%  -0.65%  (p=0.000 n=498+499)
      Compiler         492MB ± 0%        487MB ± 0%  -1.00%  (p=0.000 n=497+498)
      
      name       old allocs/op     new allocs/op     delta
      Template          364k ± 0%         360k ± 0%  -1.15%  (p=0.000 n=499+499)
      Unicode           336k ± 0%         336k ± 0%  -0.01%  (p=0.000 n=500+493)
      GoTypes          1.16M ± 0%        1.16M ± 0%  -0.30%  (p=0.000 n=499+499)
      Compiler         4.54M ± 0%        4.51M ± 0%  -0.58%  (p=0.000 n=494+495)
      
      Passes toolstash-check -gcflags=-dwarf=false. Changes DWARF output
      because position information is now tracked more precisely for
      function parameters.
      
      Change-Id: Ib8077d70d564cc448c5e4290baceab3a4396d712
      Reviewed-on: https://go-review.googlesource.com/108217
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarRobert Griesemer <gri@golang.org>
      7759b32a
    • Austin Clements's avatar
      cmd/compile: don't compact liveness maps in place · bdb65da0
      Austin Clements authored
      Currently Liveness.compact rewrites the Liveness.livevars slice in
      place. However, we're about to add register maps, which we'll want to
      track in livevars, but compact independently from the stack maps.
      Hence, this CL modifies Liveness.compact to consume Liveness.livevars
      and produce a new slice of deduplicated stack maps. This is somewhat
      clearer anyway because it avoids potential confusion over how
      Liveness.livevars is indexed.
      
      Passes toolstash -cmp.
      
      For #24543.
      
      Change-Id: I7093fbc71143f8a29e677aa30c96e501f953ca2b
      Reviewed-on: https://go-review.googlesource.com/108498
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarDavid Chase <drchase@google.com>
      bdb65da0
    • Elias Naur's avatar
      cmd/link: remove R_ADDR relocation workaround for macho arm64 · 3363e983
      Elias Naur authored
      The workarounds doesn't seem necessary anymore, and blocks DWARF
      on darwin/arm64.
      
      Updates #24883.
      
      Change-Id: Ic917c767d3b4f6c51be25566956296f5dd4ead10
      Reviewed-on: https://go-review.googlesource.com/108655
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: 's avatarCherry Zhang <cherryyz@google.com>
      3363e983
    • ludweeg's avatar
      crypto/md5: make receiver name consistent · 1f718926
      ludweeg authored
      Fixes go lint warning.
      
      Change-Id: I2d73208c6841f35d7a21a1fb4cfafbbd4250228f
      Reviewed-on: https://go-review.googlesource.com/108659Reviewed-by: 's avatarIskander Sharipov <iskander.sharipov@intel.com>
      Reviewed-by: 's avatarBrad Fitzpatrick <bradfitz@golang.org>
      Run-TryBot: Iskander Sharipov <iskander.sharipov@intel.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      1f718926
  4. 22 Apr, 2018 2 commits
  5. 21 Apr, 2018 6 commits
  6. 20 Apr, 2018 1 commit
    • quasilyte's avatar
      cmd/internal/obj/x86: disallow PC/FP/SB scaled index · d4874881
      quasilyte authored
      Reject to compile I386/AMD64 asm code that contains
      (Register)(PseudoReg*scale) forms of memory operands.
      Example of such program: "CALL (AX)(PC*2)".
      PseudoReg is one of the PC, FP, SB (but not SP).
      
      When pseudo-register is used in register indirect as
      scaled index base, x86 backend will panic because
      its register file misses SB/FP/PC registers.
      
      Fixes #12657.
      
      Change-Id: I30fca797b537cbc86ab47583ae96c6a0c59acaa1
      Reviewed-on: https://go-review.googlesource.com/107835
      Run-TryBot: Iskander Sharipov <iskander.sharipov@intel.com>
      Reviewed-by: 's avatarCherry Zhang <cherryyz@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      d4874881