Linux 3.16 released has been released on Sun, 3 Aug 2014.
Summary: This release improves performance with the support dynamically switch the clock frequency on Nvidia cards, it adds support for mapping user space memory into the GPU on Intel devices, XFS has a free inode btree for faster inode allocation, ARM 64 kernels can be used as EFI stubs, TCP Fast Open is supported in IPv6, some radeon devices have better performance thanks to improved power management support , Intel Cherryview graphics are supported, and control groups have gained an optional Unified Hierarchy mode, new drivers and many other small improvements have also been added.
Contents
-
Prominent features
- Nvidia graphics performance improvements, initial support for GK20A devices and GK110B
- Intel graphic driver allows mapping of user pages into video memory
- Unified Control Group hierarchy
- XFS free inode btree, for faster inode allocation
- Allow booting ARM 64 kernels as EFI stubs
- TCP Fast Open server mode on IPv6 support
- Intel Cherryview graphics support
- Radeon performance improvements through improved APU power management have been enabled in some APUs
- Drivers and architectures
- Core
- Memory management
- Block layer
- Power management
- File systems
- Networking
- Virtualization
- Tracing/perf
- Security
- Crypto
- Other news sites that track the changes of this release
1. Prominent features
1.1. Nvidia graphics performance improvements, initial support for GK20A devices and GK110B
Nouveau, the opensource driver for Nvidia graphic GPUs, has gained support for allowing to change the frequency of the GPU from the BIOS predefined values. This feature (which for now needs to be enabled manually) improves performance noticeably. The Nvidia GPUs that got reclocking support in this release are those with nv40, nvaa, and nve0 clock types.
This release also adds initial (but incomplete) support for NVidia GK20A graphic chips, found in Tegra K1 SoC; and GK110B devices
1.2. Intel graphic driver allows mapping of user pages into video memory
By exporting the ability to map user address into the video memory, normal application data can be used as a texture source or even as a render target (depending upon the capabilities of the chipset). This has a number of uses, with zero-copy downloads to the GPU and efficient readback making the intermixed streaming of CPU and GPU operations fairly efficient. This ability has many widespread implications from faster rendering of client-side software rasterisers (chromium), mitigation of stalls due to read back (firefox) and to faster pipelining of texture data (such as pixel buffer objects in GL or data blobs in CL).
Code: commit
1.3. Unified Control Group hierarchy
Control groups allow to create groups of arbitrary processes and apply CPU, disk or memory constraints to the processes in these control groups. The current implementation allows to create several hierarchies and apply different constraints to each hierarchy. For several reasons, detailed in the article below, this approach is not considered appropriate, and work has been done to migrate towards an implementation where only one hierarchy will be available. This release includes for first time this unified control group hierarchy (optional for now)
For more information, read this recommended LWN article: The unified control group hierarchy in 3.16
1.4. XFS free inode btree, for faster inode allocation
In this release, XFS has added a btree that tracks free inodes. It is equivalent to the existing inode allocation btree with the exception that the free inode btree tracks inode chunks with at least one free inode. The purpose is to improve lookups for free inode clusters for inode allocation.
This feature adds does not change existing on-disk structures, but adds a new one that must remain consistent with the inode allocation btree; for this reason older kernels will only be able to mount read-only filesystems with the free inode btree feature.
Code: merge commit
1.5. Allow booting ARM 64 kernels as EFI stubs
This release allows to boot a Linux kernel for ARM64 platforms as an an EFI application in systems with UEFI firmware, without needing a bootloader.
1.6. TCP Fast Open server mode on IPv6 support
TCP Fast Open is a TCP feature designed to make TCP connections faster. First support was added on Linux 3.6 for clients, in 3.7 support was added for servers and in 3.13 Fast Open was enabled by default. This release adds Fast Open support for IPv6 for servers.
Code: commit
1.7. Intel Cherryview graphics support
This release includes support for Broadwell GPUs http://en.wikipedia.org/wiki/Intel_HD_and_Iris_Graphics#Broadwell] to be found in Cherryview SoCs.
1.8. Radeon performance improvements through improved APU power management have been enabled in some APUs
bapm, or "bidirectional application power management" is a power management feature that allows the GPU and CPU to share TDP, so it allows for extra performance out of the GPU and CPU when the headroom is available. In this release, bpam has been enabled by default, but only in a few devices and cases. Future work will improve support for bapm.
2. Drivers and architectures
All the driver and architecture-specific changes can be found in the Linux_3.16-DriversArch page
3. Core
IPC System V shared memory: increase the defaults for SHMALL, SHMMAX commit
- locking
fanotify: create FAN_ACCESS event for readdir commit
Add "crash_kexec_post_notifiers" option for kdump after panic_notifers commit
Add initcall_blacklist kernel parameter commit
seccomp: use internal BPF JIT to speed up filters commit
Add generic PCI host controller driver commit
Virtual terminal: emulate 8- and 24-bit colour codes. commit
Add CONFIG_SGETMASK_SYSCALL option to optionally disable sys_sgetmask/sys_ssetmask syscalls, which are no longer supported by older libcscommit
proc: Allow for strict write position handling commit
4. Memory management
hugetlb: add support for gigantic page allocation at runtime instead of just at boot time commit
Introduce a CONFIG_DEBUG_VM_VMACACHE option to enable counting the cache hit rate -- exported in /proc/vmstat commit
Disable zone_reclaim_mode by default commit
memcontrol cgroup: remove hierarchy restrictions for swappiness and oom_control commit
non-atomically mark page accessed during page cache allocation where possible commit
Only force scan in reclaim when none of the LRUs are big enough. commit
Use proportional scanning during direct reclaim and full scan at DEF_PRIORITY commit
5. Block layer
lots of development effort has gone into blk-mq in the last 3 months. It's heading to where 3.16 will be a feature complete and performant blk-mq
raid5: speedup sync_request processing (speeds up rebuild speed) commit
raid5: add an option to avoid copy data from bio to stripe cache (in 1M randwrite tests, avoid the copy can increase the performance more than 30%) commit
rbd: add ioctl for blockdev command compatibility commit
ore: Support for raid 6 commit
6. Power management
- cpufreq
Introduce "nohibernate" boot parameter to disable both hibernation and resume at boot time - useful to use kernel features that are not compatible with hibernation commit
Introduce command line argument, relative_sleep_states, that allows to change the way in which the kernel assigns labels to system sleep states commit
7. File systems
- Btrfs
New ioctl TREE_SEARCH_V2: it allows the user to supply a buffer of varying size in which a tree search can store its results. This is much more flexible if you want to receive items which are larger than the current fixed buffer of 3992 bytes or if you want to fetch more items at once. Items larger than this buffer are for example some of the type EXTENT_CSUM commit
Implement O_TMPFILE support commit
Don't restrict DEV_INFO ioctl to CAP_SYSADMIN commit
Don't restrict FS_INFO ioctl to CAP_SYSADMIN commit
Retrieve more info from FS_INFO ioctl commit
Export more from FS_INFO to sysfs commit
Reclaim the reserved metadata space at background (improves performance in some cases) commit
async delayed refs: Allows to batch delayed extent operations commit
rework qgroup accounting commit
- XFS
- F2FS
- FAT
Add support for DOS 1.x formatted volumes commit
8. Networking
Add support to configure SR-IOV VF minimum and maximum transmission rate through ip tool commit
ipv6: setting IFA_F_MANAGETEMPADDR with RTM_DELADDR will indicate that the kernel should delete all related temporary addresses (similar to using IFA_F_MANAGETEMPADDR with RTM_NEWADDR commit
l2tp: Add support for zero IPv6 checksums commit
- Wireless
Support for dynamic channel bandwidth changes (e.g., HT 20/40 MHz changes)commit
macvlan: Support bonding events commit
macvlan: add netpoll support commit
Add Generic Segmentation Offload support for UDP tunnels with checksum commit
Implemement zero RX checksums for UDP/IPv6 (RFC 6936). RFC 6936 relaxes the requirement of RFC 2460 that UDP/IPv6 packets which are received with a zero UDP checksum value must be dropped. RFC 6936 allows zero checksums to support tunnels over UDP commit
Kernel-originated IP packets that have no user socket associated with them (e.g., ICMP errors and echo replies, TCP RSTs, etc.) are emitted with a mark of zero. Add a sysctl to make them have the same mark as the packet they are replying to commit
When using mark-based routing, sockets returned from accept() may need to be marked differently depending on the incoming connection request. This patch adds a sysctl to mark TCP sockets based on the fwmark of the incoming SYN packet. If enabled, and an unmarked socket receives a SYN, then the SYN packet's fwmark is written to the connection's inet_request_sock, and later written back to the accepted socket when the connection is established. If the socket already has a nonzero mark, then the behaviour is the same as it is today, i.e., the listening socket's fwmark is used commit
tcp: Add a TCP_FASTOPEN socket option to get a max backlog on its listner commit
tipc: add ioctl to fetch link names commit
tipc: add support for link state subscriptions commit
sctp: add support for busy polling to sctp protocol commit
vxlan: Add support for UDP checksums (v4 sending, v6 zero csums) commit, add x-netns support commit
tunnels - enable module autoloading commit
- netfilter
ethtool: Support for configurable RSS hash key commit
bonding: Support macvlans on top of TLB/RLB mode bonds commit
bridge: Automatically manage port promiscuous mode commit, commit
bridge: Add 802.1ad tx VLAN acceleration commit
bridge: Support 802.1ad VLAN filtering commit
Bluetooth: Perform directed advertising support through connect() when we're in peripheral mode commit
NFC: Add RAW socket type support for SOCKPROTO_RAW to read RAW NFC frames commit
- Wireless
Add an option to hint the wireless core that the user is operating in an indoor environment commit
cfg80211: Support multiple CSA counters commit
Allow userspace to take ownership of interfaces. If an interface is owned by a netlink socket, then it will be destroyed if the netlink socket is closed for any reason, including if the process it belongs to crashed. This gives us a race-free way to get rid of any such interfaces commit
9. Virtualization
hyperv: Add support for virtual Receive Side Scaling (vRSS) commit
xen-netback: Add support for multiple queues commit
xen-netfront: Add support for multiple queues commit
virtio-rng: support multiple virtio-rng devices commit
10. Tracing/perf
Allow probing on some kprobes and ftrace functions commit
Add tracepoint benchmark tracepoint commit
- perf
Add conditional branch filter 'cond' to perf record commit, commit, commit
diff: Add --percentage option commit
report: Add --children option commit, add --percentage option commit, add -F option to specify output fields commit, add report.children config option commit
Add dcacheline sort commit
Add hist.percentage config option commit
Allow hpp fields to be sort keys commit
top: Add --children option commit, add --fields option to specify output fields commit, add --percentage option commit, add top.children config option commit
tui: Add 'F' hotkey to toggle percentage output commit
11. Security
Smack: adds smackfs/ptrace interface commit
Smack: Verify read access on file open - v3 commit
audit: add netlink multicast socket with one group to kaudit for "best-effort" delivery to read-only userspace clients such as systemd commit
ima: audit log files opened with O_DIRECT flag commit
12. Crypto