path: root/src/crypto
Commit message | Author | Date | Files | Lines
* poly1305: do not require simd context for arch | Jason A. Donenfeld | 2018-09-17 | 8 | -22/+14
* crypto: make MIT | Jason A. Donenfeld | 2018-09-16 | 39 | -39/+39
* chacha20-arm: swap scalar and neon functions | Jason A. Donenfeld | 2018-09-13 | 1 | -697/+697
    This brings us closer to the original code.
* poly1305: precompute 5*r in init instead of blocks | Jason A. Donenfeld | 2018-09-12 | 2 | -6/+18
* curve25519-x86_64: remove useless define | Jason A. Donenfeld | 2018-09-12 | 1 | -1/+0
* chacha20: add constant for words in block | Jason A. Donenfeld | 2018-09-12 | 2 | -2/+3
* poly1305: rename finish to final | Jason A. Donenfeld | 2018-09-11 | 5 | -13/+13
* crypto: make sure UML is properly disabled | Jason A. Donenfeld | 2018-09-11 | 1 | -4/+4
* crypto: do not use compound literals in selftests | Jason A. Donenfeld | 2018-09-11 | 2 | -7704/+7710
    gcc can't apply section attributes to compound literals, so we can't mark
    the actual data as __initconst. We thus waste space instead, but this
    shouldn't matter much, since it's cleared after init anyway, and because
    this is only for debugging.
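    A minimal sketch of the gcc limitation the message refers to; the vector
    name and contents below are placeholders, not the actual selftest data:

        #include <linux/init.h>   /* __initconst */
        #include <linux/types.h>  /* u8 */

        /* A named constant object can carry the section attribute and be
         * placed with the rest of the init data: */
        static const u8 example_testvec[] __initconst = { 0xde, 0xad, 0xbe, 0xef };

        /* A compound literal cannot take the attribute, so vectors written
         * like this cannot be marked __initconst at all:
         *
         *     .input = (u8[]){ 0xde, 0xad, 0xbe, 0xef },
         */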
* blake2s-x86_64: fix whitespace errors | Jason A. Donenfeld | 2018-09-10 | 1 | -2/+2
* poly1305: switch to donna | Jason A. Donenfeld | 2018-09-10 | 3 | -183/+398
* poly1305: rewrite self tests from scratch | Jason A. Donenfeld | 2018-09-08 | 1 | -1529/+831
    This removes the old cruft and makes things a bit more idiomatic.
* compat: move simd.h from crypto to compat since it's going upstream | Jason A. Donenfeld | 2018-09-06 | 1 | -65/+0
* crypto: use CRYPTOGAMS license | Jason A. Donenfeld | 2018-09-06 | 9 | -23/+27
* curve25519: arm: do not modify sp directly | Jason A. Donenfeld | 2018-09-06 | 1 | -3/+3
    Thumb doesn't like this.
    Reported-by: Roman Mamedov <rm@romanrm.net>
* global: prefer sizeof(*pointer) when possible | Jason A. Donenfeld | 2018-09-04 | 2 | -2/+2
    Suggested-by: Sultan Alsawaf <sultanxda@gmail.com>
* crypto: import zinc | Jason A. Donenfeld | 2018-09-03 | 42 | -984/+14670
* curve25519-arm: prefix immediates with # | Jason A. Donenfeld | 2018-08-28 | 1 | -18/+18
* curve25519-arm: do not waste 32 bytes of stack | Jason A. Donenfeld | 2018-08-28 | 1 | -88/+88
* curve25519-arm: use ordinary prolog and epilogue | Samuel Neves | 2018-08-28 | 1 | -18/+6
    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
* curve25519-arm: add spaces after commas | Jason A. Donenfeld | 2018-08-28 | 1 | -2074/+2074
* curve25519-arm: cleanups from lkml | Jason A. Donenfeld | 2018-08-28 | 1 | -33/+30
    Suggested-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
* curve25519-arm: reformat | Jason A. Donenfeld | 2018-08-28 | 1 | -2096/+2096
* curve25519-x86_64: let the compiler decide when/how to load constants | Samuel Neves | 2018-08-28 | 1 | -5/+2
    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
* curve25519-hacl64: use formally verified C for comparisons | Jason A. Donenfeld | 2018-08-28 | 1 | -6/+19
    The previous code had been proved in Z3, but this new code from upstream
    KreMLin is generated directly from the F* source, which is preferable.
    The assembly generated is identical.
* crypto: use unaligned helpers | Jason A. Donenfeld | 2018-08-28 | 7 | -48/+51
    This is not useful for WireGuard, but for the general use case we
    probably want it this way, and the speed difference is mostly lost in
    the noise.
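    For context, a minimal sketch of the kind of access such a change
    typically replaces; the function name is illustrative and not taken from
    this tree:

        #include <asm/unaligned.h>  /* get_unaligned_le32() */
        #include <linux/types.h>

        static inline u32 load_le32(const u8 *p)
        {
            /* Casting and dereferencing (*(const u32 *)p) assumes both
             * alignment and native little-endian order; get_unaligned_le32()
             * makes both explicit and lets each architecture pick a safe,
             * fast instruction sequence. */
            return get_unaligned_le32(p);
        }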
* curve25519-hacl64: correct u64_gte_mask | Samuel Neves | 2018-08-07 | 1 | -3/+1
    Remove signed right shifts. Previously u64_gte_mask was only correct for
    x < 2^63. Z3 script proving correctness:

        >>> from z3 import *
        >>>
        >>> x = BitVec("x", 64)
        >>> y = BitVec("y", 64)
        >>>
        >>> t = LShR(x^((x^y)|((x-y)^y)), 63) - 1
        >>>
        >>> prove(If(UGE(x, y), BitVecVal(-1, 64), BitVecVal(0, 64)) == t)
        proved

    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
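    Rendered as plain C, the verified expression corresponds to something
    like the following sketch; the helper name follows the commit title, and
    the exact in-tree definition may differ in spelling:

        #include <linux/types.h>

        /* All-ones when x >= y, zero otherwise, with no signed shifts and no
         * data-dependent branches; the unsigned >> matches LShR above. */
        static inline u64 u64_gte_mask(u64 x, u64 y)
        {
            return ((x ^ ((x ^ y) | ((x - y) ^ y))) >> 63) - 1;
        }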
* curve25519-hacl64: simplify u64_eq_mask | Samuel Neves | 2018-08-07 | 1 | -8/+3
    Avoid signed right shift. Z3 script showing equivalence:

        >>> from z3 import *
        >>>
        >>> x = BitVec("x", 64)
        >>> y = BitVec("y", 64)
        >>>
        >>> # Before
        ... x_ = ~(x ^ y)
        >>> x_ &= x_ << 32
        >>> x_ &= x_ << 16
        >>> x_ &= x_ << 8
        >>> x_ &= x_ << 4
        >>> x_ &= x_ << 2
        >>> x_ &= x_ << 1
        >>> x_ >>= 63
        >>>
        >>> # After
        ... y_ = x ^ y
        >>> y_ = y_ | -y_
        >>> y_ = LShR(y_, 63) - 1
        >>>
        >>> prove(x_ == y_)
        proved

    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
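    The "after" form, again as a plain-C sketch with unsigned arithmetic in
    place of the bit-vector operations; the helper name mirrors the commit
    title rather than quoting the file:

        #include <linux/types.h>

        /* All-ones when x == y, zero otherwise: x ^ y is zero only on
         * equality, and for any nonzero r, (r | -r) has its top bit set. */
        static inline u64 u64_eq_mask(u64 x, u64 y)
        {
            u64 r = x ^ y;

            r |= (u64)0 - r;
            return (r >> 63) - 1;
        }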
* chacha20: use memmove in case buffers overlap | Jason A. Donenfeld | 2018-08-07 | 1 | -1/+1
    Suggested-by: Samuel Neves <sneves@dei.uc.pt>
* curve25519-x86_64: avoid use of r12 | Jason A. Donenfeld | 2018-08-07 | 1 | -107/+107
    This causes problems with RAP and KERNEXEC for PaX, as r12 is a reserved
    register.
    Suggested-by: PaX Team <pageexec@freemail.hu>
* crypto: move simd context to specific type | Jason A. Donenfeld | 2018-08-06 | 7 | -98/+106
    Suggested-by: Andy Lutomirski <luto@kernel.org>
* main: add missing chacha20poly1305 header | Jason A. Donenfeld | 2018-07-31 | 1 | -1/+0
* curve25519-x86_64: tighten reductions modulo 2^256-38 | Samuel Neves | 2018-07-28 | 1 | -21/+18
    At this stage the value of C[4] is at most
    ((2^256-1) + 38*(2^256-1)) / 2^256 = 38, so there is no need to use a
    wide multiplication.

    Change inspired by Andy Polyakov's OpenSSL implementation.

    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
* curve25519-x86_64: simplify the final reduction by adding 19 beforehand | Samuel Neves | 2018-07-28 | 1 | -40/+26
    Correctness can be quickly verified with the following z3py script:

        >>> from z3 import *
        >>> x = BitVec("x", 256)  # any 256-bit value
        >>> ref = URem(x, 2**255 - 19)  # correct value
        >>> t = Extract(255, 255, x); x &= 2**255 - 1;  # btrq $63, %3
        >>> u = If(t != 0, BitVecVal(38, 256), BitVecVal(19, 256))  # cmovncl %k5, %k4
        >>> x += u  # addq %4, %0; adcq $0, %1; adcq $0, %2; adcq $0, %3;
        >>> t = Extract(255, 255, x); x &= 2**255 - 1;  # btrq $63, %3
        >>> u = If(t != 0, BitVecVal(0, 256), BitVecVal(19, 256))  # cmovncl %k5, %k4
        >>> x -= u  # subq %4, %0; sbbq $0, %1; sbbq $0, %2; sbbq $0, %3;
        >>> prove(x == ref)
        proved

    Change inspired by Andy Polyakov's OpenSSL implementation.

    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
* curve25519-x86_64: tighten the x25519 assembly | Samuel Neves | 2018-07-28 | 1 | -3/+3
    The wide multiplication by 38 in mul_a24_eltfp25519_1w is redundant:
    (2^256-1) * 121666 / 2^256 is at most 121665, and therefore a 64-bit
    multiplication can never overflow.

    Change inspired by Andy Polyakov's OpenSSL implementation.

    Signed-off-by: Samuel Neves <sneves@dei.uc.pt>
* simd: add missing header | Jason A. Donenfeld | 2018-06-22 | 1 | -0/+1
    Suggested-by: Shlomi Steinberg <shlomi@shlomisteinberg.com>
* poly1305: give linker the correct constant data section size | Jason A. Donenfeld | 2018-06-22 | 1 | -1/+1
    Otherwise these constants will be merged wrong or excluded, and we'll
    wind up with wrong calculations. While bfd (the normal kernel linker)
    doesn't seem to mind, recent versions of gold do bad things.
* poly1305: add missing string.h header | Jason A. Donenfeld | 2018-06-20 | 1 | -0/+1
    Reported-by: Peter Korsgaard <peter@korsgaard.com>
* simd: no need to restore fpu state when no preemption | Jason A. Donenfeld | 2018-06-17 | 1 | -0/+2
* simd: encapsulate fpu amortization into nice functions | Jason A. Donenfeld | 2018-06-17 | 3 | -47/+66
* chacha20poly1305: use slow crypto on -rt kernels on arm too | Jason A. Donenfeld | 2018-06-14 | 1 | -1/+1
* chacha20poly1305: use slow crypto on -rt kernels | Jason A. Donenfeld | 2018-06-13 | 1 | -1/+1
    In rt kernels, spinlocks call schedule(), which means preemption can't
    be disabled. The FPU disables preemption. Hence, we can either
    restructure things to move the calls to kernel_fpu_begin/end to be
    really close to the actual crypto routines, or we can do the slower
    lazier solution of just not using the FPU at all on -rt kernels. This
    patch goes with the latter lazy solution.

    The reason why we don't place the calls to kernel_fpu_begin/end close
    to the crypto routines in the first place is that they're very
    expensive, as it usually involves a call to XSAVE. So on sane kernels,
    we benefit from only having to call it once.
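    A rough sketch of the amortization trade-off the message describes,
    calling the x86 kernel_fpu_begin()/kernel_fpu_end() API directly; the
    per-block routine and batching structure are illustrative, since the
    real code goes through the simd helper functions instead:

        #include <asm/fpu/api.h>  /* kernel_fpu_begin(), kernel_fpu_end() */
        #include <linux/types.h>

        void chacha20_block_simd(u8 *block);  /* hypothetical SIMD block routine */

        static void encrypt_blocks(u8 *blocks, size_t nblocks)
        {
            size_t i;

            /* One expensive XSAVE is amortized over the whole batch... */
            kernel_fpu_begin();
            for (i = 0; i < nblocks; ++i)
                chacha20_block_simd(blocks + i * 64);
            kernel_fpu_end();
            /* ...instead of paying for a begin/end pair around every block.
             * Preemption stays disabled for the whole loop, which is what an
             * -rt kernel's sleeping spinlocks cannot tolerate, hence the
             * fallback to the scalar code there. */
        }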
* chacha20: add missing include to header | Jason A. Donenfeld | 2018-06-02 | 1 | -0/+1
* poly1305: mips: compute S on fly | René van Dorst | 2018-05-31 | 1 | -31/+22
    This reduces memory access and the total opaque size.
    Signed-off-by: René van Dorst <opensource@vdorst.com>
* crypto: consistent constification | Jason A. Donenfeld | 2018-05-31 | 6 | -23/+23
* chacha20poly1305: combine stack variables into union | Jason A. Donenfeld | 2018-05-31 | 1 | -54/+53
* chacha20poly1305: split up into separate files | Jason A. Donenfeld | 2018-05-31 | 6 | -614/+724
* curve25519: x86_64: make symbol static | Jason A. Donenfeld | 2018-05-29 | 1 | -2/+2
* curve25519: x86_64: satisfy sparse | Jason A. Donenfeld | 2018-05-29 | 1 | -260/+260
* chacha20poly1305: add mips32 implementation | René van Dorst | 2018-05-18 | 3 | -5/+912
    Signed-off-by: René van Dorst <opensource@vdorst.com>