'Re: [tor-dev] New revision: Proposal 295: Using ADL for relay cryptography (solving the crypto-taggi'

[prev in list] [next in list] [prev in thread] [next in thread] 

List: tor-dev
Subject: Re: [tor-dev] New revision: Proposal 295: Using ADL for relay cryptography (solving the crypto-taggi
From: Watson Ladd <watsonbladd () gmail ! com>
Date: 2019-03-18 16:04:41
Message-ID: CACsn0c=Zhw6+vXQHMkvxZHsf0Shc4TxoPEgNBk4y_NzYTO+H1g () mail ! gmail ! com
[Download RAW message or body]

[Attachment #2 (multipart/alternative)]

Some comments: some purely editorial, some substantive.
Editorial: stuff is xored with zero, the concatenation language is not used
consistently. I found it difficult to understand the proposed scheme and
check equivalence to the paper. Maybe some more words to explain the
layering would help.

Substantive: Does it matter that it is possible to compute a message that
doesnt change the digest if you know the key?

On Fri, Mar 1, 2019 at 9:05 AM Nick Mathewson <nickm@torproject.org> wrote:
>
> Hi!
>
> I'm sending a new version of proposal 295 from Tomer Ashur, Orr
> Dunkelman, and Atul Luykx.  It's an updated version of their design
> for an improved relay cell encryption scheme, to prevent tagging
> attacks.
>
> This proposal is checked into the torspec repository.  I'm also
> linking to a diagram for this scheme (and its latex source) from Atul
> Luykx: https://people.torproject.org/~nickm/prop295/
>
> Finally, I have a draft python reference implementation for an older
> version of this proposal.  I hope to be updating it soon and sending
> out a link next week.
>
> cheers!  -- Nick
>
>
>
> Filename: 295-relay-crypto-with-adl.txt
> Title: Using ADL for relay cryptography (solving the crypto-tagging
attack)
> Author: Tomer Ashur, Orr Dunkelman, Atul Luykx
> Created: 22 Feb 2018
> Last-Modified: 1 March 2019
> Status: Open
>
>
> 0. Context
>
>    Although Crypto Tagging Attacks were identified already in the
>    original Tor design, it was not before the rise of the
>    Procyonidae in 2012 that their severity was fully realized. In
>    Proposal 202 (Two improved relay encryption protocols for Tor
>    cells) Nick Mathewson discussed two approaches to stymie tagging
>    attacks and generally improve Tor's cryptography. In Proposal 261
>    (AEZ for relay cryptography) Mathewson puts forward a concrete
>    approach which uses the tweakable wide-block cipher AEZ.
>
>    This proposal suggests an alternative approach to Proposal 261
>    using the notion of Release (of) Unverified Plaintext (RUP)
>    security. It describes an improved algorithm for circuit
>    encryption based on CTR-mode which is already used in Tor, and an
>    additional component for hashing.
>
>    Incidentally, and similar to Proposal 261, this proposal employs
>    the ENCODE-then-ENCIPHER approach thus it improves Tor's E2E
>    integrity by using (sufficient) redundancy.
>
>    For more information about the scheme and a security proof for
>    its RUP-security see
>
>        Tomer Ashur, Orr Dunkelman, Atul Luykx: Boosting
>        Authenticated Encryption Robustness with Minimal
>        Modifications. CRYPTO (3) 2017: 3-33
>
>    available online at https://eprint.iacr.org/2017/239 .
>
>    For authentication between the OP and the edge node we use
>    the PIV scheme: https://eprint.iacr.org/2013/835
>
> 2. Preliminaries
>
> 2.1 Motivation
>
>    For motivation, see proposal 202.
>
> 2.2. Notation
>
>    Symbol               Meaning
>    ------               -------
>    M                    Plaintext
>    C_I                  Ciphertext
>    CTR                  Counter Mode
>    N_I                  A de/encryption nonce (to be used in CTR-mode)
>    T_I                  A tweak (to be used to de/encrypt the nonce)
>    T'_I                 A running digest
>    ^                    XOR
>    ||                   Concatenation
>           (This is more readable than a single | but must be adapted
>           before integrating the proposal into tor-spec.txt)
>
> 2.3. Security parameters
>
>    HASH_LEN -- The length of the hash function's output, in bytes.
>
>    PAYLOAD_LEN -- The longest allowable cell payload, in bytes. (509)
>
>    DIG_KEY_LEN -- The key length used to digest messages (e.g.,
>    using GHASH). Since GHASH is only defined for 128-bit keys, we
>    recommend DIG_KEY_LEN = 128.
>
>    ENC_KEY_LEN -- The key length used for encryption (e.g., AES). We
>    recommend ENC_KEY_LEN = 128.
>
> 2.4. Key derivation (replaces Section 5.2.2)
>
>    For newer KDF needs, Tor uses the key derivation function HKDF
>    from RFC5869, instantiated with SHA256. The generated key
>    material is:
>
>                  K = K_1 | K_2 | K_3 | ...
>
>    where, if H(x,t) denotes HMAC_SHA256 with value x and key t,
>          and m_expand denotes an arbitrarily chosen value,
>          and INT8(i) is an octet with the value "i", then
>              K_1     = H(m_expand | INT8(1) , KEY_SEED )
>          and K_(i+1) = H(K_i | m_expand | INT8(i+1) , KEY_SEED ),
>    in RFC5869's vocabulary, this is HKDF-SHA256 with info ==
>    m_expand, salt == t_key, and IKM == secret_input.
>
>    When used in the ntor handshake a string of key material is
>    generated and is used in the following way:
>
>    Length       Purpose                         Notation
>    ------        -------                        --------
>    HASH_LEN     forward digest IV               DF      *
>    HASH_LEN     backward digest IV              DB      *
>    ENC_KEY_LEN  encryption key                  Kf
>    ENC_KEY_LEN  decryption key                  Kb
>    DIG_KEY_LEN  forward digest key              Khf
>    DIG_KEY_LEN  backward digest key             Khb
>    ENC_KEY_LEN  forward tweak key               Ktf
>    ENC_KEY_LEN  backward tweak key              Ktb
>    DIGEST_LEN   nonce to use in the                      *
>                   hidden service protocol
>
>       * I am not sure that we need these any longer.
>
>    Excess bytes from K are discarded.
>
> 2.6. Ciphers
>
>    For hashing(*) we use GHASH with a DIG_KEY_LEN-bit key. We write
>    this as Digest(K,M) where K is the key and M the message to be
>    hashed.
>
>    We use AES with an ENC_KEY_LEN-bit key. For AES encryption
>    (resp., decryption) we write E(K,X) (resp., D(K,X)) where K is an
>    ENC_KEY_LEN-bit key and X the block to be encrypted (resp.,
>    decrypted).
>
>    For a stream cipher, unless otherwise specified, we use
>    ENC_KEY_LEN-bit AES in counter mode, with a nonce that is
>    generated as explained below. We write this as Encrypt(K,N,X)
>    (resp., Decrypt(K,N,X)) where K is the key, N the nonce, and X
>    the message to be encrypted (resp., decrypted).
>
>    (*) The terms hash and digest are used interchangeably.
>
> 3. Routing relay cells
>
> 3.1. Forward Direction
>
>    The forward direction is the direction that CREATE/CREATE2 cells
>    are sent.
>
> 3.1.1. Routing from the Origin
>
>    Let n denote the integer representing the destination node. For
>    I = 1...n+1, T'_{I} is initialized to the 128-bit string consisting
>    entirely of '0's. When an OP sends a relay cell, they prepare the
>    cell as follows:
>
>         The OP prepares the authentication part of the message:
>
>                 C_{n+1} = M
>                 T_{n+1} = Digest(Khf_n,T'_{n+1}||C_{n+1})
>                 N_{n+1} = T_{n+1} ^ E(Ktf_n,T_{n+1} ^ 0)
>                 T'_{n+1} = T_{n+1}
>
>         Then, the OP prepares the multi-layered encryption:
>
>                 For I=n...1:
>                         C_I = Encrypt(Kf_I,N_{I+1},C_{I+1})
>                         T_I = Digest(Khf_I,T'_I||C_I)
>                         N_I = T_I ^ E(Ktf_I,T_I ^ N_{I+1})
>                         T'_I = T_I
>
>           The OP sends C_1 and N_1 to node 1.
>
> 3.1.2. Relaying Forward at Onion Routers
>
>    When a forward relay cell is received by OR I, it decrypts the
>    payload with the stream cipher, as follows:
>
>         'Forward' relay cell:
>
>                 T_I = Digest(Khf_I,T'_I||C_I)
>                 N_{I+1} = T_I ^ D(Ktf_I,T_I ^ N_I)
>                 C_{I+1} = Decrypt(Kf_I,N_{I+1},C_I)
>                 T'_I = T_I
>
>    The OR then decides whether it recognizes the relay cell as
>    described below. If the OR recognizes the cell, it processes the
>    contents of the relay cell. Otherwise, it passes C_{I+1}||N_{I+1}
>    along the circuit if the circuit continues.
>
>    For more information, see section 4 below.
>
> 3.2. Backward Direction
>
>    The backward direction is the opposite direction from
>    CREATE/CREATE2 cells.
>
> 3.2.1. Relaying Backward at Onion Routers
>
>    When a backward relay cell is received by OR I, it encrypts the
>    payload with the stream cipher, as follows:
>
>         'Backward' relay cell:
>
>                 T_I = Digest(Khb_I,T'_I||C_{I+1})
>                 N_I = T_I ^ E(Ktb_I,T_I ^ N_{I+1})
>                 C_I = Encrypt(Kb_I,N_I,C_{I+1})
>                 T'_I = T_I
>
>    with C_{n+1} = M and N_{n+1}=0. Once encrypted, the node passes
>    C_I and N_I along the circuit towards the OP.
>
> 3.2.2. Routing to the Origin
>
>    When a relay cell arrives at an OP, the OP decrypts the payload
>    with the stream cipher as follows:
>
>         OP receives relay cell from node 1:
>
>                 For I=1...n, where n is the end node on the circuit:
>                         C_{I+1} = Decrypt(Kb_I,N_I,C_I)
>                         T_I = Digest(Khb_I,T'_I||C_{I+1})
>                         N_{I+1} = T_I ^ D(Ktb_I,T_I ^ N_I)
>                         T'_I = T_I
>
>                 If the payload is recognized (see Section 4.1),
>                 then:
>
>                        The sending node is I. Stop, process the
>                        payload and authenticate.
>
> 4. Application connections and stream management
>
> 4.1. Relay cells
>
>   Within a circuit, the OP and the end node use the contents of
>   RELAY packets to tunnel end-to-end commands and TCP connections
>   ("Streams") across circuits. End-to-end commands can be initiated
>   by either edge; streams are initiated by the OP.
>
>         The payload of each unencrypted RELAY cell consists of:
>
>                 Relay command           [1 byte]
>                 'Recognized'            [2 bytes]
>                 StreamID                [2 bytes]
>                 Length                  [2 bytes]
>                 Data                    [PAYLOAD_LEN-23 bytes]
>
>    The 'recognized' field is used as a simple indication that the
>    cell is still encrypted. It is an optimization to avoid
>    calculating expensive digests for every cell. When sending cells,
>    the unencrypted 'recognized' MUST be set to zero.
>
>    When receiving and decrypting cells the 'recognized' will always
>    be zero if we're the endpoint that the cell is destined for. For
>    cells that we should relay, the 'recognized' field will usually
>    be nonzero, but will accidentally be zero with P=2^-16.
>
>    If the cell is recognized, the node moves to verifying the
>    authenticity of the message as follows(*):
>
>           forward direction (executed by the end node):
>
>                 T_{n+1} = Digest(Khf_n,T'_{n+1}||C_{n+1})
>                 Tag = T_{n+1} ^ D(Ktf_n,T_{n+1} ^ N_{n+1})
>                 T'_{n+1} = T_{n+1}
>
>                 The message is authenticated (i.e., M = C_{n+1}) if
>                 and only if Tag = 0
>
>           backward direction (executed by the OP):
>
>                 The message is authenticated (i.e., C_{n+1} = M) if
>                 and only if N_{n+1} = 0
>
>
>    The old Digest field is removed since sufficient information for
>    authentication is now included in the nonce part of the payload.
>
>        (*) we should consider dropping the 'recognized' field
>        altogether and always try to authenticate. Note that this is
>        an optimization question and the crypto works just as well
>        either way.
>
>    The 'Length' field of a relay cell contains the number of bytes
>    in the relay payload which contain real payload data. The
>    remainder of the payload is padding bytes.
>
> 4.2. Appending the encrypted nonce and dealing with version-homogenic
>      and version-heterogenic circuits
>
>    When a cell is prepared to be routed from the origin (see Section
>    3.1.1) the encrypted nonce N is appended to the encrypted cell
>    (occupying the last 16 bytes of the cell). If the cell is
>    prepared to be sent to a node supporting the new protocol, S is
>    combined with other sources to generate the layer's
>    nonce. Otherwise, if the node only supports the old protocol, n
>    is still appended to the encrypted cell (so that following nodes
>    can still recover their nonce), but a synchronized nonce (as per
>    the old protocol) is used in CTR-mode.
>
>    When a cell is sent along the circuit in the 'backward'
>    direction, nodes supporting the new protocol always assume that
>    the last 16 bytes of the input are the nonce used by the previous
>    node, which they process as per Section 3.2.1. If the previous
>    node also supports the new protocol, these cells are indeed the
>    nonce. If the previous node only supports the old protocol, these
>    bytes are either encrypted padding bytes or encrypted data.
>
> 5. Security
>
> 5.1. Resistance to crypto-tagging attacks
>
>    A crypto-tagging attack involves a circuit with two colluding
>    nodes and at least one honest node between them. The attack works
>    when one node makes a change to the cell (tagging) in a way that
>    can be undone by the other colluding party. In between, the
>    tagged cell is processed by honest nodes which do not detect the
>    change. The attack is possible due to the malleability property
>    of CTR-mode: a change to a ciphertext bit effects only the
>    respective plaintext bit in a predicatble way. This proposal
>    frustrates the crypto-tagging attack by linking the nonce to the
>    encrypted message such that any change to the ciphertext results
>    in a random nonce and hence, random plaintext.
>
>    Let us consider the following 3-hop scenario: the entry and end
>    nodes are malicious and colluding and the middle node is honest.
>
> 5.1.1. forward direction
>
>    Suppose that node I tags the ciphertext part of the message
>    (C'_{I+1} != C_{I+1}) then forwards it to the next node (I+1). As
>    per Section 3.1.2. Node I+1 digests C'_{I+1} to generate T_{I+1}
>    and N_{I+2}. Since C'_{I+2} is different than it should be, so
>    are the resulting T_{I+1} and N_{I+2}. Hence, decrypting C'_{I+2}
>    using these values results in a random string for C_{I+2}. Since
>    C_{I+2} is now just a random string, it is decrypted into a
>    random string and cannot be 'recognized' nor
>    authenticated. Furthermore, since C'_{I+1} is different than what
>    it should be, T'_{I+1} (i.e., the running digest of the middle
>    node) is now out of sync with that of the OP, which means that
>    all future cells sent through this node will decrypt into garbage
>    (random strings).
>
>    Likewise, suppose that instead of tagging the ciphertext, Node I
>    node tags the encrypted nonce N'_{I+1} != N_{I+1}. Now, when Node
>    I+1 digests the payload the tweak T_{I+1} is find, but using it
>    to decrypt N'_{I+1} again results in a random nonce for
>    N_{I+2}. This random nonce is used to decrypt C_{I+1} into a
>    random C'_{I+2} which is not recognized by the end node. Since
>    C_{I+2} is now a random string, the running digest of the end
>    node is now out of sync, which prevents the end node from
>    decrypting further cells.
>
> 5.1.2. Backward direction
>
>    In the backward direction the tagging is done by Node I+2
>    untagging by the Node I. Suppose first that Node I+2 tags the
>    ciphertext C_{I+2} and sends it to Node I+1. As per Section
>    3.2.1, Node I+1 first digests C_{I+2} and uses the resulting
>    T_{I+1} to generate a nonce N_{I+1}. From this it is clear that
>    any change introduced by Node I+2 influences the entire payload
>    and cannot be removed by Node I.
>
>    Unlike in Section 5.1.1., the cell is blindly delivered by Node I
>    to the OP which decrypts it. However, since the payload leaving
>    the end node was modified, the message cannot be authenticated by
>    the OP which can be trusted to tear down the circuit.
>
>    Suppose now that tagging is done by Node I+2 to the nonce part of
>    the payload, i.e., N_{I+2}. Since this value is encrypted by Node
>    I+1 to generate its own nonce N_{I+1}, again, a random nonce is
>    used which affects the entire keystream of CTR-mode. The cell
>    again cannot be authenticated by the OP and the circuit is torn
>    down.
>
>    We note that the end node can modify the plain message before
>    ever encrypting it and this cannot be discovered by the Tor
>    protocol. This vulnerability is outside the scope of this
>    proposal and users should always use TLS to make sure that their
>    application data is encrypted before it enters the Tor network.
>
> 5.2. End-to-end authentication
>
>    Similar to the old protocol, this proposal only offers end-to-end
>    authentication rather than per-hop authentication. However,
>    unlike the old protocol, the ADL-construction is non-malleable
>    and hence, once a non-authentic message was processed by an
>    honest node supporting the new protocol, it is effectively
>    destroyed for all nodes further down the circuit. This is because
>    the nonce used to de/encrypt all messages is linked to (a digest
>    of) the payload data.
>
>    As a result, while honest nodes cannot detect non-authentic
>    messages, such nodes still destroy the message thus invalidating
>    its authentication tag when it is checked by edge nodes. As a
>    result, security against crypto-tagging attacks is ensured as
>    long as an honest node supporting the new protocol processes the
>    message between two dishonest ones.
>
> 5.3 The Running Digest
>
>    Unlike the old protocol, the running digest is now computed as
>    the output of a GHASH call instead of a hash function call
>    (SHA256). Since GHASH does not provide the same type of security
>    guarantees as SHA256, it is worth discussing why security is not
>    lost from computing the running digest differently.
>
>    The running digets is used to ensure that if the same payload is
>    encrypted twice, then the resulting ciphertext does not remain
>    the same. Therefore, all that is needed is that the digest should
>    repeat with low probability. GHASH is a universal hash function,
>    hence it gives such a guarantee assuming its key is chosen
>    uniformly at random.
> _______________________________________________
> tor-dev mailing list
> tor-dev@lists.torproject.org
> https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev

[Attachment #5 (text/html)]

<div dir="auto">Some comments: some purely editorial, some substantive. 
Editorial: stuff is xored with zero, the concatenation language is not used \
consistently. I found it difficult to understand the proposed scheme and check \
equivalence to the paper. Maybe some more words to explain the layering would \
help. 
Substantive: Does it matter that it is possible to compute a message that doesnt \
change the digest if you know the key? </div> 
On Fri, Mar 1, 2019 at 9:05 AM Nick Mathewson &lt;<a \
href="mailto:nickm@torproject.org" target="_blank" \
rel="noreferrer">nickm@torproject.org</a>&gt; wrote: &gt; 
&gt; Hi! 
&gt; 
&gt; I&#39;m sending a new version of proposal 295 from Tomer Ashur, Orr 
&gt; Dunkelman, and Atul Luykx. It&#39;s an updated version of their design 
&gt; for an improved relay cell encryption scheme, to prevent tagging 
&gt; attacks. 
&gt; 
&gt; This proposal is checked into the torspec repository. I&#39;m also 
&gt; linking to a diagram for this scheme (and its latex source) from Atul 
&gt; Luykx: <a href="https://people.torproject.org/~nickm/prop295/" rel="noreferrer \
noreferrer" target="_blank">https://people.torproject.org/~nickm/prop295/</a> \
&gt; &gt; Finally, I have a draft python reference implementation for an \
older &gt; version of this proposal. I hope to be updating it soon and \
sending &gt; out a link next week. 
&gt; 
&gt; cheers! -- Nick 
&gt; 
&gt; 
&gt; 
&gt; Filename: 295-relay-crypto-with-adl.txt 
&gt; Title: Using ADL for relay cryptography (solving the crypto-tagging attack) 
&gt; Author: Tomer Ashur, Orr Dunkelman, Atul Luykx 
&gt; Created: 22 Feb 2018 
&gt; Last-Modified: 1 March 2019 
&gt; Status: Open 
&gt; 
&gt; 
&gt; 0. Context 
&gt; 
&gt; Although Crypto Tagging Attacks were identified already in the 
&gt; original Tor design, it was not before the rise of the 
&gt; Procyonidae in 2012 that their severity was fully realized. In 
&gt; Proposal 202 (Two improved relay encryption protocols for Tor 
&gt; cells) Nick Mathewson discussed two approaches to stymie tagging 
&gt; attacks and generally improve Tor&#39;s cryptography. In Proposal 261 
&gt; (AEZ for relay cryptography) Mathewson puts forward a concrete 
&gt; approach which uses the tweakable wide-block cipher AEZ. 
&gt; 
&gt; This proposal suggests an alternative approach to Proposal 261 
&gt; using the notion of Release (of) Unverified Plaintext (RUP) 
&gt; security. It describes an improved algorithm for circuit 
&gt; encryption based on CTR-mode which is already used in Tor, and an 
&gt; additional component for hashing. 
&gt; 
&gt; Incidentally, and similar to Proposal 261, this proposal employs 
&gt; the ENCODE-then-ENCIPHER approach thus it improves Tor&#39;s E2E 
&gt; integrity by using (sufficient) redundancy. 
&gt; 
&gt; For more information about the scheme and a security proof for 
&gt; its RUP-security see 
&gt; 
&gt; Tomer Ashur, Orr Dunkelman, Atul Luykx: Boosting 
&gt; Authenticated Encryption Robustness with Minimal 
&gt; Modifications. CRYPTO (3) 2017: 3-33 
&gt; 
&gt; available online at <a href="https://eprint.iacr.org/2017/239" \
rel="noreferrer noreferrer" target="_blank">https://eprint.iacr.org/2017/239</a> \
. &gt; 
&gt; For authentication between the OP and the edge node we use 
&gt; the PIV scheme: <a href="https://eprint.iacr.org/2013/835" rel="noreferrer \
noreferrer" target="_blank">https://eprint.iacr.org/2013/835</a> &gt; 
&gt; 2. Preliminaries 
&gt; 
&gt; 2.1 Motivation 
&gt; 
&gt; For motivation, see proposal 202. 
&gt; 
&gt; 2.2. Notation 
&gt; 
&gt; Symbol Meaning 
&gt; ------ ------- 
&gt; M Plaintext 
&gt; C_I Ciphertext 
&gt; CTR Counter Mode 
&gt; N_I A de/encryption nonce (to be used in \
CTR-mode) &gt; T_I A tweak (to be used to \
de/encrypt the nonce) &gt; T&#39;_I A running \
digest &gt; ^ XOR 
&gt; || Concatenation 
&gt; (This is more readable than a single | but must be adapted 
&gt; before integrating the proposal into tor-spec.txt) 
&gt; 
&gt; 2.3. Security parameters 
&gt; 
&gt; HASH_LEN -- The length of the hash function&#39;s output, in bytes. 
&gt; 
&gt; PAYLOAD_LEN -- The longest allowable cell payload, in bytes. (509) 
&gt; 
&gt; DIG_KEY_LEN -- The key length used to digest messages (e.g., 
&gt; using GHASH). Since GHASH is only defined for 128-bit keys, we 
&gt; recommend DIG_KEY_LEN = 128. 
&gt; 
&gt; ENC_KEY_LEN -- The key length used for encryption (e.g., AES). We 
&gt; recommend ENC_KEY_LEN = 128. 
&gt; 
&gt; 2.4. Key derivation (replaces Section 5.2.2) 
&gt; 
&gt; For newer KDF needs, Tor uses the key derivation function HKDF 
&gt; from RFC5869, instantiated with SHA256. The generated key 
&gt; material is: 
&gt; 
&gt; K = K_1 | K_2 | K_3 | ... 
&gt; 
&gt; where, if H(x,t) denotes HMAC_SHA256 with value x and key t, 
&gt; and m_expand denotes an arbitrarily chosen value, 
&gt; and INT8(i) is an octet with the value &quot;i&quot;, then 
&gt; K_1 = H(m_expand | INT8(1) , KEY_SEED ) 
&gt; and K_(i+1) = H(K_i | m_expand | INT8(i+1) , KEY_SEED ), 
&gt; in RFC5869&#39;s vocabulary, this is HKDF-SHA256 with info == 
&gt; m_expand, salt == t_key, and IKM == secret_input. 
&gt; 
&gt; When used in the ntor handshake a string of key material is 
&gt; generated and is used in the following way: 
&gt; 
&gt; Length Purpose Notation 
&gt; ------ ------- -------- 
&gt; HASH_LEN forward digest IV DF * 
&gt; HASH_LEN backward digest IV DB * 
&gt; ENC_KEY_LEN encryption key Kf 
&gt; ENC_KEY_LEN decryption key Kb 
&gt; DIG_KEY_LEN forward digest key Khf 
&gt; DIG_KEY_LEN backward digest key Khb 
&gt; ENC_KEY_LEN forward tweak key Ktf 
&gt; ENC_KEY_LEN backward tweak key Ktb 
&gt; DIGEST_LEN nonce to use in the * 
&gt; hidden service protocol 
&gt; 
&gt; * I am not sure that we need these any longer. 
&gt; 
&gt; Excess bytes from K are discarded. 
&gt; 
&gt; 2.6. Ciphers 
&gt; 
&gt; For hashing(*) we use GHASH with a DIG_KEY_LEN-bit key. We write 
&gt; this as Digest(K,M) where K is the key and M the message to be 
&gt; hashed. 
&gt; 
&gt; We use AES with an ENC_KEY_LEN-bit key. For AES encryption 
&gt; (resp., decryption) we write E(K,X) (resp., D(K,X)) where K is an 
&gt; ENC_KEY_LEN-bit key and X the block to be encrypted (resp., 
&gt; decrypted). 
&gt; 
&gt; For a stream cipher, unless otherwise specified, we use 
&gt; ENC_KEY_LEN-bit AES in counter mode, with a nonce that is 
&gt; generated as explained below. We write this as Encrypt(K,N,X) 
&gt; (resp., Decrypt(K,N,X)) where K is the key, N the nonce, and X 
&gt; the message to be encrypted (resp., decrypted). 
&gt; 
&gt; (*) The terms hash and digest are used interchangeably. 
&gt; 
&gt; 3. Routing relay cells 
&gt; 
&gt; 3.1. Forward Direction 
&gt; 
&gt; The forward direction is the direction that CREATE/CREATE2 cells 
&gt; are sent. 
&gt; 
&gt; 3.1.1. Routing from the Origin 
&gt; 
&gt; Let n denote the integer representing the destination node. For 
&gt; I = 1...n+1, T&#39;_{I} is initialized to the 128-bit string consisting 
&gt; entirely of &#39;0&#39;s. When an OP sends a relay cell, they prepare \
the &gt; cell as follows: 
&gt; 
&gt; The OP prepares the authentication part of the message: 
&gt; 
&gt; C_{n+1} = M 
&gt; T_{n+1} = Digest(Khf_n,T&#39;_{n+1}||C_{n+1}) 
&gt; N_{n+1} = T_{n+1} ^ E(Ktf_n,T_{n+1} ^ 0) 
&gt; T&#39;_{n+1} = T_{n+1} 
&gt; 
&gt; Then, the OP prepares the multi-layered encryption: 
&gt; 
&gt; For I=n...1: 
&gt; C_I = Encrypt(Kf_I,N_{I+1},C_{I+1}) 
&gt; T_I = Digest(Khf_I,T&#39;_I||C_I) 
&gt; N_I = T_I ^ E(Ktf_I,T_I ^ N_{I+1}) 
&gt; T&#39;_I = T_I 
&gt; 
&gt; The OP sends C_1 and N_1 to node 1. 
&gt; 
&gt; 3.1.2. Relaying Forward at Onion Routers 
&gt; 
&gt; When a forward relay cell is received by OR I, it decrypts the 
&gt; payload with the stream cipher, as follows: 
&gt; 
&gt; &#39;Forward&#39; relay cell: 
&gt; 
&gt; T_I = Digest(Khf_I,T&#39;_I||C_I) 
&gt; N_{I+1} = T_I ^ D(Ktf_I,T_I ^ N_I) 
&gt; C_{I+1} = Decrypt(Kf_I,N_{I+1},C_I) 
&gt; T&#39;_I = T_I 
&gt; 
&gt; The OR then decides whether it recognizes the relay cell as 
&gt; described below. If the OR recognizes the cell, it processes the 
&gt; contents of the relay cell. Otherwise, it passes C_{I+1}||N_{I+1} 
&gt; along the circuit if the circuit continues. 
&gt; 
&gt; For more information, see section 4 below. 
&gt; 
&gt; 3.2. Backward Direction 
&gt; 
&gt; The backward direction is the opposite direction from 
&gt; CREATE/CREATE2 cells. 
&gt; 
&gt; 3.2.1. Relaying Backward at Onion Routers 
&gt; 
&gt; When a backward relay cell is received by OR I, it encrypts the 
&gt; payload with the stream cipher, as follows: 
&gt; 
&gt; &#39;Backward&#39; relay cell: 
&gt; 
&gt; T_I = Digest(Khb_I,T&#39;_I||C_{I+1}) 
&gt; N_I = T_I ^ E(Ktb_I,T_I ^ N_{I+1}) 
&gt; C_I = Encrypt(Kb_I,N_I,C_{I+1}) 
&gt; T&#39;_I = T_I 
&gt; 
&gt; with C_{n+1} = M and N_{n+1}=0. Once encrypted, the node passes 
&gt; C_I and N_I along the circuit towards the OP. 
&gt; 
&gt; 3.2.2. Routing to the Origin 
&gt; 
&gt; When a relay cell arrives at an OP, the OP decrypts the payload 
&gt; with the stream cipher as follows: 
&gt; 
&gt; OP receives relay cell from node 1: 
&gt; 
&gt; For I=1...n, where n is the end node on the \
circuit: &gt; C_{I+1} = \
Decrypt(Kb_I,N_I,C_I) &gt; T_I = \
Digest(Khb_I,T&#39;_I||C_{I+1}) &gt; N_{I+1} \
= T_I ^ D(Ktb_I,T_I ^ N_I) &gt; T&#39;_I = \
T_I &gt; 
&gt; If the payload is recognized (see Section 4.1), 
&gt; then: 
&gt; 
&gt; The sending node is I. Stop, process the 
&gt; payload and authenticate. 
&gt; 
&gt; 4. Application connections and stream management 
&gt; 
&gt; 4.1. Relay cells 
&gt; 
&gt; Within a circuit, the OP and the end node use the contents of 
&gt; RELAY packets to tunnel end-to-end commands and TCP connections 
&gt; (&quot;Streams&quot;) across circuits. End-to-end commands can be \
initiated &gt; by either edge; streams are initiated by the OP. 
&gt; 
&gt; The payload of each unencrypted RELAY cell consists of: 
&gt; 
&gt; Relay command [1 byte] 
&gt; &#39;Recognized&#39; [2 bytes] 
&gt; StreamID [2 bytes] 
&gt; Length [2 bytes] 
&gt; Data [PAYLOAD_LEN-23 \
bytes] &gt; 
&gt; The &#39;recognized&#39; field is used as a simple indication that the 
&gt; cell is still encrypted. It is an optimization to avoid 
&gt; calculating expensive digests for every cell. When sending cells, 
&gt; the unencrypted &#39;recognized&#39; MUST be set to zero. 
&gt; 
&gt; When receiving and decrypting cells the &#39;recognized&#39; will \
always &gt; be zero if we&#39;re the endpoint that the cell is destined for. \
For &gt; cells that we should relay, the &#39;recognized&#39; field will \
usually &gt; be nonzero, but will accidentally be zero with P=2^-16. 
&gt; 
&gt; If the cell is recognized, the node moves to verifying the 
&gt; authenticity of the message as follows(*): 
&gt; 
&gt; forward direction (executed by the end node): 
&gt; 
&gt; T_{n+1} = Digest(Khf_n,T&#39;_{n+1}||C_{n+1}) 
&gt; Tag = T_{n+1} ^ D(Ktf_n,T_{n+1} ^ N_{n+1}) 
&gt; T&#39;_{n+1} = T_{n+1} 
&gt; 
&gt; The message is authenticated (i.e., M = C_{n+1}) if 
&gt; and only if Tag = 0 
&gt; 
&gt; backward direction (executed by the OP): 
&gt; 
&gt; The message is authenticated (i.e., C_{n+1} = M) if 
&gt; and only if N_{n+1} = 0 
&gt; 
&gt; 
&gt; The old Digest field is removed since sufficient information for 
&gt; authentication is now included in the nonce part of the payload. 
&gt; 
&gt; (*) we should consider dropping the &#39;recognized&#39; field 
&gt; altogether and always try to authenticate. Note that this is 
&gt; an optimization question and the crypto works just as well 
&gt; either way. 
&gt; 
&gt; The &#39;Length&#39; field of a relay cell contains the number of bytes 
&gt; in the relay payload which contain real payload data. The 
&gt; remainder of the payload is padding bytes. 
&gt; 
&gt; 4.2. Appending the encrypted nonce and dealing with version-homogenic 
&gt; and version-heterogenic circuits 
&gt; 
&gt; When a cell is prepared to be routed from the origin (see Section 
&gt; 3.1.1) the encrypted nonce N is appended to the encrypted cell 
&gt; (occupying the last 16 bytes of the cell). If the cell is 
&gt; prepared to be sent to a node supporting the new protocol, S is 
&gt; combined with other sources to generate the layer&#39;s 
&gt; nonce. Otherwise, if the node only supports the old protocol, n 
&gt; is still appended to the encrypted cell (so that following nodes 
&gt; can still recover their nonce), but a synchronized nonce (as per 
&gt; the old protocol) is used in CTR-mode. 
&gt; 
&gt; When a cell is sent along the circuit in the &#39;backward&#39; 
&gt; direction, nodes supporting the new protocol always assume that 
&gt; the last 16 bytes of the input are the nonce used by the previous 
&gt; node, which they process as per Section 3.2.1. If the previous 
&gt; node also supports the new protocol, these cells are indeed the 
&gt; nonce. If the previous node only supports the old protocol, these 
&gt; bytes are either encrypted padding bytes or encrypted data. 
&gt; 
&gt; 5. Security 
&gt; 
&gt; 5.1. Resistance to crypto-tagging attacks 
&gt; 
&gt; A crypto-tagging attack involves a circuit with two colluding 
&gt; nodes and at least one honest node between them. The attack works 
&gt; when one node makes a change to the cell (tagging) in a way that 
&gt; can be undone by the other colluding party. In between, the 
&gt; tagged cell is processed by honest nodes which do not detect the 
&gt; change. The attack is possible due to the malleability property 
&gt; of CTR-mode: a change to a ciphertext bit effects only the 
&gt; respective plaintext bit in a predicatble way. This proposal 
&gt; frustrates the crypto-tagging attack by linking the nonce to the 
&gt; encrypted message such that any change to the ciphertext results 
&gt; in a random nonce and hence, random plaintext. 
&gt; 
&gt; Let us consider the following 3-hop scenario: the entry and end 
&gt; nodes are malicious and colluding and the middle node is honest. 
&gt; 
&gt; 5.1.1. forward direction 
&gt; 
&gt; Suppose that node I tags the ciphertext part of the message 
&gt; (C&#39;_{I+1} != C_{I+1}) then forwards it to the next node (I+1). As 
&gt; per Section 3.1.2. Node I+1 digests C&#39;_{I+1} to generate T_{I+1} 
&gt; and N_{I+2}. Since C&#39;_{I+2} is different than it should be, so 
&gt; are the resulting T_{I+1} and N_{I+2}. Hence, decrypting C&#39;_{I+2} 
&gt; using these values results in a random string for C_{I+2}. Since 
&gt; C_{I+2} is now just a random string, it is decrypted into a 
&gt; random string and cannot be &#39;recognized&#39; nor 
&gt; authenticated. Furthermore, since C&#39;_{I+1} is different than what 
&gt; it should be, T&#39;_{I+1} (i.e., the running digest of the middle 
&gt; node) is now out of sync with that of the OP, which means that 
&gt; all future cells sent through this node will decrypt into garbage 
&gt; (random strings). 
&gt; 
&gt; Likewise, suppose that instead of tagging the ciphertext, Node I 
&gt; node tags the encrypted nonce N&#39;_{I+1} != N_{I+1}. Now, when Node 
&gt; I+1 digests the payload the tweak T_{I+1} is find, but using it 
&gt; to decrypt N&#39;_{I+1} again results in a random nonce for 
&gt; N_{I+2}. This random nonce is used to decrypt C_{I+1} into a 
&gt; random C&#39;_{I+2} which is not recognized by the end node. Since 
&gt; C_{I+2} is now a random string, the running digest of the end 
&gt; node is now out of sync, which prevents the end node from 
&gt; decrypting further cells. 
&gt; 
&gt; 5.1.2. Backward direction 
&gt; 
&gt; In the backward direction the tagging is done by Node I+2 
&gt; untagging by the Node I. Suppose first that Node I+2 tags the 
&gt; ciphertext C_{I+2} and sends it to Node I+1. As per Section 
&gt; 3.2.1, Node I+1 first digests C_{I+2} and uses the resulting 
&gt; T_{I+1} to generate a nonce N_{I+1}. From this it is clear that 
&gt; any change introduced by Node I+2 influences the entire payload 
&gt; and cannot be removed by Node I. 
&gt; 
&gt; Unlike in Section 5.1.1., the cell is blindly delivered by Node I 
&gt; to the OP which decrypts it. However, since the payload leaving 
&gt; the end node was modified, the message cannot be authenticated by 
&gt; the OP which can be trusted to tear down the circuit. 
&gt; 
&gt; Suppose now that tagging is done by Node I+2 to the nonce part of 
&gt; the payload, i.e., N_{I+2}. Since this value is encrypted by Node 
&gt; I+1 to generate its own nonce N_{I+1}, again, a random nonce is 
&gt; used which affects the entire keystream of CTR-mode. The cell 
&gt; again cannot be authenticated by the OP and the circuit is torn 
&gt; down. 
&gt; 
&gt; We note that the end node can modify the plain message before 
&gt; ever encrypting it and this cannot be discovered by the Tor 
&gt; protocol. This vulnerability is outside the scope of this 
&gt; proposal and users should always use TLS to make sure that their 
&gt; application data is encrypted before it enters the Tor network. 
&gt; 
&gt; 5.2. End-to-end authentication 
&gt; 
&gt; Similar to the old protocol, this proposal only offers end-to-end 
&gt; authentication rather than per-hop authentication. However, 
&gt; unlike the old protocol, the ADL-construction is non-malleable 
&gt; and hence, once a non-authentic message was processed by an 
&gt; honest node supporting the new protocol, it is effectively 
&gt; destroyed for all nodes further down the circuit. This is because 
&gt; the nonce used to de/encrypt all messages is linked to (a digest 
&gt; of) the payload data. 
&gt; 
&gt; As a result, while honest nodes cannot detect non-authentic 
&gt; messages, such nodes still destroy the message thus invalidating 
&gt; its authentication tag when it is checked by edge nodes. As a 
&gt; result, security against crypto-tagging attacks is ensured as 
&gt; long as an honest node supporting the new protocol processes the 
&gt; message between two dishonest ones. 
&gt; 
&gt; 5.3 The Running Digest 
&gt; 
&gt; Unlike the old protocol, the running digest is now computed as 
&gt; the output of a GHASH call instead of a hash function call 
&gt; (SHA256). Since GHASH does not provide the same type of security 
&gt; guarantees as SHA256, it is worth discussing why security is not 
&gt; lost from computing the running digest differently. 
&gt; 
&gt; The running digets is used to ensure that if the same payload is 
&gt; encrypted twice, then the resulting ciphertext does not remain 
&gt; the same. Therefore, all that is needed is that the digest should 
&gt; repeat with low probability. GHASH is a universal hash function, 
&gt; hence it gives such a guarantee assuming its key is chosen 
&gt; uniformly at random. 
&gt; _______________________________________________ 
&gt; tor-dev mailing list 
&gt; <a href="mailto:tor-dev@lists.torproject.org" target="_blank" \
rel="noreferrer">tor-dev@lists.torproject.org</a> &gt; <a \
href="https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev" rel="noreferrer \
noreferrer" target="_blank">https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev</a>

[Attachment #6 (text/plain)]

_______________________________________________
tor-dev mailing list
tor-dev@lists.torproject.org
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev

[prev in list] [next in list] [prev in thread] [next in thread]