1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307
|
# tls-parser
[](./LICENSE-MIT)
[](./LICENSE-APACHE)
[](https://crates.io/crates/tls-parser)
[](https://github.com/cpu/tls-parser/actions/workflows/rust.yml)
[](#rust-version-requirements)
# TLS Parser
A TLS parser, implemented with the [nom](https://github.com/Geal/nom)
parser combinator framework.
The goal of this parser is to implement TLS messages analysis, for example
to use rules from a network IDS, for ex during the TLS handshake.
It implements structures and parsing functions for records and messages, but
need additional code to handle fragmentation, or to fully inspect messages.
Parsing some TLS messages requires to know the previously selected parameters.
See [the rusticata TLS parser](https://github.com/rusticata/rusticata/blob/master/src/tls.rs)
for a full example.
It is written in pure Rust, fast, and makes extensive use of zero-copy. A lot of care is taken
to ensure security and safety of this crate, including design (recursion limit, defensive
programming), tests, and fuzzing. It also aims to be panic-free.
The code is available on [Github](https://github.com/rusticata/tls-parser)
and is part of the [Rusticata](https://github.com/rusticata) project.
## Parsing records
The main parsing functions are located in the [tls.rs](src/tls.rs) file. The entry functions are:
- `parse_tls_plaintext`: parses a record as plaintext
- `parse_tls_encrypted`: read an encrypted record. The parser has no crypto or decryption features, so the content
will be left as opaque data.
# Examples
```rust
use tls_parser::parse_tls_plaintext;
use tls_parser::nom::{Err, IResult};
let bytes : &[u8]= include_bytes!("../assets/client_hello_dhe.bin");
// [ 0x16, 0x03, 0x01 ... ];
let res = parse_tls_plaintext(&bytes);
match res {
Ok((rem,record)) => {
// rem is the remaining data (not parsed)
// record is an object of type TlsRecord
},
Err(Err::Incomplete(needed)) => {
eprintln!("Defragmentation required (TLS record)");
},
Err(e) => { eprintln!("parse_tls_record_with_header failed: {:?}",e); }
}
```
Note that knowing if a record is plaintext or not is the responsibility of the caller.
As reading TLS records may imply defragmenting records, some functions are
provided to only read the record as opaque data (which ensures the record is
complete and gives the record header) and then reading messages from data.
Here is an example of two-steps parsing:
```rust
// [ 0x16, 0x03, 0x01 ... ];
match parse_tls_raw_record(bytes) {
Ok((rem, ref r)) => {
match parse_tls_record_with_header(r.data, &r.hdr) {
Ok((rem2,ref msg_list)) => {
for msg in msg_list {
// msg has type TlsMessage
}
}
Err(Err::Incomplete(needed)) => { eprintln!("incomplete record") }
Err(_) => { eprintln!("error while parsing record") }
}
}
Err(Err::Incomplete(needed)) => { eprintln!("incomplete record header") }
Err(_) => { eprintln!("error while parsing record header") }
}
```
Some additional work is required if reading packets from the network, to support
reassembly of TCP segments and reassembly of TLS records.
For a complete example of a TLS parser supporting defragmentation and states, see the
[rusticata/src/tls.rs](https://github.com/rusticata/rusticata/blob/master/src/tls.rs) file of
the [rusticata](https://github.com/rusticata/rusticata) crate.
## State machine
A TLS state machine is provided in [tls_states.rs](src/tls_states.rs). The state machine is separated from the
parsing functions, and is almost independent.
It is implemented as a table of transitions, mainly for the handshake phase.
After reading a TLS message using the previous functions, the TLS state can be
updated using the `tls_state_transition` function. If the transition succeeds,
it returns `Ok(new_state)`, otherwise it returns `Err(error_state)`.
```rust
struct ParseContext {
state: TlsState,
}
match tls_state_transition(ctx.state, msg, to_server) {
Ok(s) => { ctx.state = s; Ok(()) }
Err(_) => {
ctx.state = TlsState::Invalid;
Err("Invalid state")
}
}
```
# Implementation notes
When parsing messages, if a field is an integer corresponding to an enum of known values,
it is not parsed as an enum type, but as an integer. While this complicates accesses,
it allows to read invalid values and continue parsing (for an IDS, it's better to read
values than to get a generic parse error).
<!-- cargo-sync-readme end -->
## Changes
### 0.12.2
- Reintroduce lifetime in `parse_content_and_signature`, compiler infers a wrong lifetime
when elided.
### 0.12.1
- Set MAX_RECORD_LEN to 2^14 + 256 for TLSCipherText (#72)
- Change parse_content_and_signature definition to elide useless lifetimes
### 0.12.0
- Set MSRV to 1.70 (required by num_enum)
- Make functions to parse handshake messages public (#66)
- Cargo: use phf without std if no_std was specified (#56)
- Fix parsing and export of RFCs for SCSV
- DTLS fragments as DTLSMessageHandshakeBody::Fragment
- Update ciphersuites with old drafts + new AEGIS
- Support SSLv3 ServerHello
- Improve TlsCipherSuite: add PRF and methods to get parameters
- Remove `rand_time` from `ClientHello`/`ServerHello` (this is deprecated since a long time, and makes
getting the entire random value difficult)
To access sub elements, use the `rand_time()` and `rand_bytes()` methods
Thanks: Daniel McCarney, Lucas Kent, Andrew Finn, Adrien Guinet, Martin Algesten, Benoit Lemarchand
### 0.11.0
- Upgrade to nom 7
- Add `ClientHello` trait for common CH attributes
- Get extensions in DTLS `ClientHello`
- Add example/helper program to list/query ciphersuite information
- Add pseudo-entries for `TLS_EMPTY_RENEGOTIATION_INFO_SCSV` and `TLS_FALLBACK_SCSV` (#16)
- Update ciphersuites file
- Added parsing for SignedCertificateTimestamp list
- Re-export nom
- Add support for `no_std`
Thanks: @JackLiar, @xonatius
### 0.10.0
- Upgrade to nom 6
- Remove all macro-base parsers (use functions, and nom-derive when possible)
- Add support for DTLS (Handshake)
- Add functions to parse extensions expected in Client/Server Hello
### 0.9.4
- In ServerHello, an empty SNI extension can be sent (RFC 6066)
### 0.9.3
- Fix error in state machine (wrong Client Certificate direction)
### 0.9.2
- Upgrade to phf 0.8
- Upgrade cookie-factory to 0.3.0
### 0.9.1
- Mark cookie-factory as optional (only used for serialization)
### 0.9.0
- Upgrade to nom 5
- Rustfmt
### 0.8.1
- Set edition to 2018
- Check heartbeat message length (subtraction could underflow)
- Add more checks for record length (RFC compliance, not for parser safety)
### 0.8.0
- Add support for record size limit extension
- Add support for encrypted server name (eSNI) extension
- State machine: use direction and support TLS 1.3 0-RTT
- State machine: add new state to indicate connection is closed (after fatal alert)
- Use TlsVersion type for SSL record version
- Update doc, and use cargo sync-readme
### 0.7.1
- Improve state machine, handle resumption failure, and non-fatal alerts
- Improve handling of Signature/Hash algorithms, and display
- Update ciphersuites to 2019-03-19
### 0.7.0
- Convert most enums to newtypes
- warning: this is a breaking change
- Update dependencies and remove unused crates
- Update ciphersuites to 2019-01-23
### 0.6.0
- Upgrade to nom 4.0
- warning: this is a breaking change
- Fix wrong extension ID for padding and signed timestamp
- Rewrite parse_cipher_suites and parse_compressions_algs to be faster
- Update ciphersuites to 2018-08-13
## Standards
Here is a non-exhaustive list of RFCs this parser is based on:
- [RFC 2246](https://tools.ietf.org/html/rfc2246): The TLS Protocol Version 1.0
- [RFC 4346](https://tools.ietf.org/html/rfc4346): The Transport Layer Security (TLS) Protocol Version 1.1
- [RFC 4366](https://tools.ietf.org/html/rfc4366): Transport Layer Security (TLS) Extensions
- [RFC 4492](https://tools.ietf.org/html/rfc4492): Elliptic Curve Cryptography (ECC) Cipher Suites for Transport Layer Security (TLS)
- [RFC 4507](https://tools.ietf.org/html/rfc4507): Transport Layer Security (TLS) Session
Resumption without Server-Side State
- [RFC 5077](https://tools.ietf.org/html/rfc5077): Transport Layer Security (TLS) Session
Resumption without Server-Side State
- [RFC 5246](https://tools.ietf.org/html/rfc5246): The Transport Layer Security (TLS) Protocol Version 1.2
- [RFC 5430](https://tools.ietf.org/html/rfc5430): Suite B Profile for Transport Layer Security (TLS)
- [RFC 5746](https://tools.ietf.org/html/rfc5746): Transport Layer Security (TLS) Renegotiation Indication Extension
- [RFC 6066](https://tools.ietf.org/html/rfc6066): Transport Layer Security (TLS) Extensions: Extension Definitions
- [RFC 6520](https://tools.ietf.org/html/rfc6520): Transport Layer Security (TLS) and
Datagram Transport Layer Security (DTLS) Heartbeat Extension
- [RFC 6961](https://tools.ietf.org/html/rfc6961): The Transport Layer Security (TLS)
Multiple Certificate Status Request Extension
- [RFC 7027](https://tools.ietf.org/html/rfc7027): Elliptic Curve Cryptography (ECC) Brainpool Curves
for Transport Layer Security (TLS)
- [RFC 7301](https://tools.ietf.org/html/rfc7301): Transport Layer Security (TLS)
Application-Layer Protocol Negotiation Extension
- [RFC 7366](https://tools.ietf.org/html/rfc7366): Encrypt-then-MAC for Transport Layer Security (TLS) and
Datagram Transport Layer Security (DTLS)
- [RFC 7627](https://tools.ietf.org/html/rfc7627): Transport Layer Security (TLS) Session Hash and
Extended Master Secret Extension
- [RFC 7919](https://tools.ietf.org/html/rfc7919): Negotiated Finite Field Diffie-Hellman Ephemeral Parameters
for Transport Layer Security (TLS)
- [RFC 8422](https://tools.ietf.org/html/rfc8422): Elliptic Curve Cryptography (ECC) Cipher Suites
for Transport Layer Security (TLS) Versions 1.2 and Earlier
- [RFC 8446](https://tools.ietf.org/html/rfc8446): The Transport Layer Security (TLS) Protocol Version 1.3
- [draft-agl-tls-nextprotoneg-03](https://tools.ietf.org/html/draft-agl-tls-nextprotoneg-03): Transport Layer Security (TLS) Next Protocol Negotiation Extension
## FAQ and limitations
### Can the parser decrypt a TLS session if I provide the master secret ?
No, it's not implemented
### Does the parser support TLS compression ?
No. Note that most TLS implementations disabled it after the FREAK attack, so
while detecting compression in `ServerHello` is possible in tls-parser, it
should probably be interpreted as an alert.
### Where are located the TLS CipherSuites ?
They are built when running `cargo build`.
To ease updating the list from the
[IANA TLS
parameters](http://www.iana.org/assignments/tls-parameters/tls-parameters.xml),
a script is provided ([scripts/extract-iana-ciphers.py](scripts/extract-iana-ciphers.py)).
This script will download and pre-parse the list from IANA, and produce a file containing
all ciphersuites names and parameters.
During the build, [build.rs](build.rs) parses this file and produces a static,
read-only hash table of all known ciphers and their properties.
## License
Licensed under either of
* Apache License, Version 2.0
([LICENSE-APACHE](LICENSE-APACHE) or http://www.apache.org/licenses/LICENSE-2.0)
* MIT license
([LICENSE-MIT](LICENSE-MIT) or http://opensource.org/licenses/MIT)
at your option.
## Contribution
Unless you explicitly state otherwise, any contribution intentionally submitted
for inclusion in the work by you, as defined in the Apache-2.0 license, shall be
dual licensed as above, without any additional terms or conditions.
|