Consistency: consume a single character at a time during attribute name state #519

jayaddison · 2020-12-29T14:58:10Z

This is a small consistency fixup relating to the way that attribute names are retrieved; it also makes some follow-up refactoring work a little cleaner.

Parsing continues fine if we consume a single character at a time during attribute name tokenization, and this doesn't appear to affect performance positively or negatively.
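To illustrate the equivalence claimed above, here is a minimal, self-contained sketch. It is not the html5lib code: `ToyStream`, `chars_until`, `name_batched` and `name_single` are hypothetical names made up for this example, and the sketch only treats ASCII letters as name characters, which is a simplification.

```python
import string

ASCII_LETTERS = frozenset(string.ascii_letters)


class ToyStream:
    """Hypothetical stand-in for a tokenizer input stream (not html5lib's class)."""

    def __init__(self, text):
        self.text = text
        self.pos = 0

    def char(self):
        """Return the next character, or None at end of input."""
        if self.pos >= len(self.text):
            return None
        ch = self.text[self.pos]
        self.pos += 1
        return ch

    def chars_until(self, characters, opposite=False):
        """Consume a run of characters and return it.

        By default, stop at the first character that IS in `characters`.
        With opposite=True, stop at the first character that is NOT in it.
        """
        start = self.pos
        while self.pos < len(self.text):
            in_set = self.text[self.pos] in characters
            if in_set != opposite:
                break
            self.pos += 1
        return self.text[start:self.pos]


def name_batched(stream):
    """Read a name by taking one letter, then the rest of the letter run in bulk."""
    name = ""
    while True:
        ch = stream.char()
        if ch is None or ch not in ASCII_LETTERS:
            return name, ch  # ch is the character that ended the name
        name += ch + stream.chars_until(ASCII_LETTERS, opposite=True)


def name_single(stream):
    """Read a name strictly one character per loop iteration."""
    name = ""
    while True:
        ch = stream.char()
        if ch is None or ch not in ASCII_LETTERS:
            return name, ch
        name += ch
```

Under these assumptions both functions return the same name and the same terminating character for a given input; they differ only in how many loop iterations it takes to get there.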


gsnedders · 2021-01-04T16:51:33Z

> it also makes some follow-up refactoring work a little cleaner

FWIW, I think it's pretty likely that any Cython-compiled version of html5lib, once that exists, will use charsUntil more widely than we do today.

jayaddison · 2021-01-04T17:04:22Z

> > it also makes some follow-up refactoring work a little cleaner
>
> FWIW, I think it's pretty likely that any Cython-compiled version of html5lib, once that exists, will use charsUntil more widely than we do today.

That's a good goal/consideration to keep in mind, thanks. For this instance, the suggested change is largely to help indicate that there's no accidental change-of-behaviour introduced by the refactoring in https://github.com/html5lib/html5lib-python/pull/521/files#diff-84be0df9e74521d407f26e2277a2c70be21dbe6012fea9a5786721c5027e2cfaL894-R868

It also seems consistent with the comment and logic in tagNameState (I didn't copy the comment over, but could do so).

jayaddison · 2022-12-24T01:03:08Z

Cleaning up some old / stale pull requests; please let me know if this changeset is considered worthwhile and I'll reopen if so.

Consistency: consume a single character at a time during attribute name state

183d8a0

This was referenced Dec 29, 2020

Tokenizer: pretranslate lowercase element and attribute names #520

Closed

Tokenizer: use Python objects to represent tokens #521

Closed

Merge branch 'master' into cleanup/attribute-name-char-consumption

3045adb

jayaddison closed this Dec 24, 2022
