1. Introduction
This section is non-normative.In a number of languages and writing system, such as Japanese or Thai, words are not deliminated by spaces (or any other character) as is the case in English (See Approaches to line breaking for a discussion the approach various languages take to word separation and line breaking).
However, even if text without spaces is the dominant style in such languages, there are cases where making word boundaries (or phrase boundaries) visible through the use of spaces is desired. This is a purely stylistic effect, with no implication on the semantics of the text.
In Japan for instance, this is commonly done in books for people learning the language—young children or foreign students. People with dyslexia also tend to find this style easier to read. Recent pushes by the Japanese government for electronic text-books have raised the demand for this type of features, and proprietary ebook solutions are being proposed.
The mechanism proposed in this specification builds upon the existing use
of the wbr
element
or of U+200B ZERO WIDTH SPACE
in the document markup as a word (or phrase) delimiter.
While this practice is not that common,
it is a semantically valid use of that unicode character,
and the ability to trigger stylistic effects based on it
can only encourage its use.
2. Expanding Zero Width Spaces: the zero-width-space-expansion property
Name: | zero-width-space-expansion |
---|---|
Value: | none | space | ideographic-space |
Initial: | none |
Applies to: | inline boxes |
Inherited: | yes |
Percentages: | N/A |
Computed value: | as specified |
Canonical order: | per grammar |
Animation type: | discrete |
This name is too verbose. To be bikeshedded.
Should we allow more freeform values, like <string>, possibly limited to 1 character?
This property enables all instances of U+200B ZERO WIDTH SPACE to be replaced by the specified character.
Instances of wbr
are considered equivalent to U+200B, and are also replaced.
This substitution happens before layout, so all layout operations that depend on the characters in the content (such as CSS Text 3 § 4.1 The White Space Processing Rules, line breaking, or intrinsic sizing) must use that character instead of the original U+200B.
- none
- This property has no effect.
- space
- All instances of U+200B ZERO WIDTH SPACE are replaced by U+0020 SPACE.
- ideographic-space
- All instances of U+200B ZERO WIDTH SPACE are replaced by U+3000 IDEOGRAPHIC SPACE.
Like text-transform, this property transforms text for styling purposes. It has no effect on the underlying content, and must not affect the content of a plain text copy & paste operation.
-
this property seems saner to handle before CSS Text 3 § 4.1 The White Space Processing Rules, but text-transform happens after that. Or maybe which stage text-transform applies at depends on which value it uses?
-
these two properties seems better to cascade separately
Unlike books for adults, Japanese books for young children often feature spaces between sentence segments, to facilitate reading.
Absent any particular styling, the following sentence would be rendered as depicted below.
< p > むかしむかし、< wbr > あるところに、< wbr > おじいさんと< wbr > おばあさんが< wbr > すんでいました。
むかしむかし、あるところに、おじいさんとおばあさんがすんでいました。
Phrase-based spacing can be achieved with the following css:
p {
zero-width-space-expansion : ideographic-space;
}
むかしむかし、 あるところに、 おじいさんと おばあさんが すんでいました。
Another common variant additionally restricts the allowable line breaks to these phrase boundaries. Using the same markup, this is easily achieved with the following css:
p {
word-break : keep-all;
zero-width-space-expansion : ideographic-space;
}
むかしむかし、
In addition to making the source code more readable,
using wbr
rather than U+200B in the markup
also allow authors to classify the delimiters into different groups.
In the following example, wbr
elements are either
unmarked when they delimit a word,
or marked with class p
when they also delimit a phrase.
< p > らいしゅう< wbr > の< wbr > じゅぎょう< wbr > に< wbr class = p
> たいこ< wbr > と< wbr > ばち< wbr > を< wbr class = p
> もって< wbr > きて< wbr > ください。
Using this, it is possible not only to enable the rather common phrase based spacing, but also word by word spacing that is likely to be preferred by people with dyslexia to reduce ambiguities, or other variants such as a combination of phrase-based spacing and of word-based wrapping.
らいしゅう
p wbr.p {
zero-width-space-expansion : ideographic-space;
}
らいしゅう
p wbr {
zero-width-space-expansion : ideographic-space;
}
らいしゅう
p {
word-break : keep-all;
}
p wbr.p {
zero-width-space-expansion : ideographic-space;
}
らいしゅう
p {
word-break : keep-all;
}
p wbr {
zero-width-space-expansion : ideographic-space;
}
らいしゅう
Apendix A. Security and Privacy Considerations
This appendix is non-normative.
There are no known security or privacy impacts of this feature.
The W3C TAG is developing a Self-Review Questionnaire: Security and Privacy for editors of specifications to informatively answer. As far as currently known, here are the answers to the Questions to Consider:
- Does this specification deal with personally-identifiable information?
- No
- Does this specification deal with high-value data?
- No.
- Does this specification introduce new state for an origin that persists across browsing sessions?
- No.
- Does this specification expose any other data to an origin that it doesn’t currently have access to?
- No.
- Does this specification enable new script execution/loading mechanisms?
- No.
- Does this specification allow an origin access to a user’s location?
- No.
- Does this specification allow an origin access to sensors on a user’s device?
- No.
- Does this specification allow an origin access to aspects of a user’s local computing environment?
- No.
- Does this specification allow an origin access to other devices?
- No.
- Does this specification allow an origin some measure of control over a user agent’s native UI?
- No.
- Does this specification expose temporary identifiers to the web?
- No.
- Does this specification distinguish between behavior in first-party and third-party contexts?
- No.
- How should this specification work in the context of a user agent’s "incognito" mode?
- No difference in behavior is expected or needed.
- Does this specification persist data to a user’s local device?
- No.
- Does this specification have a "Security Considerations" and "Privacy Considerations" section?
- Yes, this is the role of this Appendix.
- Does this specification allow downgrading default security characteristics?
- No.