HTML Filter (HTML 4.01)

HTML Filter (HTML 4.01)

Status

The HTML filter is in the early stages, and is being rewritten. It should be used with caution.

Developer

Nicolas Goutte <nicog@snafu.de>.

HTML homepage

http://www.w3.org/TR/html401

Import HTML files into KWord

Features
  • Crude!

  • XHTML™ 1.0 or well-formed HTML 4.01 only!

Still to be done.
  • character references/entities

  • <font> tags

  • tables

  • images

  • use all of CSS2 (not just a very little part of it)

  • many other tags

  • importing non-well-formed HTML 4.01 or older

Export KWord files into HTML

Features
  • Export to HTML 4.01 or XHTML™ 1.0 documents

  • Character formatting (not in “Spartan” mode)

  • Partial CSS2 support (still no style sheets)

Still to be done.
  • Finish the complete re-write

  • Fix white space problems (space at start, space at end, multiple consecutive spaces)

  • Other Unicode encodings (e.g. UTF-16)

  • Other non-Unicode encodings (e.g. ASCII, Local encoding)

  • Tables

  • Images

  • Lists

  • Special treatment for paragraphs in fixed fonts (needed or not?)

  • CSS2 (treatment)

  • Have a correct font size algorithm

KDE Logo