Skip to content

4.5 documentation problems on first readthrough #1

@LosD

Description

@LosD

Problems/potential improvements found when reading through documentation for 4.5 (not all are really related to the change in version, though).
Lines are in source XML lines. Sections are in source XML ids.

  • General: We should use captions for images much more (ideally no image should ever be without a caption).
  • Chapter 1.1 line 56: "a data quality analysis" -> "Data quality analysis"
  • Chapter 1.1 section what_is_data_profiling: It would maybe be good to mention the quick analysis gained from a supported version (the "What can you tell me about my data" wizard).
  • Chapter 1.1 line 171: Line says "As of version 3". That may be a bit uninteresting now we're at 4.5?
  • Chapter 1.2 line 23: Line says that a license file is needed. But that is only true for licensed editions. Maybe add that?
  • Chapter 1.2 section adding_components: The Improve super category is mentioned as only containing Transformers, which is not true.
  • Chapter 1.2 line 241: 'but also "Write" menu for analyzers that save output to a datastore'. This is a bit simplistic, many non-writers can save to datastore. Maybe change to something like '... for analyzers that only (primarily?) save output to a datastore'.
  • Chapter 1.2 line 285: Wording is a little weird: "In the bottom part of the canvas, a help message is displayed instructing what needs to be done in current moment to build a valid job."
  • Chapter 2.2 line 160: The training tool does not open a new dialog. It's result screen is the "dialog".
  • Chapter 2.2 section DE_suppression: Rename not really completed, the text still talks about suppression, and the image is using the old name.
  • Chapter 2.2 section UK_suppression: Rename not really completed, the text still talks about suppression, and the image is using the old name. Maybe mention that it can use output data streams?
  • Chapter 2.2 section US_suppression: Rename not really completed, the text still talks about suppression. Maybe mention that it can use output data streams?
  • Chapter 2.2 section table_lookup_transformer: Maybe add that a new datastore can easily be registered with the new ➕ button and update screenshot.
  • Chapter 2.2 section national_identifiers: Screenshot outdated, still contains EAN (even highlighted, to really draw attention to itself :)).
  • Chapter 2.3 section completeness_analyzer: Maybe mention that it can use output data streams?
  • Chapter 2.4 section writers_create_staging: Maybe add that a new datastore can easily be registered and a new table can easily be generated with the new ➕ buttons and update screenshot.
  • Chapter 5.1 line 115: We do not provide signed JARs for download, so we should probably not say that we do.
  • Chapter 8.2: Screenshots should probably be upgraded to reflect that AnalyzerBeansConfiguration has been renamed to DataCleanerConfiguration.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions