
GitHub Issue 1086: Add metrics for some problematic special characters in fields#7808

Open
XingY wants to merge 2 commits into develop from fb_specialCharMetrics_1086

XingY (Contributor) commented Jul 2, 2026

Rationale

GitHub Issue 1086.
This PR adds a usage metric so we can understand how frequently some problematic special characters (semicolons, commas, newlines, and double quotes) actually occur, broken down by special character and by field type.

Related Pull Requests

Changes

  • Adds the SpecialCharacterMetricsMaintenanceTask system maintenance task (PostgreSQL only, due to performance concerns), which computes, per special character, the number of fields whose values (or defined choices) contain that character, grouped by category.
  • Base domain fields (such as description) are not counted. The only exception is sample/source name, which is reported under the dataName category.
  • TextChoice / MVTC metrics are based on the valid options configured for each field, not on the options actually saved in rows.
  • exp.objectproperty-backed domains (assay run, workflow custom fields, etc.) don't distinguish between text and multiline fields, so they are reported under objectStringValue.
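
To make the counting rule concrete, here is a minimal Python sketch of the aggregation described above. It is not the PR's implementation (the actual task runs against PostgreSQL), and it only illustrates that a field is counted once per special character, however many of its values contain that character. The function name, the metric key names (`semicolon`, `comma`, `newline`, `doubleQuote`), and the categories `text`, `multiLine`, and `textChoice` are assumptions made for illustration; only `dataName` and `objectStringValue` come from the description.

```python
from collections import defaultdict

# The four characters named in the PR description.
# The key names are hypothetical, chosen for this sketch only.
SPECIAL_CHARS = {
    "semicolon": ";",
    "comma": ",",
    "newline": "\n",
    "doubleQuote": '"',
}


def count_fields_with_special_chars(fields):
    """Count fields per special character and category.

    `fields` is an iterable of (category, values) pairs, one pair per field.
    For TextChoice-style fields, `values` would hold the configured choices
    rather than the stored row values.
    Returns {char_name: {category: number_of_fields}}.
    """
    counts = {name: defaultdict(int) for name in SPECIAL_CHARS}
    for category, values in fields:
        for name, ch in SPECIAL_CHARS.items():
            # A field is counted at most once per character,
            # no matter how many of its values contain it.
            if any(v is not None and ch in v for v in values):
                counts[name][category] += 1
    return {name: dict(by_cat) for name, by_cat in counts.items()}


# Hypothetical sample data: six fields across five categories.
example_fields = [
    ("text", ["a;b", "plain"]),
    ("text", ["x,y", "z;w"]),
    ("multiLine", ["line1\nline2"]),
    ("textChoice", ['say "hi"', "ok"]),
    ("dataName", ["S-1", "S,2"]),
    ("objectStringValue", [None, "clean"]),
]

result = count_fields_with_special_chars(example_fields)
for char_name, by_category in result.items():
    print(char_name, by_category)
```

In this sample, both `text` fields contain a semicolon somewhere, so the semicolon count for `text` is 2, while the `objectStringValue` field contributes nothing because none of its values contain a special character.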

XingY requested a review from cnathe July 2, 2026 20:11