Workbench
A visual interface for curating and annotating data, harmonizing complex datasets, and standardizing terminology using ontologies.
Overview
Workbench is designed to harmonize complex data through a user-friendly visual interface, simplifying the curation and annotation of tabular data and ontologies. It addresses challenges faced by scientists and researchers when dealing with siloed information, diverse approaches, and inconsistent terminology.
With Workbench, users can curate and edit term lists, personalized dictionaries, and semi-structured datasets to align with preferred terminologies, facilitating the journey towards Findable, Accessible, Interoperable, and Reusable (FAIR) data.
Key Features
- Intuitive Interface: Simplifies data curation and standardization processes.
- Automated Annotations: Utilizes VOCabs and ontologies for easy reproduction of annotations.
- Time-Saving: Allows creation and sharing of rules to reduce time and effort in data curation.
Workbench enhances efficiency by streamlining the data cleaning process, which traditionally requires specific domain expertise. It enables scientific data curators to work more effectively by facilitating the reuse and repetition of data and processes.
The platform supports easy annotation of data using selected vocabularies or ontologies, including those from SciBite’s extensive library enriched with over 20 million synonyms. Users can also upload custom ontologies for data annotation.
Workbench allows users to automate annotations through configurable rules, fine-tuning fuzzy-matching, and handling spelling variations and typographic errors. This is supported by TERMite, SciBite’s named entity recognition system, which integrates seamlessly with Workbench to manage data with internal codes or proprietary terms.
For collaborative projects, Workbench offers built-in sharing functionalities. Data owners can create groups to invite colleagues for viewing or editing annotations, with the option to export annotated data in Microsoft Excel format for use with third-party tools.
To utilize Workbench effectively, access to TERMite is required. Users can configure Workbench to connect to an existing TERMite server or use an embedded TERMite server if they do not have current access.
Overall, Workbench provides a comprehensive solution for automated data cleansing and standardization, supporting teams with the implementation of rules and ontologies to enhance data quality and efficiency.

