User Tools

Site Tools


doc:compare

Compare two Datasets

Tool version

1.0.2

Keywords

Selection, Lines, Common field, Datasets

Summary

Compares a column of a first dataset with a column of a second dataset.

Description

This tool compares a column of a first dataset with a column of a second dataset. It outputs lines of the first dataset for which the indicated column is matching (or not matching) the indicated column of the second dataset.

General comments (Warning/Tips)

If your data is not tab-delimited, use Text Manipulation→Convert.

Input

  • Compare: select the first dataset.
  • Using column: select the column to compare.
  • against: select the second dataset.
  • and column: select the column to compare with the defined column in the first dataset.
  • To find: select the desired option:
    • Matching rows of 1st dataset
    • Non Matching rows of 1st dataset

Output

The output dataset contains the lines of the first dataset matching (not matching) with a value of the second dataset.

Example

Usage Example: looking for genes in dataset 1 which are not referenced as belonging to a specific gene family.

Input

Compare: file1_compare.txt

Using column: c4

against:file2_compare.txt

and column: c1

To find: Non Matching rows of first dataset

With file1_compare.txt containing:

and file2_compare.txt containing:

Output

Edited on

July 22nd, 2014

doc/compare.txt · Last modified: 2014/11/28 17:04 by slegras