
Union of two datasets

Tool version

1

Keywords

Join, Common field, Files

Summary

Computes the union of two datasets based on common fields.

Description

This tool computes the union of two datasets based on common fields. Example: the union of two datasets, A and B, corresponds to the shaded region:

Unmatched lines are kept in the output, and fields that have no corresponding value are left blank.

General comments (Warning/Tips)

If your data is not tab-delimited, convert it first with Text Manipulation→Convert.

Common field(s) must be exactly identical (content and case).

Input

  • Join: select the first input file.
  • with: select the second input file.
  • Number of lines for header: enter the number of header lines; these lines will be written to the output.
  • Column to be compared for first input (comma separated list of integers): enter the number of the column to compare in the first input file. To compare more than one column, use a comma-separated list of integers.
  • Column to be compared for second input (comma separated list of integers): enter the number of the column to compare in the second input file. To compare more than one column, use a comma-separated list of integers.
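
The column fields above take a comma-separated list of integers. As an illustration only, such a value could be turned into column indices as sketched below. The helper name parse_columns is hypothetical, and the sketch assumes that columns are numbered from 1; it is not the tool's own code.

```python
def parse_columns(spec):
    """Convert a comma-separated column list such as "4" or "1,3"
    into 0-based indices (assumes the user counts columns from 1)."""
    indices = []
    for part in spec.split(","):
        number = int(part.strip())  # raises ValueError if not an integer
        if number < 1:
            raise ValueError(f"column numbers start at 1, got {number}")
        indices.append(number - 1)
    return indices


single = parse_columns("4")       # one column to compare
several = parse_columns("1, 3")   # two columns to compare
```

Surrounding spaces are tolerated here, but an empty entry (for example "1,,3") is rejected with a ValueError.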

Output

The output is a dataset containing the union of the two input datasets. Unmatched lines are kept and empty fields are left blank. The common field is moved to the first position.
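
The behaviour described above resembles a full outer join. The following is a minimal sketch of that idea, not the tool's implementation: the function name union_rows, the sample rows, the row ordering and the handling of duplicate keys are all assumptions made for illustration. Column indices are 0-based in this sketch.

```python
def union_rows(rows1, rows2, key1, key2):
    """Full outer join of two lists of rows (each row is a list of strings).

    key1 and key2 are lists of 0-based indices of the common field(s).
    The common field(s) come first in every output row.
    """
    def split(row, key_cols):
        key = tuple(row[i] for i in key_cols)
        rest = [v for i, v in enumerate(row) if i not in key_cols]
        return key, rest

    # Number of non-key columns on each side, used to pad unmatched rows.
    width1 = len(rows1[0]) - len(key1) if rows1 else 0
    width2 = len(rows2[0]) - len(key2) if rows2 else 0

    # Index the second dataset by key (exact, case-sensitive match).
    index2 = {}
    for row in rows2:
        key, rest = split(row, key2)
        index2.setdefault(key, []).append(rest)

    out = []
    matched = set()
    for row in rows1:
        key, rest = split(row, key1)
        if key in index2:
            matched.add(key)
            for other in index2[key]:
                out.append(list(key) + rest + other)
        else:
            # Unmatched line from the first dataset: blank second-side fields.
            out.append(list(key) + rest + [""] * width2)
    for row in rows2:
        key, rest = split(row, key2)
        if key not in matched:
            # Unmatched line from the second dataset: blank first-side fields.
            out.append(list(key) + [""] * width1 + rest)
    return out


# Hypothetical sample data (not the contents of the example files).
rows1 = [["chr1", "100", "200", "geneA"],
         ["chr2", "300", "400", "geneB"]]
rows2 = [["geneA", "family1"],
         ["geneC", "family2"]]

result = union_rows(rows1, rows2, key1=[3], key2=[0])
for row in result:
    print("\t".join(row))
```

With these sample rows, geneA is matched and gets fields from both sides, while geneB and geneC are kept with blank fields for the side that has no match.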

Example

Usage example: connecting gene characteristics with the gene family each gene belongs to.

Input

Join: file1_union.txt

with: file2_union.txt

Number of lines for header: 0

Column to be compared for first input: 4

Column to be compared for second input: 1

With file1_union.txt containing:

and file2_union.txt containing:

Output

Edited on

July 18th, 2014

doc/union.txt · Last modified: 2014/11/28 17:03 by slegras