#format wiki #language en #pragma section-numbers off #acl IanRiley:read,write,delete,revert,admin All:read #pragma keywords Ian Riley scientific technical data visualisation graph plotting veusz R variable width bar column mekko marimekko cascade waterfall coding #pragma supplementation-page on #pragma ref-dict RefDict <> = Variable width bar graphs using Veusz = == Background and rationale == Variable width bar graphs (or column charts) are used in various technical disciplines. Currently [[https://veusz.github.io|Veusz]] does not directly facilitate such graphs. However, this can be achieved using XY plotters to draw rectangles of variable height and width, either a single rectangle or a series of rectangles with the same properties/formating (such as line and fill colour) for each data category. Normally, the data for a variable width bar graph consists of three vectors, entry (category), height and width. Converting these to vectors for plotting the x-y traces for each entry could be done in a spreadsheet, but adding each manually to an XY plotter in Veusz could be a bit tedious if there are more than a few entries. This is best achieved programmatically, and here R has been used for this task; conversion of the raw data and the generation of the initial Veusz plot for subsequent customisation. Others describe methods to do this using spreadsheet graphs (e.g., <>, <>), however these are relative complex and are not easily automated. Labelling the graphs in these examples is particularly awkward. So a method using R and Veusz seems to be an easier and more flexible alternative. Of course, this can be done in R directly (using `barplot(heights, widths)` or R packages, ggplot or mekko and the like), however, this does not come with the convince provided by Veusz. Ideally, it should be done with a Python script integrated into Veusz, but this is my solution for now. == Procedure == Provide source data in standard Veusz format in a file named `varBars.data.txt`, as shown below. The entries do not need to be sorted by height and there can multiple entries per category. The category names must follow Python naming rules, so no spaces or prohibited symbols (however, the R code replaces the operators, +, -, / and *, with tildes). Commonly underscores would be used to replace spaces, but Veusz uses these to start subscripting in text elements, so '.' could be used (see the [[#Genome repeats|genome example]] below where this was done). {{{ descriptor entry height width apples 20 14 oranges 25 18 peaches 12 5 pears 5 10 bananas 30 2 }}} Run the [[#R code|R code]] ensuring that `varBars.data.txt` is in working directory, or use `setwd()` to point to the appropriate location. The code will create two files, `varBars.txt` and `varBars.vsz`. `varBars.txt`: The first dataset in the file is the left, centre, right, top and bottom coordinates of the first bar for each category. The subsequent datasets are the x-y coordinates for each category (which may represent more than one bar per category if multiple entries per category are included). For the above example data, `varBars.txt` will contain the following. {{{ descriptor entries left center right top bottom bananas 0 1 2 30 0 oranges 2 11 20 25 0 apples 20 27 34 20 0 peaches 34 36.5 39 12 0 pears 39 44 49 5 0 descriptor x_bananas y_bananas 0 0 0 30 2 30 2 0 ...[and so on for the four other categories] }}} `varBars.vsz`: This file is the code to generate the initial graph in Veusz. It consists of a single graph with labels, category XY plotters and a key. Open `varBars.vsz` in Veusz and modify as needed. Probably only one form of labelling will be required, so either the labels or the key can be hidden or deleted. The position of the labels can be changed using the three x and two y data vectors (`'left', 'centre', 'right', 'top' and 'bottom'`) and changing the rotation to suit. Axis labels will need to be added and lengths adjusted. In the initial graph, the axis lengths have been extended to accommodate the diagonal labels. Also, the aspect ratio might need to be altered if there are many entries (see the [[#Genome repeats|genome example]] below). == Examples == === Fruit === [[attachment:varBars.fruit.vsz]] {{attachment:varBars.fruit.png||height=200}} === Genome repeats === [[attachment:varBars.data.genome.txt]] [[attachment:varBars.genome.txt]] Initial - [[attachment:varBars.genome.vsz]] {{attachment:varBars.genome.png||height=200}} Edited - [[attachment:varBars.genome.edited.vsz]] {{attachment:varBars.genome.edited.png||height=200}} Note: in this case the category names included characters not valid in Python names (space and '/'), so these were replaced with '.' and '~', respectively, before running the R code. Then, in the edited version, they were reverted using the Veusz `'replace text'` tool applied to text elements only. == R code == [[attachment:varBars.R/varBars.R|R code file]] === Copyright === @copyright: 2018 Ian Riley === License === GNU GPL, see COPYING for details. == Known issues and limitations == * No provision for negative heights. If this was done, rather than XY plotters, polygons would need to be used as the below line fill (`'fill 1'`) formatting only works if the if the graph origin is at y = 0. * Colours theme1 to theme16 are used to distinguish each category, so if more than 16 categories are included, the colour allocations will need to be edited (especially if using a key not direct bar labelling). ----- Hits:: <>