[Rivet] Unicode issue with two analysis.cc files in Rivet 2.6.0

Andy Buckley andy.buckley at cern.ch
Tue Jul 10 21:11:00 BST 2018


I think we did this, but just a note that we also have to be careful: this info goes into LaTeX and HTML files... I am not sure those (particularly the former) will properly deal with Unicode symbols. They usually get into the files via people copying and pasting abstracts from paper PDFs (which was not really the original point of the Description field, but oh well...)

Andy
Dr Andy Buckley, Lecturer / Royal Society University Research Fellow
Particle Physics Experiment Group, University of Glasgow

On Jul 10 2018, at 12:09 pm, David Grellscheid <david.grellscheid at durham.ac.uk> wrote:
>
> Hi all,
> please fix this the right way round. UTF-8 characters are allowed in the
> supplementary files. The parsing code needs to de-/encode them correctly.
>
> David
>
> On 10/07/2018 11:38, Klaus Rabbertz wrote:
> > Dear Rivet developers,
> >
> > I tried to install the latest Rivet version 2.6.0 on Ubuntu
> > 16.04 LTS and found a little issue with make in the doc directory:
> >
> > ...
> > Using output file name 'analyses.html' and directory 'analyses'
> > Traceback (most recent call last):
> > File "mk-analysis-html", line 201, in <module>
> > page += "<kbd>" + src.replace("\n", "<br/>\n") + "</kbd>"
> > UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position
> > 3214: ordinal not in range(128)
> > Makefile:597: recipe for target 'analyses.html' failed
> > make[1]: *** [analyses.html] Error 1
> > make[1]: Leaving directory '/home/rabbertz/local/src/Rivet-2.6.0/doc'
> > ...
> >
> > The reason is non-unicode characters in two analysis .cc files from
> > analysis/plugin...
> >
> > ATLAS_2016_I1449082.cc contains non-ASCII minus signs in these lines
> > 75: // Missing ET is calculated as the 4–vector sum of neutrinos
> > from W/Z-boson decays. Tau decays are
> > 77: // way as electrons or muons, i.e. the parent is required not
> > to be a hadron or quark (u − b).
> > 151: // A jet is a b–jet if any B–hadron is included in the jet.
> >
> > CMS_2017_I1594909.cc contains a non-ASCII delta symbol
> > 215: // Region Njet Nb-jet HT [GeV] HTmiss [GeV] Parton
> > multiplicity Heavy flavor ? ∆m
> >
> > I checked with
> > grep --color='auto' -P -n '[^\x00-\x7F]' analysis.cc
> >
> > After removing these characters the installation went fine.
> > Ciao,
> > Klaus
> > _______________________________________________
> > Rivet mailing list
> > Rivet at projects.hepforge.org
> > https://www.hepforge.org/lists/listinfo/rivet
>
> _______________________________________________
> Rivet mailing list
> Rivet at projects.hepforge.org
> https://www.hepforge.org/lists/listinfo/rivet
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.hepforge.org/lists-archive/rivet/attachments/20180710/9770f4dc/attachment.html>


More information about the Rivet mailing list