Re: [R] Rd files with unknown encoding?

From: Prof Brian Ripley <ripley_at_stats.ox.ac.uk>
Date: Mon, 17 Dec 2007 10:08:57 +0000 (GMT)

Here's a slightly cleaner version:

showNonASCII <- function(x)
{

     ind <- is.na(iconv(x, "latin1", "ASCII"))
     xxx <- iconv(x[ind], "latin1", "ASCII", sub="byte")
     if(any(ind)) cat(which(ind), ": ", xxx, "\n", sep="")
}

used as

> showNonASCII(readLines("foo.Rd"))

On Sat, 15 Dec 2007, Spencer Graves wrote:

> Dear Prof. Ripley:
>     Thanks very much.  I did as you suggested, which I'll outline here to 
> make it easier for anyone else who might have a similar problem:
>          * Read the offending *.Rd file in R using 'readLines'
>          * Applied 'iconv' to the character vector, following the last 
> example in the help file.  This translated all offending characters into a 
> multi-character sequence starting with '<'.
>          * Used 'regexpr' to find all occurrences of '<'.
>     The latter identified other uses of '<' but produced a sufficiently 
> short list that I was able to find the problems fairly easily.
>     Thanks again.
>     Spencer Graves   p.s.  And in the future, I will refer 'Rd' questions to 
> 'R-devel', per your suggestion. 
> Prof Brian Ripley wrote:

>> On Wed, 12 Dec 2007, Spencer Graves wrote:
>>
>>
>>> How can I identify the problem generating a warning in R CMD check
>>> for "Rd files with unknown encoding"?
>>>
>>> Google identified an email from John Fox with a reply from Brian
>>> Ripley about this last 12 Jun 2007.
>>>
>>
>> But not on this list:
>>
>> https://stat.ethz.ch/pipermail/r-devel/2007-June/046055.html
>>
>> R-devel would have been more appropriate for this too.
>>
>>
>>> This suggests that I may have accidentally entered some possibly
>>> non-printing character into the offending Rd file. The message tells me
>>> which file, but I don't know which lines in the file. Is there some way
>>> of finding the offending character(s) without laboriously running R CMD
>>> check after deleting different portions of the file until I isolate the
>>> problem?
>>>
>>
>> I did say so in that thread:
>>
>> https://stat.ethz.ch/pipermail/r-devel/2007-June/046061.html
>>
>> You can do much the same in R via iconv("", "C", sub="byte"), provided you
>> can read the file in (it may not be representable in your current locale,
>> but you could run R in a Latin-1 locale, if your OS has one).
>>
>>
-- 
Brian D. Ripley,                  ripley_at_stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595

______________________________________________
R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Mon 17 Dec 2007 - 10:17:06 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Mon 17 Dec 2007 - 10:30:19 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.