Re: [Rd] Large discrepancies in the same object being saved to .RData

From: Paul Johnson <pauljohn32_at_gmail.com>
Date: Sat, 10 Jul 2010 13:33:39 -0500

On Wed, Jul 7, 2010 at 7:12 AM, Duncan Murdoch <murdoch.duncan_at_gmail.com> wrote:

> On 06/07/2010 9:04 PM, Julian.Taylor_at_csiro.au wrote:
>>
>> Hi developers,
>>
>> After some investigation I have found there can be large discrepancies in
>> the same object being saved as an external "xx.RData" file. The immediate
>> repercussion of this is the possible increased size of your .RData workspace
>> for no apparent reason.
>>
> I haven't worked through your example, but in general the way that local
> objects get captured is when part of the return value includes an
> environment.

Hi, can I ask a follow-up question?

Is there a tool to browse *.RData files without loading them into R?

In HDF5 (a data storage format we use sometimes), there is a CLI program "h5dump" that will spit out, line by line, all the contents of a storage entity. It will literally track through all the metadata, all the vectors of scores, etc. I've found that handy to "see what's really in there" in cases like the one the OP asked about. Sometimes we find that there are things "in there" by mistake, as Duncan describes, and then we can try to figure out why they are there.
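
To make the request concrete, here is a minimal Python sketch of the very first step such a tool would take: reporting how a file is compressed and what its header says, without starting R. It is nowhere near an h5dump equivalent, since it does not list or decode any objects. The function name sniff_rdata is hypothetical, and the header layout it assumes (a five-byte magic line such as "RDX2\n", then a two-byte format marker such as "X\n") is an assumption that should be checked against the R sources.

```python
import bz2
import gzip
import lzma

# Leading bytes of the compression containers a saved workspace may use.
_COMPRESSION_MAGIC = {
    b"\x1f\x8b": "gzip",
    b"BZh": "bzip2",
    b"\xfd7zXZ\x00": "xz",
}
_OPENERS = {"gzip": gzip.open, "bzip2": bz2.open, "xz": lzma.open, "none": open}

# Assumed meaning of the two-byte marker that follows the magic line.
_FORMATS = {b"X\n": "xdr", b"A\n": "ascii", b"B\n": "native binary"}


def sniff_rdata(path):
    """Report compression and header fields of a saved workspace file.

    Hypothetical helper: it reads only the first few bytes and never
    decodes the objects stored in the file.
    """
    with open(path, "rb") as fh:
        head = fh.read(6)

    compression = "none"
    for magic, name in _COMPRESSION_MAGIC.items():
        if head.startswith(magic):
            compression = name
            break

    # Decompress just enough to see the magic line and the format marker.
    with _OPENERS[compression](path, "rb") as fh:
        header = fh.read(7)

    magic, marker = header[:5], header[5:7]
    if not (magic.startswith(b"RD") and magic.endswith(b"\n")):
        raise ValueError("no RData-style magic found in %r" % path)

    return {
        "compression": compression,
        "magic": magic[:4].decode("ascii"),
        "format": _FORMATS.get(marker, "unknown"),
    }
```

Calling sniff_rdata("xx.RData") would return a small dictionary describing the container. Anything beyond that, such as walking the serialized objects to find a captured environment, would need a real parser for R's serialization format.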

pj

-- 
Paul E. Johnson
Professor, Political Science
1541 Lilac Lane, Room 504
University of Kansas

______________________________________________
R-devel_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel
Received on Sat 10 Jul 2010 - 18:40:31 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
