{"id":1931,"date":"2021-09-27T15:20:22","date_gmt":"2021-09-27T15:20:22","guid":{"rendered":"https:\/\/procogia.com\/reproducibility-with-r\/"},"modified":"2024-04-04T12:21:28","modified_gmt":"2024-04-04T12:21:28","slug":"reproducibility-with-r","status":"publish","type":"post","link":"https:\/\/procogia.com\/reproducibility-with-r\/","title":{"rendered":"Reproducibility with R"},"content":{"rendered":"\r\n

In order to analyze any dataset you need a process. If those data sets involve vast amounts of information, how do you always ensure that the same exact same processes are followed each time? If you analyze a data set several times over using the same algorithms and the same tools, the assumption is that the results produced would be the same each time. Right? Wrong. This is not always the case. Why, because it\u2019s often very difficult to keep track of every step of the analysis and if this is deviated from, the results will vary. Variations in results can devalue the accuracy of data analysis. Businesses that are fully invested in data science simply can\u2019t afford inaccuracies in their data, as the results are often what drive key strategy and the business decision making process. A new R product, called Targets, has been developed by a leading pharmaceutical company, Eli Lilly. Targets is an incredibly important tool for data science as it allows a reproducible workflow to be maintained.<\/p>\r\n


\r\n

Workshops to facilitate data reproducibility with R<\/h3>\r\n

\"\"<\/p>\r\n\r\n

As R experts at ProCogia, our experience working with R is unrivalled; we are currently the only full service RStudio – Posit partner on the Pacific West Coast. We work with cutting edge technology, such as the newly developed Targets package, which allows us to offer our clients the very latest in data science techniques. As well as presenting a keynote presentation and workshop at the global CascadiaR conference, our team holds bespoke training sessions and practical workshops for our clients focusing on the R Targets package. If you\u2019d like to know more about our training programs, which can be delivered online or in person, we\u2019d love to hear from you.\u00a0<\/p>\r\n

Cutting edge technology<\/h3>\r\n

\"\" The open source, freely available Targets package supersedes Drake, an older R-focused package. Targets creates a framework which wraps existing analysis in a code allowing the user to detect when a change has been made to an existing analytics program. This ability to detect changes allows users to go back in time and reproduce analysis exactly. Targets has been developed to be accessible to R users and allows data scientists and researchers to work entirely within R. It can easily be adopted and combined within existing workflows, helping users maintain their data analysis projects.<\/p>\r\n


\r\n

Using Targets<\/h3>\r\n

\"\" Many organisations use R to help analyze statistical information such as customer retention figures, customer churn rates etc. This information often runs onto many thousands of lines of code and, prior to Targets, the onus has always been on the analyst to document their approach diligently. The introduction of Targets ensures the accuracy of replicated analysis whilst also addressing any compliance issues as, in effect, it creates an audit trail of how a data set has been analyzed. Targets enable complicated workloads to be reproduced at the push of a button.<\/p>\r\n


\r\n

Benefits of using Targets:<\/h3>\r\n

The benefits of using R\u2019s open source Targets package includes:<\/p>\r\n