{"id":1931,"date":"2021-09-27T15:20:22","date_gmt":"2021-09-27T15:20:22","guid":{"rendered":"https:\/\/procogia.com\/reproducibility-with-r\/"},"modified":"2024-04-04T12:21:28","modified_gmt":"2024-04-04T12:21:28","slug":"reproducibility-with-r","status":"publish","type":"post","link":"https:\/\/procogia.com\/reproducibility-with-r\/","title":{"rendered":"Reproducibility with R"},"content":{"rendered":"\r\n
In order to analyze any dataset you need a process. If those data sets involve vast amounts of information, how do you always ensure that the same exact same processes are followed each time? If you analyze a data set several times over using the same algorithms and the same tools, the assumption is that the results produced would be the same each time. Right? Wrong. This is not always the case. Why, because it\u2019s often very difficult to keep track of every step of the analysis and if this is deviated from, the results will vary. Variations in results can devalue the accuracy of data analysis. Businesses that are fully invested in data science simply can\u2019t afford inaccuracies in their data, as the results are often what drive key strategy and the business decision making process. A new R product, called Targets, has been developed by a leading pharmaceutical company, Eli Lilly. Targets is an incredibly important tool for data science as it allows a reproducible workflow to be maintained.<\/p>\r\n
<\/p>\r\n\r\n
As R experts at ProCogia, our experience working with R is unrivalled; we are currently the only full service RStudio – Posit partner on the Pacific West Coast. We work with cutting edge technology, such as the newly developed Targets package, which allows us to offer our clients the very latest in data science techniques. As well as presenting a keynote presentation and workshop at the global CascadiaR conference, our team holds bespoke training sessions and practical workshops for our clients focusing on the R Targets package. If you\u2019d like to know more about our training programs, which can be delivered online or in person, we\u2019d love to hear from you.\u00a0<\/p>\r\n
The open source, freely available Targets package supersedes Drake, an older R-focused package. Targets creates a framework which wraps existing analysis in a code allowing the user to detect when a change has been made to an existing analytics program. This ability to detect changes allows users to go back in time and reproduce analysis exactly. Targets has been developed to be accessible to R users and allows data scientists and researchers to work entirely within R. It can easily be adopted and combined within existing workflows, helping users maintain their data analysis projects.<\/p>\r\n
Many organisations use R to help analyze statistical information such as customer retention figures, customer churn rates etc. This information often runs onto many thousands of lines of code and, prior to Targets, the onus has always been on the analyst to document their approach diligently. The introduction of Targets ensures the accuracy of replicated analysis whilst also addressing any compliance issues as, in effect, it creates an audit trail of how a data set has been analyzed. Targets enable complicated workloads to be reproduced at the push of a button.<\/p>\r\n
The benefits of using R\u2019s open source Targets package includes:<\/p>\r\n
Targets is applied at a project level; a small amount of code wraps around existing analytics code. The whole purpose of Targets is to track what the code within is doing. \u201cWe always take time to communicate to our clients the importance of starting with the correct base. It\u2019s worth spending time to get the right systems and software in place at the start of a task, this will ensure that future headaches further down the road can be avoided,\u201d explains Mike, Data Science Consultant.<\/strong><\/p>\r\n Targets is perfect for repeating analytics work on differing data sets, for example drugs discovery trials. \u201cThe ability to create a new visualization or modify your findings on the fly without messing up any other part of your analytics or sacrificing accuracy is key.\u201d adds Mike.<\/p>","protected":false},"excerpt":{"rendered":" In order to analyze any dataset you need a process. If those data sets involve vast amounts of information, how do you always ensure that the same exact same processes are followed each time?<\/p>\n","protected":false},"author":1,"featured_media":1932,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[13],"tags":[],"acf":[],"_links":{"self":[{"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/posts\/1931"}],"collection":[{"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/comments?post=1931"}],"version-history":[{"count":6,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/posts\/1931\/revisions"}],"predecessor-version":[{"id":4913,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/posts\/1931\/revisions\/4913"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/media\/1932"}],"wp:attachment":[{"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/media?parent=1931"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/categories?post=1931"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/procogia.com\/wp-json\/wp\/v2\/tags?post=1931"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}
\r\nWho uses Targets?<\/h3>\r\n