The Best IDE for Data Science Still Needs to be Built
Introduction Data science involves many programming languages and each comes with its preferred integrated development environment (IDE). I know this
Project Overview:
A large market research company in North American relied exclusively on SAS for all their analytic needs. SAS is an antiquated system and they were paying millions of dollars a year for an analytic platform that was no longer meeting their needs. They wanted to migrate their data and analytics to Databricks and needed help translating the code.
Methodology:
We worked with a multidisciplinary team that IT support, database engineers, and a large team of R developers. My team developed a system for preparing SAS programs for migration, testing the outputs, and deploying the code to Databricks using the sparklyr package.
Results:
In just over 18 months, my team was able to translate several hundred-thousand lines of SAS code subsequently deployed to Databricks. The client has ended all SAS processes and is now saving $7m a year.
Project Overview
Our client is a small hospital system in California. They are too small to hire a dedicated team of data analysts, so many of their staff had tried using Rstudio to manage some patient data. This ran into privacy and data quality issues and they needed help designing and using a modern analytic platform.
Methodology:
We recommended a Posit Workbench platform so they could collaborate within a modern server environment. We configured Workbench on an AWS cloud account that is 100% managed by our consultants. We also developed a bespoke 6-week R training course based on their unique data and processes which covered everything from basic data types, writing custom functions, and incoporating AI in our code development.
Results:
The client is able to manage their patient data within a secure modern analytic system. They also have access to an example code base for exploring their data. The R training program included slide decks, exercises and quizes, and recorded trainings that can be useful as new staff are onboarded to the system.
Introduction Data science involves many programming languages and each comes with its preferred integrated development environment (IDE). I know this
Introduction to R Vulnerability HiddenLayer researchers identified a vulnerability in the R language that has security experts worried due to
Making the jump from SAS to R is a daunting prospect that requires a significant investment in time and effort.
At ProCogia, we specialize in guiding teams worldwide through the transition from legacy SAS systems to modern, open-source R and
Applied best practices in code conversion from SAS production processes to R for one of the largest market research companies