Rapid advances in machine learning and data science have helped data scientists arrive at meaningful insights. However, it has not been easy to optimize machine learning infrastructure so that data scientists can focus on their core [See the full post…]
Listen/download audio 27:02
Categories: Audio Podcast, Code Together, Intel
Tags: AI Artificial intelligence, API, Data Science, data scientist, domain specific languages, heterogeneous, machine learning, Modin, OmniSci, oneAPI, Opensource, Pandas, Python, Pytorch, SQL, XGBoost
Data scientists spend 60% of their time cleaning and preprocessing data, transforming dirty data into crystallized insights. Dataframe libraries such as Pandas provide exceptional tooling for data wrangling tasks, yet Pandas dataframes themselves increasingly lack ease and speed as they [See the full post…]
Listen/download audio
Categories: Audio Podcast, Code Together, Intel
Tags: Alex Baden, algebra, Apache Arrow, CPU, data analytics, data preprocessing, Data Science, data science pipeline, data scientist, database, dataframe, Devin Petersohn, DPC++, GPU, heterogeneous, Intel, Intel AI Analytics Toolkit, Intel Optane, just-in-time compilation, LLVM, Modin, NoSQL, OmniSci, OmniSciDB, oneAPI, Open Source, Pandas, Python, scikit-learn, SQL, VTune Profiler