Data Science at the Command Line. Obtain, Scrub, Explore, and Model Data with Unix Power Tools. 2nd Ed. 47823

Jeroen Janssens

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux.

You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers.

Obtain data from websites, APIs, databases, and spreadsheets
Perform scrub operations on text, CSV, HTML, XML, and JSON files
Explore data, compute descriptive statistics, and create visualizations
Manage your data science workflow
Create your own tools from one-liners and existing Python or R code
Parallelize and distribute data-intensive pipelines
Model data with dimensionality reduction, regression, and classification algorithms
Leverage the command line from Python, Jupyter, R, RStudio, and Apache Spark

Author: Jeroen Janssens
Category: Computer books
Language: English
Year: 2021
Pages: 250
Format: 180 × 235 mm
Cover: Paperback
Paper type: Offset
Illustrations: Black and white
Edition: 2nd ed.
Weight, g: 450
Genre: Data analysis, Databases
Age: 16+

2300 ₴

Nova Poshta delivery: 80-120 ₴
Branch: 80 ₴
Parcel locker: 80 ₴
Courier: 120 ₴

Ukrposhta delivery: 50-90 ₴
Branch: 50 ₴
Courier to address: 90 ₴

Mastering Snowflake Solutions: Supporting Analytics and Data Sharing. 1st Ed.

47854

1300 ₴

Visualize This: The FlowingData Guide to Design, Visualization, and Statistics 2nd Edition

48100

1310 ₴

Natural Language Processing with Transformers. Revised Edition

21368

Lewis Tunstall, Leandro von Werra

1390 ₴

The Data Science Design Manual (Texts in Computer Science) 2017th Edition

48132

Steven S. Skiena

1580 ₴

MongoDB Performance Tuning. Optimizing MongoDB Databases and their Applications. 1st Ed.

47850

Guy Harrison, Michael Harrison

1600 ₴

Machine Learning with PySpark. With Natural Language Processing and Recommender Systems. 2nd Ed.

47857

1600 ₴

Econometrics and Data Science. 1st Ed.

47873

Tshepo Chris Nokeri

1700 ₴

Data Science Solutions with Python. 1st Ed.

47876

Tshepo Chris Nokeri

1700 ₴

Modern PyQt. Create GUI Applications for Project Management, Computer Vision, and Data Analysis. 1st Ed.

47849

1800 ₴

Deep Learning: A Practitioner's Approach 1st Edition

48033

2200 ₴