- Added by: literator
- Date: 11-09-2020, 21:13

Author: Jeroen Janssens
Publisher: O’Reilly Media
Year: 2020-09-11
Pages: 80
Language: English
Format: pdf, rtf, epub
Size: 11.4 MB

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 80 tools, useful whether you work with Windows, macOS, or Linux.

Today, data scientists can choose from an overwhelming collection of exciting technologies and programming languages. Python, R, Hadoop, Julia, Pig, Hive, and Spark are but a few examples. You may already have experience in one or more of these. If so, then why should you still care about the command line for doing data science? What does the command line have to offer that these other technologies and programming languages do not?