• Shop by category
  • Powered by eBay
  • Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PyS

    • Item No : 388247528248
    • Condition : Brand New
    • Brand : No brand Info
    • Seller : the_nile_uk_store
    • Current Bid : US $58.44
    • * Item Description

    • The Nile on eBay
        FREE SHIPPING UK WIDE
       

      Data Algorithms with Spark

      by Mahmoud Parsian

      With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark. In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms.

      FORMAT
      Paperback
      LANGUAGE
      English
      CONDITION
      Brand New


      Publisher Description

      Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support makes practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark.

      In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms. You'll learn how to tackle problems involving ETL, design patterns, machine learning algorithms, data partitioning, and genomics analysis. Each detailed recipe includes PySpark algorithms using the PySpark driver and shell script.

      With this book, you will:

      Learn how to select Spark transformations for optimized solutions Explore powerful transformations and reductions including reduceByKey(), combineByKey(), and mapPartitions() Understand data partitioning for optimized queries Build and apply a model using PySpark design patterns Apply motif-finding algorithms to graph data Analyze graph data by using the GraphFrames API Apply PySpark algorithms to clinical and genomics data Learn how to use and apply feature engineering in ML algorithms Understand and use practical and pragmatic data design patterns"

      Author Biography

      Mahmoud Parsian, Ph.D. in Computer Science, is a practicing software professional with 30 years of experience as a developer, designer, architect, and author. For the past 15 years, he has been involved in Java server-side, databases, MapReduce, Spark, PySpark, and distributed computing.

      Details

      ISBN1492082384
      Author Mahmoud Parsian
      Short Title Data Algorithms with Spark
      Pages 500
      Language English
      ISBN-10 1492082384
      ISBN-13 9781492082385
      Format Paperback
      Audience Professional and Scholarly
      Place of Publication Sebastopol
      Country of Publication United States
      Year 2022
      Publication Date 2022-04-30
      AU Release Date 2022-04-30
      NZ Release Date 2022-04-30
      US Release Date 2022-04-30
      UK Release Date 2022-04-30
      Publisher O'Reilly Media
      Imprint O'Reilly Media
      Subtitle Recipes and Design Patterns for Scaling Up using PySpark
      DEWEY 005.1

      TheNile_Item_ID:135159087;
    ★ Recommended Products Related To This Item
    ♥ Best Selling Products in this category