.NET for Apache Spark:一个.NET平台开源免费跨平台的大数据分析框架

网友投稿 771 2022-11-05

.NET for Apache Spark:一个.NET平台开源免费跨平台的大数据分析框架

.NET for Apache Spark:一个.NET平台开源免费跨平台的大数据分析框架

.NET for Apache® Spark™

.NET for Apache Spark provides high performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular Dataframe and SparkSQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data.

.NET for Apache Spark is compliant with .NET Standard - a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write .NET code allowing you to reuse all the knowledge, skills, code, and libraries you already have as a .NET developer.

.NET for Apache Spark runs on Windows, Linux, and macOS using .NET Core, or Windows using .NET Framework. It also runs on all major cloud providers including Azure HDInsight Spark, Amazon EMR Spark, AWS & Azure Databricks.

Note: We currently have a Spark Project Improvement Proposal JIRA at SPIP: .NET bindings for Apache Spark to work with the community towards getting .NET support by default into Apache Spark. We highly encourage you to participate in the discussion.

Table of Contents

Supported Apache SparkReleasesGet StartedBuild StatusBuilding from SourceSamplesContributingInspiration and Special ThanksHow to Engage, Contribute and Provide Feedback.NET FoundationCode of ConductLicense

Supported Apache Spark

Apache Spark.NET for Apache Spark
2.3.*v0.11.0
2.4.0
2.4.1
2.4.3
2.4.4
2.4.5
2.4.2Not supported

Releases

.NET for Apache Spark releases are available here and NuGet packages are available here.

Get Started

These instructions will show you how to run a .NET for Apache Spark app using .NET Core.

Windows InstructionsUbuntu InstructionsMacOs Instructions

Build Status

Building from Source

Building from source is very easy and the whole process (from cloning to being able to run your app) should take less than 15 minutes!

Samples

There are two types of samples/apps in the .NET for Apache Spark repo:

We welcome contributions to both categories!

Contributing

We welcome contributions! Please review our contribution guide.

Inspiration and Special Thanks

This project would not have been possible without the outstanding work from the following communities:

Apache Spark: Unified Analytics Engine for Big Data, the underlying backend execution engine for .NET for Apache SparkMobius: C# and F# language binding and extensions to Apache Spark, a pre-cursor project to .NET for Apache Spark from the same Microsoft group.PySpark: Python bindings for Apache Spark, one of the implementations .NET for Apache Spark derives inspiration from.sparkR: one of the implementations .NET for Apache Spark derives inspiration from.Apache Arrow: A cross-language development platform for in-memory data. This library provides .NET for Apache Spark with efficient ways to transfer column major data between the JVM and .NET CLR.Pyrolite - Java and .NET interface to Python's pickle and Pyro protocols. This library provides .NET for Apache Spark with efficient ways to transfer row major data between the JVM and .NET CLR.Databricks: Unified analytics platform. Many thanks to all the suggestions from them towards making .NET for Apache Spark run on Azure and AWS Databricks.

How to Engage, Contribute and Provide Feedback

The .NET for Apache Spark team encourages contributions, both issues and PRs. The first step is finding an existing issue you want to contribute to or if you cannot find any, open an issue.

.NET Foundation

The .NET for Apache Spark project is part of the .NET Foundation.

Code of Conduct

This project has adopted the code of conduct defined by the Contributor Covenant to clarify expected behavior in our community. For more information, see the .NET Foundation Code of Conduct.

License

.NET for Apache Spark is licensed under the MIT license.

版权声明:本文内容由网络用户投稿,版权归原作者所有,本站不拥有其著作权,亦不承担相应法律责任。如果您发现本站中有涉嫌抄袭或描述失实的内容,请联系我们jiasou666@gmail.com 处理,核实后本网站将在24小时内删除侵权内容。

上一篇:Apache MXNet 一个轻量级,便携式,灵活的分布式/移动深度学习框架
下一篇:523. 连续的子数组和
相关文章

 发表评论

暂时没有评论,来抢沙发吧~