Posts

Showing posts from August, 2016

Topic Classification using Latent Dirichlet Allocation(LDA) ML Library

importorg.apache.spark.sql.SparkSessionvalsparkSession=SparkSession.builder .master("local") .appName("my-spark-app") .config("spark.some.config.option", "config-value") .getOrCreate()​valdf=spark.read.json("dbfs:/mnt/JSON10/JSON/sampleDoc.txt") import org.apache.spark.sql.SparkSession sparkSession: org.apache.spark.sql.SparkSession = org.apache.spark.sql.SparkSession@351b4b37 df: org.apache.spark.sql.DataFrame = [filename: string, id: string ... 1 more field]