Jan 17, 2018

Apache Spark lets you write generic UDFs, but an idiomatic implementation requires keeping a couple of things in mind. Return a subtype of Option: Spark automatically treats None as null and extracts the underlying value from Some. Your generic UDF should also accept either an Option or a plain value as input; to handle both, pattern match on the type and recursively extract the value from any Option. This scenario occurs if...
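The recursive extraction described above can be sketched as a small helper, independent of Spark itself. The name `unwrap` is hypothetical, not from the original post; the idea is simply to pattern match on Option and recurse until a plain value (or null) remains:

```scala
// Hypothetical helper: recursively unwrap Option values so a UDF body
// always sees a plain value. Nested Options (e.g. Some(Some(3))) are
// unwrapped all the way down; None becomes null, which Spark maps to
// a SQL null when returned from a UDF.
def unwrap(value: Any): Any = value match {
  case Some(v) => unwrap(v)  // recurse into nested Option layers
  case None    => null       // Spark treats null as SQL null
  case other   => other      // already a plain value
}
```

Inside a UDF, this lets the same logic handle both `Option[T]` and `T` inputs, and the result can be wrapped back in `Option(...)` so Spark converts None to null on the way out.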

Posted on Wednesday, January 17, 2018 by Unknown

Testing Spark DataFrame transforms is essential and can be done in a reusable way. The approach I generally take is: read the expected and test DataFrames, invoke the transform under test, and compute the difference between the resulting DataFrames. The one caveat in computing the difference is that the built-in except function is not sufficient for columns with decimal types, and handling them requires a bit of extra work. To compare DataFrames generically, we need to look at the type of each column and, when it is...
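One way the decimal caveat is commonly handled (a sketch under my own assumptions, not necessarily the post's exact technique) is to normalize decimal columns to a fixed scale before calling except, so that equal values with different scales or tiny representation differences do not show up as spurious diffs. The function name `diff` and the rounding scale are illustrative choices:

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.{col, round}
import org.apache.spark.sql.types.DecimalType

// Hypothetical comparison helper: round every DecimalType column to a
// common scale in both DataFrames, then take the symmetric difference.
// Both returned DataFrames being empty means the frames match.
def diff(actual: DataFrame, expected: DataFrame, scale: Int = 6): (DataFrame, DataFrame) = {
  def normalize(df: DataFrame): DataFrame =
    df.schema.fields.foldLeft(df) { (acc, field) =>
      field.dataType match {
        case _: DecimalType => acc.withColumn(field.name, round(col(field.name), scale))
        case _              => acc
      }
    }
  val a = normalize(actual)
  val e = normalize(expected)
  (a.except(e), e.except(a))
}
```

Note that except is computed in both directions: `actual.except(expected)` catches unexpected rows, while `expected.except(actual)` catches missing ones.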

Posted on Wednesday, January 17, 2018 by Unknown