For an application running on the MapReduce platform, where will the jar package that the application depends on be placed?
In Spark, assuming that lines is a DStream object, the filter statement can filter out 80% of the data. Which statement is correct for the following two statements:
X: lines.filter(...).groupByKey(...)
Y: lines.groupByKey(...).filter(...)
In Streaming application development, which interface does Bolt use to send Tuples?