class ConnectedComponents extends Arguments with Logging with WithAlgorithmChoice
Connected Components algorithm.
Computes the connected component membership of each vertex and returns a DataFrame of vertex information with each vertex assigned a component ID.
The resulting DataFrame contains all the vertex information and one additional column:
- component (
LongType
): unique ID for this component
- Alphabetic
- By Inheritance
- ConnectedComponents
- WithAlgorithmChoice
- Logging
- Arguments
- AnyRef
- Any
- by any2stringadd
- by StringFormat
- by Ensuring
- by ArrowAssoc
- Hide All
- Show All
- Public
- All
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
def
+(other: String): String
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to any2stringadd[ConnectedComponents] performed by method any2stringadd in scala.Predef.
- Definition Classes
- any2stringadd
-
def
->[B](y: B): (ConnectedComponents, B)
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to ArrowAssoc[ConnectedComponents] performed by method ArrowAssoc in scala.Predef.
- Definition Classes
- ArrowAssoc
- Annotations
- @inline()
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
val
ALGO_GRAPHFRAMES: String
- Attributes
- protected
- Definition Classes
- WithAlgorithmChoice
-
val
ALGO_GRAPHX: String
- Attributes
- protected
- Definition Classes
- WithAlgorithmChoice
-
val
algorithm: String
- Attributes
- protected
- Definition Classes
- WithAlgorithmChoice
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native() @HotSpotIntrinsicCandidate()
-
def
ensuring(cond: (ConnectedComponents) ⇒ Boolean, msg: ⇒ Any): ConnectedComponents
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to Ensuring[ConnectedComponents] performed by method Ensuring in scala.Predef.
- Definition Classes
- Ensuring
-
def
ensuring(cond: (ConnectedComponents) ⇒ Boolean): ConnectedComponents
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to Ensuring[ConnectedComponents] performed by method Ensuring in scala.Predef.
- Definition Classes
- Ensuring
-
def
ensuring(cond: Boolean, msg: ⇒ Any): ConnectedComponents
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to Ensuring[ConnectedComponents] performed by method Ensuring in scala.Predef.
- Definition Classes
- Ensuring
-
def
ensuring(cond: Boolean): ConnectedComponents
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to Ensuring[ConnectedComponents] performed by method Ensuring in scala.Predef.
- Definition Classes
- Ensuring
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
def
getAlgorithm: String
- Definition Classes
- WithAlgorithmChoice
-
def
getBroadcastThreshold: Int
Gets broadcast threshold in propagating component assignment.
Gets broadcast threshold in propagating component assignment.
-
def
getCheckpointInterval: Int
Gets checkpoint interval.
Gets checkpoint interval.
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
def
getIntermediateStorageLevel: StorageLevel
Gets storage level for intermediate datasets that require multiple passes.
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
def
logDebug(s: ⇒ String): Unit
- Attributes
- protected
- Definition Classes
- Logging
-
def
logInfo(s: ⇒ String): Unit
- Attributes
- protected
- Definition Classes
- Logging
-
def
logTrace(s: ⇒ String): Unit
- Attributes
- protected
- Definition Classes
- Logging
-
def
logWarn(s: ⇒ String): Unit
- Attributes
- protected
- Definition Classes
- Logging
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
def
run(): DataFrame
Runs the algorithm.
-
def
setAlgorithm(value: String): ConnectedComponents.this.type
- Definition Classes
- WithAlgorithmChoice
-
def
setBroadcastThreshold(value: Int): ConnectedComponents.this.type
Sets broadcast threshold in propagating component assignments (default: 1000000).
Sets broadcast threshold in propagating component assignments (default: 1000000). If a node degree is greater than this threshold at some iteration, its component assignment will be collected and then broadcasted back to propagate the assignment to its neighbors. Otherwise, the assignment propagation is done by a normal Spark join. This parameter is only used when the algorithm is set to "graphframes".
-
def
setCheckpointInterval(value: Int): ConnectedComponents.this.type
Sets checkpoint interval in terms of number of iterations (default: 2).
Sets checkpoint interval in terms of number of iterations (default: 2). Checkpointing regularly helps recover from failures, clean shuffle files, shorten the lineage of the computation graph, and reduce the complexity of plan optimization. As of Spark 2.0, the complexity of plan optimization would grow exponentially without checkpointing. Hence, disabling or setting longer-than-default checkpoint intervals are not recommended. Checkpoint data is saved under
org.apache.spark.SparkContext.getCheckpointDir
with prefix "connected-components". If the checkpoint directory is not set, this throws ajava.io.IOException
. Set a nonpositive value to disable checkpointing. This parameter is only used when the algorithm is set to "graphframes". Its default value might change in the future.- See also
org.apache.spark.SparkContext.setCheckpointDir
in Spark API doc
-
def
setIntermediateStorageLevel(value: StorageLevel): ConnectedComponents.this.type
Sets storage level for intermediate datasets that require multiple passes (default:
).MEMORY_AND_DISK
-
val
supportedAlgorithms: Array[String]
- Definition Classes
- WithAlgorithmChoice
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toString(): String
- Definition Classes
- AnyRef → Any
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
def
→[B](y: B): (ConnectedComponents, B)
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to ArrowAssoc[ConnectedComponents] performed by method ArrowAssoc in scala.Predef.
- Definition Classes
- ArrowAssoc
Deprecated Value Members
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] ) @Deprecated
- Deprecated
-
def
formatted(fmtstr: String): String
- Implicit
- This member is added by an implicit conversion from ConnectedComponents to StringFormat[ConnectedComponents] performed by method StringFormat in scala.Predef.
- Definition Classes
- StringFormat
- Annotations
- @deprecated @inline()
- Deprecated
(Since version 2.12.16) Use
formatString.format(value)
instead ofvalue.formatted(formatString)
, or use thef""
string interpolator. In Java 15 and later,formatted
resolves to the new method in String which has reversed parameters.