Neo4J and Weka 2

1. Combining the Neo4J graph database with Weka: a basic “toy” example based on mining SEC Form D filings

2. Experiment: find the intersection among VC firms related to Google and its latest acquisitions (i.e. the “Dataset”), and play with “predicting” the chance of a newly funded startup being acquired by Google by examining proximity in the graph.

3. Weka: machine learning toolkit containing classification and clustering algorithms; used here for creating recommendations based on input. Neo4J: graph database, well suited to social network data; used here for finding the “shortest path” between two nodes.
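To make the “shortest path” idea concrete, here is a minimal sketch in plain Java that uses breadth-first search over a small in-memory graph. It does not use the Neo4J API; the class `ProximityGraph` and its methods are hypothetical names introduced only for illustration, and the graph is treated as undirected for simplicity.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.Set;

/** Tiny undirected graph with a breadth-first shortest-path search (illustrative only). */
class ProximityGraph {
    private final Map<String, Set<String>> neighbors = new HashMap<>();

    /** Links two named entities, e.g. an investor and a company it funded. */
    void link(String a, String b) {
        neighbors.computeIfAbsent(a, k -> new HashSet<>()).add(b);
        neighbors.computeIfAbsent(b, k -> new HashSet<>()).add(a);
    }

    /** Returns the nodes on a shortest path from start to goal, or an empty list if none exists. */
    List<String> shortestPath(String start, String goal) {
        if (!neighbors.containsKey(start) || !neighbors.containsKey(goal)) {
            return Collections.emptyList();
        }
        // cameFrom doubles as the visited set; the start node maps to null.
        Map<String, String> cameFrom = new HashMap<>();
        Queue<String> frontier = new ArrayDeque<>();
        cameFrom.put(start, null);
        frontier.add(start);
        while (!frontier.isEmpty()) {
            String current = frontier.remove();
            if (current.equals(goal)) {
                // Walk the predecessor links back to the start, then reverse.
                List<String> path = new ArrayList<>();
                for (String n = goal; n != null; n = cameFrom.get(n)) {
                    path.add(n);
                }
                Collections.reverse(path);
                return path;
            }
            for (String next : neighbors.get(current)) {
                if (!cameFrom.containsKey(next)) {
                    cameFrom.put(next, current);
                    frontier.add(next);
                }
            }
        }
        return Collections.emptyList();
    }
}
```

The number of hops on the returned path could serve as the “proximity” of a startup to Google; how that distance is turned into a feature is a design choice left open here.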

4. Neo4J can handle large sets of unstructured linked data:

5. RDF: Subject - Property - Object. Neo4J: Node 1 - Relationship - Node 2.
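The correspondence between the two models can be sketched in a few lines of plain Java. The classes `SketchNode` and `SketchRelationship` are hypothetical stand-ins, not Neo4J types; they only show that a typed relationship between two nodes carries the same information as a subject-property-object statement.

```java
import java.util.HashMap;
import java.util.Map;

/** A node carrying free-form properties, similar in spirit to a graph-database node. */
class SketchNode {
    final Map<String, Object> properties = new HashMap<>();

    SketchNode(String name) {
        properties.put("name", name);
    }
}

/** A typed, directed relationship between two nodes. */
class SketchRelationship {
    final SketchNode start;
    final String type;
    final SketchNode end;

    SketchRelationship(SketchNode start, String type, SketchNode end) {
        this.start = start;
        this.type = type;
        this.end = end;
    }

    /** Renders the relationship as a subject-property-object statement. */
    String asTriple() {
        return start.properties.get("name") + " " + type + " " + end.properties.get("name");
    }
}
```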

6. Statement: “Sequoia Capital funded Google”
    // Initialize the database (writes must run inside a transaction)
    GraphDatabaseService graphDb = new EmbeddedGraphDatabase("SEC");
    IndexService index = new LuceneIndexService(graphDb);
    // Create the nodes
    Node sequoia = graphDb.createNode();
    sequoia.setProperty("name", "Sequoia Capital");
    Node google = graphDb.createNode();
    google.setProperty("name", "Google");
    index.index(sequoia, "name", "Sequoia Capital");
    // Create the relationship
    Relationship rel = sequoia.createRelationshipTo(google, InvestorRelationTypes.FUNDED);

7. Traversal returning the nodes reached through a FUNDED relationship:
    Traverser traverser = node.traverse(
        Order.DEPTH_FIRST,
        StopEvaluator.END_OF_NETWORK,
        new ReturnableEvaluator() {
            public boolean isReturnableNode(TraversalPosition currentPosition) {
                Relationship last = currentPosition.lastRelationshipTraversed();
                // The start node has no last relationship, so guard against null
                return last != null && last.isType(InvestorRelationTypes.FUNDED);
            }
        },
        InvestorRelationTypes.BOARD, Direction.INCOMING,
        InvestorRelationTypes.FUNDED, Direction.INCOMING,
        InvestorRelationTypes.ACQUIRED, Direction.OUTGOING);
    return traverser.getAllNodes();
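The behaviour of this traversal can be approximated in plain Java without Neo4J. The sketch below is an assumption-laden illustration: the class `InvestorTraversal` and its methods are hypothetical, and it only mimics the idea of walking depth-first over BOARD and FUNDED edges in the incoming direction and ACQUIRED edges in the outgoing direction, returning the nodes that were reached through a FUNDED edge.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Depth-first walk over typed, directed edges (illustrative only, not the Neo4J API). */
class InvestorTraversal {
    enum RelType { BOARD, FUNDED, ACQUIRED }

    /** A directed, typed edge: from --type--> to. */
    static final class Edge {
        final String from;
        final RelType type;
        final String to;

        Edge(String from, RelType type, String to) {
            this.from = from;
            this.type = type;
            this.to = to;
        }
    }

    private final List<Edge> edges = new ArrayList<>();

    void relate(String from, RelType type, String to) {
        edges.add(new Edge(from, type, to));
    }

    /** Returns the nodes whose last traversed edge was FUNDED, starting the walk at start. */
    Set<String> fundedReachableFrom(String start) {
        Set<String> visited = new HashSet<>();
        Set<String> result = new LinkedHashSet<>();
        visit(start, null, visited, result);
        return result;
    }

    private void visit(String node, RelType arrivedVia, Set<String> visited, Set<String> result) {
        // Each node is visited once, so only the first edge that reaches it counts.
        if (!visited.add(node)) {
            return;
        }
        if (arrivedVia == RelType.FUNDED) {
            result.add(node);
        }
        for (Edge e : edges) {
            // Follow BOARD and FUNDED edges against their direction (incoming).
            if (e.to.equals(node) && e.type != RelType.ACQUIRED) {
                visit(e.from, e.type, visited, result);
            }
            // Follow ACQUIRED edges along their direction (outgoing).
            if (e.from.equals(node) && e.type == RelType.ACQUIRED) {
                visit(e.to, e.type, visited, result);
            }
        }
    }
}
```

Starting from a company node, the result contains its own investors and the investors of the companies it acquired, which is roughly the set of VC firms the experiment is interested in.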

8. “Path to Google:”

9. Weka: 1) Create attributes (table input) 2) Create the dataset for learning 3) Build the predictive model 4) Evaluate the quality of the model 5) Predict the rank based on input

11. Instance: a dataset consists of instances

12. Attribute: each instance consists of attributes

14. Example: Attributes

15. 1) Create Attributes:
    Attribute pathAttribute = new Attribute("path");
    Attribute categoryAttribute = new Attribute("category");
    Attribute similarityAttribute = new Attribute("similarity");
    Attribute probabilityAttribute = new Attribute("probability");
    // In Weka a vector is the container for Attributes
    FastVector allAttributes = new FastVector(4);
    allAttributes.addElement(pathAttribute);
    allAttributes.addElement(categoryAttribute);
    allAttributes.addElement(similarityAttribute);
    allAttributes.addElement(probabilityAttribute);
    2) Create Dataset: an Instance is a “container” of attribute values, and the Dataset is a container of Instances.
    Instances trainingDataSet = new Instances("VC", allAttributes, 17);
    // The value to predict is the last attribute (probability)
    trainingDataSet.setClassIndex(3);
    // For each instance we set the values to be trained upon
    Instance instance = new Instance(4);
    instance.setDataset(trainingDataSet);
    instance.setValue(0, path);
    instance.setValue(1, category);
    instance.setValue(2, similarity);
    instance.setValue(3, rank);
    trainingDataSet.add(instance);

16. 3) Train Classifier and Evaluate
    RBFNetwork rbfLearner = new RBFNetwork();
    rbfLearner.setNumClusters(17);
    rbfLearner.buildClassifier(trainingDataSet);
    Evaluation learningSetEvaluation = new Evaluation(trainingDataSet);
    learningSetEvaluation.evaluateModel(rbfLearner, trainingDataSet);
    4) Predict Unknown Cases
    Instance testInstance = new Instance(4);
    testInstance.setDataset(trainingDataSet);
    testInstance.setValue(0, path);
    testInstance.setValue(1, category);
    testInstance.setValue(2, similarity);
    testInstance.setValue(3, 0); // placeholder; this is the value being predicted
    double prediction = rbfLearner.classifyInstance(testInstance);
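For intuition about what a radial-basis-function learner does with such instances, the following plain-Java sketch predicts a value as a Gaussian-kernel-weighted average of the training targets and reports a mean absolute error. This is a deliberately simplified stand-in, not Weka's `RBFNetwork` (which fits cluster centres and a regression on top of them). The class name `RbfSketch`, the kernel width `sigma`, and the numeric feature values used with it are assumptions made for illustration.

```java
/** Kernel-weighted average predictor: a simplified stand-in, not Weka's RBFNetwork. */
class RbfSketch {
    private final double[][] features; // one row per training instance (e.g. path, category, similarity)
    private final double[] targets;    // value to predict for each row
    private final double sigma;        // kernel width (assumed; would need tuning per dataset)

    RbfSketch(double[][] features, double[] targets, double sigma) {
        if (features.length == 0 || features.length != targets.length || sigma <= 0) {
            throw new IllegalArgumentException("need matching, non-empty data and sigma > 0");
        }
        this.features = features;
        this.targets = targets;
        this.sigma = sigma;
    }

    /** Gaussian radial basis function of the squared distance between two points. */
    private double kernel(double[] a, double[] b) {
        double squared = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            squared += d * d;
        }
        return Math.exp(-squared / (2 * sigma * sigma));
    }

    /** Predicts a target as the kernel-weighted mean of the training targets. */
    double predict(double[] x) {
        double weighted = 0.0;
        double total = 0.0;
        double plainSum = 0.0;
        for (int i = 0; i < features.length; i++) {
            double w = kernel(features[i], x);
            weighted += w * targets[i];
            total += w;
            plainSum += targets[i];
        }
        // Far from every training point all weights underflow to 0; fall back to the plain mean.
        return total == 0.0 ? plainSum / targets.length : weighted / total;
    }

    /** Mean absolute error of the predictor on a labeled set. */
    double meanAbsoluteError(double[][] xs, double[] ys) {
        double sum = 0.0;
        for (int i = 0; i < xs.length; i++) {
            sum += Math.abs(predict(xs[i]) - ys[i]);
        }
        return sum / xs.length;
    }
}
```

Evaluating on the same data used for training, as the slide does, tends to give an optimistic error estimate; a held-out set would be a more cautious check.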
