# Federated Learning

From MgmtWiki

## Revision as of 21:42, 2 September 2022

## Full Title

A means of learning in which the nodes operate independently yet contribute to a common understanding of a problem.

## Context

- Most human learning is federated in the sense that each human operates as an independent entity which receives inputs and creates outputs.
- In this pattern we model the human tendency to spread processing across the nodes, with a similar hierarchy of capability among them.

## A Hierarchical Directed Graph

One solution is to organize all nodes that run any learning algorithm into a tree, with upward paths that always move from the leaves toward the root and downward paths that go from the root out to the leaves.
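As a minimal sketch of such a tree, assume each leaf holds one numeric parameter and a sample count. The `Node` class, its field names, and the weighted-mean aggregation are illustrative assumptions, not something this page prescribes.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    """One node in the hierarchy (hypothetical structure for illustration)."""
    name: str
    local_value: Optional[float] = None  # set on leaves only: a locally learned parameter
    weight: int = 0                      # number of local samples behind local_value
    children: List["Node"] = field(default_factory=list)
    model: Optional[float] = None        # last shared value received from the root

    def aggregate_up(self) -> Tuple[float, int]:
        """Upward path: return (weighted mean, total weight) for this subtree."""
        if not self.children:
            return self.local_value, self.weight
        total, count = 0.0, 0
        for child in self.children:
            value, w = child.aggregate_up()
            total += value * w
            count += w
        return total / count, count

    def broadcast_down(self, model: float) -> None:
        """Downward path: push the shared value from the root to every leaf."""
        self.model = model
        for child in self.children:
            child.broadcast_down(model)


# Two intermediate nodes, three leaves.
leaves_a = [Node("a1", 1.0, 10), Node("a2", 3.0, 30)]
leaves_b = [Node("b1", 5.0, 60)]
root = Node("root", children=[Node("a", children=leaves_a),
                              Node("b", children=leaves_b)])

global_model, n = root.aggregate_up()   # leaves -> root
root.broadcast_down(global_model)       # root -> leaves
```

Here the upward call combines leaf values level by level, and the downward call returns the combined value to every node. Any other learning algorithm could take the place of the weighted mean.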

## Privacy Enhancing

To make the graph privacy-enhancing, we require that any personally identifiable information (PII) be restricted to the leaves.
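One way to picture this restriction is a leaf that keeps its raw records private and passes only a PII-free summary toward the root. The `Record`, `Leaf`, and `combine` names below are hypothetical, and the count-and-total summary is an assumed choice for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Record:
    name: str     # PII: must never leave the leaf
    value: float  # the measurement being learned from


class Leaf:
    def __init__(self, records: List[Record]) -> None:
        self._records = records  # raw records stay private to this leaf

    def summary(self) -> Dict[str, float]:
        """Only this PII-free summary is sent toward the root."""
        return {
            "count": float(len(self._records)),
            "total": sum(r.value for r in self._records),
        }


def combine(summaries: List[Dict[str, float]]) -> float:
    """Run at a non-leaf node: it sees summaries, never records."""
    count = sum(s["count"] for s in summaries)
    total = sum(s["total"] for s in summaries)
    return total / count


leaves = [
    Leaf([Record("Alice", 30.0), Record("Bob", 40.0)]),
    Leaf([Record("Carol", 50.0)]),
]
summaries = [leaf.summary() for leaf in leaves]
overall_mean = combine(summaries)
```

Keeping names out of the summary is only the structural part of the requirement. A summary over very few records can still reveal individual values, so a real deployment would likely need further safeguards.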

## Using Test Data

- Test data is unlikely to be distributed the way it would be in real life.
- The goal is to determine whether the effect of selecting data and federating populations in a manner that mirrors real life is lost when the data is sorted uniformly. The following plan is designed to show the impact of distributing data the way it is likely to be separated in the real world.

- Break the synthetic data into two randomly selected groups and obtain a result from a single aggregation of those two sets.
- Break the population into 50 groups with random selection.
- Break the population into 50 groups in which deliberately selected subpopulations are overrepresented and which contain widely different numbers of individuals.
- Aggregate each selection into a single distribution.
- Measure the discrepancy between the results to see the impact of real-world distributions on federated learning.
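The plan above can be sketched with synthetic data and the standard library alone. Everything specific here is an assumption made for illustration: five subpopulations with different means, a local mean as the "model" each group learns, an unweighted average of local means as the aggregation, and the function names themselves.

```python
import random
from statistics import mean
from typing import List, Tuple

Person = Tuple[int, float]  # (subpopulation label, measured value)


def make_population(n: int, rng: random.Random) -> List[Person]:
    """Synthetic data: five subpopulations whose values have different means."""
    people = []
    for _ in range(n):
        label = rng.randrange(5)
        people.append((label, rng.gauss(10.0 * label, 1.0)))
    return people


def random_split(people: List[Person], k: int, rng: random.Random) -> List[List[Person]]:
    """k near-equal groups chosen at random."""
    shuffled = people[:]
    rng.shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]


def skewed_split(people: List[Person], k: int, rng: random.Random) -> List[List[Person]]:
    """k groups of widely different sizes, each dominated by one or two labels."""
    ordered = sorted(people, key=lambda p: p[0])  # sorting concentrates labels
    cuts = sorted(rng.sample(range(1, len(ordered)), k - 1))  # distinct cut points
    bounds = [0] + cuts + [len(ordered)]
    return [ordered[a:b] for a, b in zip(bounds, bounds[1:])]


def aggregate(groups: List[List[Person]]) -> float:
    """Naive aggregation: unweighted average of each group's local mean."""
    return mean(mean(v for _, v in g) for g in groups)


rng = random.Random(0)
population = make_population(5000, rng)
pooled = mean(v for _, v in population)  # reference: all data in one place

results = {
    "2 random groups": aggregate(random_split(population, 2, rng)),
    "50 random groups": aggregate(random_split(population, 50, rng)),
    "50 skewed groups": aggregate(skewed_split(population, 50, rng)),
}
for name, value in results.items():
    print(f"{name}: result={value:.3f} discrepancy={abs(value - pooled):.3f}")
```

Because the random groups here are equal in size, their unweighted average reproduces the pooled mean, so any discrepancy comes from the skewed split. Weighting each local mean by its group size would remove the discrepancy for this simple statistic. A nonlinear model trained per group would not be corrected so easily, which is the case the plan is meant to probe.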