• [leetcode]295. Find Median from Data Stream数据流的中位数


    Median is the middle value in an ordered integer list. If the size of the list is even, there is no middle value. So the median is the mean of the two middle value.

    For example,

    [2,3,4], the median is 3

    [2,3], the median is (2 + 3) / 2 = 2.5

    Design a data structure that supports the following two operations:

    • void addNum(int num) - Add a integer number from the data stream to the data structure.
    • double findMedian() - Return the median of all elements so far.

    Example:

    addNum(1)
    addNum(2)
    findMedian() -> 1.5
    addNum(3) 
    findMedian() -> 2

    题意:

    给定一堆数据,求其中位数(将数据升序排列。若数据个数为奇,则正中为中位数;若数据个数为偶,则正中两数平均值为中位数)


    思路:

    显然若数据大,排序再找中位数是不现实的。

    用PriorityQueue分别建maxHeap, minHeap, 并保证maxHeap.size() - minHeap.size() <=1 

    如此,当最终maxHeap.size() = minHeap.size()  说明数据个数为偶,取两个Heap的peek值的平均值

             当最终maxHeap.size() > minHeap.size()  说明数据个数为奇,取maxHeap.peek

    扫数据中的每个元素

    [6,  10,  2,  6,  5,  0,  6,  3]

      ^  若maxHeap.isEmpty() || 当前元素 < maxHeap.peek() , 扔到maxHeap里去

    [6,  10,  2,  6,  5,  0,  6,  3]

              ^ 若当前元素 >= maxHeap.peek() , 扔到minHeap里去

    [6,  10,  2,  6,  5,  0,  6,  3]

                       ^   当前元素 < maxHeap.peek() , 扔到maxHeap里去

    [6,  10,  2,  6,  5,  0,  6,  3]

                                ^   若当前元素 >= maxHeap.peek() , 扔到minHeap里去,minHeap会自动将内部最小值sort到peek位置

    [6,  10,  2,  6,  5,  0,  6,  3]

                                         ^   当前元素 < maxHeap.peek() , 扔到maxHeap里去

    此时maxHeap.size() - minHeap.size()  = 2 需要维护平衡

    ......

    如此以往,直至扫完整个数据。


     代码:

     1 class MedianFinder {
     2 
     3     /** initialize your data structure here. */
     4     private PriorityQueue<Integer> _maxHeap; 
     5     private PriorityQueue<Integer>  _minHeap;
     6 
     7     public MedianFinder() {
     8         _maxHeap = new  PriorityQueue<> ((o1,o2) -> o2-o1); 
     9         _minHeap = new  PriorityQueue<> ((o1,o2) -> o1-o2); 
    10     }
    11     
    12     public void addNum(int num) {
    13         // put each integer into either maxHeap or minHeap
    14         if(_maxHeap.isEmpty() || num < _maxHeap.peek()){
    15             _maxHeap.add(num);
    16         } else{
    17             _minHeap.add(num);
    18         }
    19         // keep balance 
    20         if(_maxHeap.size() ==_minHeap.size()+2){
    21             _minHeap.add(_maxHeap.poll());
    22         }
    23         
    24           if(_minHeap.size() ==_maxHeap.size()+1){
    25             _maxHeap.add(_minHeap.poll());
    26         }
    27         
    28     }
    29     // the number of data is even or odd 
    30     public double findMedian() {
    31         if (_maxHeap.size() == _minHeap.size()) {
    32             return (_maxHeap.peek() + _minHeap.peek()) /  2.0;
    33         }else {
    34             return _maxHeap.peek();
    35         }
    36     }       
    37 }
    38 
    39 /**
    40  * Your MedianFinder object will be instantiated and called as such:
    41  * MedianFinder obj = new MedianFinder();
    42  * obj.addNum(num);
    43  * double param_2 = obj.findMedian();
    44  */
  • 相关阅读:
    Rsync企业实战之自动异地备份(转)
    Linux启动过程详解 (转)
    Linux系统下修改环境变量PATH路径的三种方法
    linux更改启动级别后,无法启动的问题解决
    MySQLAdmin用法
    mysql toolkit 用法[备忘] (转)
    mysql edit
    MySQL中SSL配置
    mysql ALTER COLUMN MODIFY COLUMN CHANGE COLUMN 区别及用法 (转)
    MySQL 使用mysqld_multi部署单机多实例详细过程 (转)
  • 原文地址:https://www.cnblogs.com/liuliu5151/p/9189717.html
Copyright © 2020-2023  润新知