About calculation of Standard deviation(σ)

Welcome to our Forums Product Development Bug reports About calculation of Standard deviation(σ)

Viewing 4 posts - 1 through 4 (of 4 total)
  • Author
    Posts
  • #730
    peng wang
    Participant

      I found when I use excel to calculate the standard deviation, the result is not the same as calculated in the DataGraph, but it is same with the “s” called as “Unbiased estimate of the standard deviation”. I don’t understand statistics, however, when I check some websites and my previous data, I found it should be wrong in DataGraph. Kindly hope you give me an answer, thank you.

      #755
      dgteam
      Moderator

        First let me explain what we do in DataGraph …

        In DataGraph, s is the sample standard deviation and σ is the population standard deviation. The population standard deviation is also simply referred to as the standard deviation and s is the unbiased estimate of σ.  s is appropriate for small samples from a larger population.  The one you use depends on the data/analysis.

        I found two basic examples here that demonstrate the difference:
        https://en.wikipedia.org/wiki/Standard_deviation

        The first example is a sample of a larger population, so ‘s’ is used.

        Example 1

        The second is a sample of the entire population and ‘σ’ is used.

        Example 2

        Note the values from these graphs were calculated in DataGraph and match the values shown on the wiki page.   We have also verified our calculations with other stats software (i.e., R).

        Based on that, the equations in DataGraph are correctly implemented. If you’re interested here are the equations:

        Equations

        Also, we just uploaded an example file with these graphs/equations to the on-line examples, that you can get here …

        onlineexamples

        In Excel, they used to have the function STDEV, which calculated s.
        https://support.office.com/en-us/article/STDEV-function-51FECAAA-231E-4BBB-9230-33650A72C9B0

        Excel now has separate functions, STDEV.S or STDEV.P
        https://support.office.com/en-us/article/stdev-s-function-7d69cf97-0c1f-4acf-be27-f3e83904cc23

        I assume this change was made in Excel to avoid confusion about which one was being calculated.  If you were using STDEV then you were calculating s.  This seems consistent with your comparison between DataGraph and Excel.

        Note that the equations for s and σ are similar, the difference being in the denominator of s (n-1).  As a result, s is always greater than σ, for the same set of data.  As n increases, s approaches the value of σ. Thus for large sample sizes n>30, s and σ are approx. the same values.

        Sounds like you may have been choosing σ instead of s, but hopefully this helps explain each one and why you would pick one versus another.

         

         

         

         

        • This reply was modified 4 years, 7 months ago by dgteam.
        #759
        peng wang
        Participant

          Thank you so much. Before I don’t understand the definition of “population standard deviation” and “sample standard deviation”. In my work, I always use the STDEV in Excel and when I learn the manual of DataGraph “Global variable” Page 21, and I found a form displays sigma “standard deviation” and s “Unbiased estimate of the standard deviation”. So I thought the sigma is the STDEV I used, now I understand and I think if you correct the definition with “population standard deviation” should be better than “Standard deviation”. Of course, maybe because I am a layman.

          #762
          dgteam
          Moderator

            You’re Welcome.  Glad that was helpful.

            We will add those descriptions in the documentation.

          Viewing 4 posts - 1 through 4 (of 4 total)
          • You must be logged in to reply to this topic.

          Welcome to our Forums Product Development Bug reports About calculation of Standard deviation(σ)