I think am done with this. Feedback before i release it on an unsuspecting public, please!

git-svn-id: https://shorewall.svn.sourceforge.net/svnroot/shorewall/trunk@3695 fbd18981-670d-0410-9b5c-8dc0c1a9a2bb
This commit is contained in:
paulgear 2006-03-18 12:36:41 +00:00
parent 7fb04eaf51
commit 7d1ec11072

View File

@ -38,75 +38,86 @@
<title>Background</title> <title>Background</title>
<para>In early March 2006, i embarked on the journey of surveying <para>In early March 2006, i embarked on the journey of surveying
Shorewall users. Initially this sprang from my own curiosity: it seemed to Shorewall users. Initially this sprang from my own curiosity: i thought
me that some of the systems at work on which i was using Shorewall were that some of the systems at work on which i use Shorewall may be bigger
bigger and more complex than a lot of the ones others were using, and i and more complex than most others, and i wanted to find out if there are
wanted to find out if there were people out there who used it like i did. people out there who use Shorewall like i do. As started thinking about
As started thinking about the questions i would ask, i realised that i the questions i would ask, i realised that if i asked the right questions,
could ask a few more questions that might help us as a project to i could create a survey that might help the Shorewall project better to
understand a bit more about all of our users.</para> understand its users.</para>
<para>I used <ulink url="http://www.zoomerang.com">Zoomerang</ulink> to <para>I used <ulink url="http://www.zoomerang.com">Zoomerang</ulink> to
create the survey. It has a number of tools that make it really easy to create the survey. It has a number of tools that make it easy to create
create useful surveys. To get the most benefit out of Zoomerang, you have useful surveys. To get the most benefit out of Zoomerang, you have to
to subscribe to their professional version. In the long term, it would be subscribe to their professional version.</para>
great to see a practical free software alternative that we could
self-host. A number of free content management systems such as <ulink <para>In the long term, it would be great to have a practical free
url="http://drupal.org">Drupal</ulink> have a survey module, but when i software alternative that could be self-hosted. A number of free content
last looked at them, they were much more limited and harder to use than management systems such as <ulink url="http://drupal.org">Drupal</ulink>
Zoomerang.</para> have a survey module, but when i last looked at them, they were more
limited and harder to use than Zoomerang.</para>
<section> <section>
<title>Take the survey</title> <title>Survey and results links</title>
<para>The survey is still open as of this writing, and can be accessed <para>The survey is still open as of this writing, and can be accessed
at <ulink url="http://www.zoomerang.com/survey.zgi?p=WEB2253NNBCN44">the at <ulink url="http://www.zoomerang.com/survey.zgi?p=WEB2253NNBCN44">the
Zoomerang survey page</ulink>. Further participation is encouraged. The Zoomerang survey page</ulink>. Further participation is encouraged. The
figures quoted in this document reflect the latest results at the time figures quoted in this document reflect the results at the time of
of writing.</para> writing.</para>
</section>
<section>
<title>Survey results</title>
<para>The <ulink <para>The <ulink
url="http://www.zoomerang.com/reports/public_report.zgi?ID=L22KHC6BPGLS">public url="http://www.zoomerang.com/reports/public_report.zgi?ID=L22KHC6BPGLS">public
results</ulink> of the survey are also available. If you complete the results</ulink> of the survey are available. If you complete the survey,
survey, a link to the results is provided on the thank you page.</para> a link to the results is provided on the thank you page.</para>
</section>
<section>
<title>Sample size</title>
<para>An important note about this survey is that it has a small sample
size (103 complete responses at the time of writing), so any conclusions
drawn should be considered tentative.</para>
<para>To speculate on the overall number of users that this sample
represents, the <ulink
url="http://popcon.debian.org/source/by_inst.gz">Debian popularity
contest</ulink> reports 478 installations of Shorewall, 285 of which are
in active use. Assuming that the popularity contest represents 30% of
the Debian installed base (likely ridiculously optimistic), this would
make the number of active Shorewall systems approximately:</para>
<para>285 / 0.3 (percentage of Debian systems) / 0.26 (percentage Debian
holds of all distributions) = 3654 (rounding up the numbers to the
nearest whole, and assuming the percentages extrapolate
regularly)</para>
<para>This means that our survey represents a maximum of 2.8% of the
installed base, likely far less.</para>
</section>
<section>
<title>Other possible inaccuracies</title>
<para>Additionally, since the survey was open to multiple responses, it
could be that some people answered the questions about themselves more
than once, despite instructions to the contrary in the introduction
page.</para>
<para>There is an error in the released version of the survey for
question 15 (RAM size): it was a multiple choice question rather than
single choice, and thus there were more results than expected. The
number of errors doesn't seem to be significant.</para>
<para>If you notice any errors in this analysis, or have any suggestions
about how to improve it, please contact the author at <ulink
url="mailto:pgear@shorewall.net">pgear@shorewall.net</ulink>.</para>
</section> </section>
</section> </section>
<section> <section>
<title>Results analysis</title> <title>Results analysis</title>
<para>An important note about this survey is that it has a small sample
size (103 complete responses at the time of writing), so any conclusions
drawn should be considered tentative.</para>
<para>To speculate on the overall number of users that this sample
represents, the <ulink
url="http://popcon.debian.org/source/by_inst.gz">Debian popularity
contest</ulink> reports 478 installations of Shorewall, 285 of which are
in active use. Assuming that the popularity contest represents 30% of the
Debian installed base (likely ridiculously optimistic), this would make
the number of active Shorewall systems approximately:</para>
<para>285 / 0.3 (percentage of Debian systems) / 0.26 (percentage Debian
holds of all distributions) = 3654 (rounding up the numbers to the nearest
whole, and assuming the percentages extrapolate regularly)</para>
<para>This means that our survey represents a maximum of 2.8% of the
installed base, likely far less.</para>
<para>Additionally, since the survey was open to multiple responses, it
could be that some people answered the questions about themselves more
than once, despite instructions to the contrary in the introduction
page.</para>
<para>If you notice any errors in this analysis, or have any suggestions
about how to improve it, please contact the author at <ulink
url="mailto:pgear@shorewall.net">pgear@shorewall.net</ulink>.</para>
<section> <section>
<title>Organisations</title> <title>Organisations</title>
@ -222,9 +233,9 @@
hardware, with a further 6% running it on x86-64/EM64T platforms. One hardware, with a further 6% running it on x86-64/EM64T platforms. One
response was received indicating use of Shorewall on MIPS (Linksys WRT response was received indicating use of Shorewall on MIPS (Linksys WRT
platform). No responses were received for any other hardware platform. platform). No responses were received for any other hardware platform.
While it is not surprising that Intel architectures would be first, While it is not surprising that Intel architectures would be dominant,
given their marketplace dominance, it seems a little skewed not to have given their market share, it seems a little skewed not to have any
any representatives of other distributions.</para> representatives of other distributions.</para>
<para>A good spread of CPU power is shown in the survey responses. The <para>A good spread of CPU power is shown in the survey responses. The
largest group was 400-999 MHz (30%), with only 16% of responses largest group was 400-999 MHz (30%), with only 16% of responses
@ -238,11 +249,8 @@
bias towards higher RAM figures. The majority (52%) of systems have bias towards higher RAM figures. The majority (52%) of systems have
between 256 and 1023 MB; only 11% of systems have less than 128 MB; 28% between 256 and 1023 MB; only 11% of systems have less than 128 MB; 28%
have 1024 MB or more. This reflects the more server-oriented workload have 1024 MB or more. This reflects the more server-oriented workload
that many Shorewall systems run (see the section on server roles below). that many Shorewall systems run (see the section on server roles
(Note that there is an error in the released version of the survey for below).</para>
this question: it was a multiple choice question rather than single
choice, and thus there were more results than expected. However, the
number of errors doesn't seem to be significant.)</para>
<para>Shorewall systems on the whole tend toward smaller OS hard disks, <para>Shorewall systems on the whole tend toward smaller OS hard disks,
with 42% having disks 39 GB or smaller. The largest group by a small with 42% having disks 39 GB or smaller. The largest group by a small
@ -277,9 +285,9 @@
<para>The message about maintaining an up-to-date Shorewall system seems <para>The message about maintaining an up-to-date Shorewall system seems
to have gotten through, with 61% of respondents running the latest to have gotten through, with 61% of respondents running the latest
version (3.0), and an additional 22% running the previous stable version stable version (3.0), and an additional 22% running the previous stable
(2.4). Only 14% of users are running unsupported versions (2.2 and version (2.4). Only 14% of users are running unsupported older versions
older).</para> (2.2 and previous).</para>
<para>The most common roles played by Shorewall systems are:</para> <para>The most common roles played by Shorewall systems are:</para>
@ -309,8 +317,8 @@
<section> <section>
<title>Comments from users</title> <title>Comments from users</title>
<para>Here's a sample of the comments we received about the survey - <para>Following is a sample of the comments we received about the survey
they have been carefully sanitised to make us look good. ;-)</para> - they have been carefully sanitised to make us look good. ;-)</para>
<itemizedlist> <itemizedlist>
<listitem> <listitem>
@ -357,6 +365,76 @@
</section> </section>
</section> </section>
<section>
<title>Lessons learned about survey technique</title>
<section>
<title>Treat surveys like releasing free software</title>
<itemizedlist>
<listitem>
<para>test on a small group before you go public</para>
</listitem>
<listitem>
<para>release early and often</para>
</listitem>
<listitem>
<para>make branches (copies) when you release alpha and beta
versions</para>
</listitem>
<listitem>
<para>merge the changes from branches (lessons you learned in those
versions) into the main trunk</para>
</listitem>
</itemizedlist>
</section>
<section>
<title>Start small and work towards what you want to know with specific,
concrete questions</title>
<para>I tried to do everything in one survey, and ended up confusing
some people. For example, despite the fact that the survey's start page
clearly says "Please answer the questions for only ONE SYSTEM running
Shorewall", i received multiple comments saying that they couldn't
answer accurately because they ran more than Shorewall system.</para>
<para>It would have been better to have two surveys: one about the
people who use Shorewall, and another about the systems they run it on.
Better still would be for Shorewall to automatically collect appropriate
information about systems and request permission to send it to a central
location for statistical analysis. How to do this and maintain users'
privacy and obtain their permission efficiently is not an easy problem
with a product like Shorewall, which doesn't actually stay running on
user systems, and doesn't present a user interface per se.</para>
</section>
<section>
<title>Be prepared beforehand</title>
<para>Within hours of the survey's release, 50% of the results were in.
Within 3 days, it hit the Zoomerang basic survey limit of 100 responses.
I had not planned for such an enthusiastic response, and also was too
busy to download all of the results before the survey's time limit
expired. Fortunately, i was able to obtain funding to allow a Zoomerang
"pro" subscription to be purchased and thus provide advanced analysis,
and complete downloads of the results.</para>
</section>
<section>
<title>Incrementally improve your surveys</title>
<para>The final version of this survey was released still with a few
bugs. The released version was just a copy of my master survey, and i
continued to maintain the master after the final survey was released
(and during this analysis), and i'm sure the next version will be even
better.</para>
</section>
</section>
<section> <section>
<title>Possible implications for the Shorewall project</title> <title>Possible implications for the Shorewall project</title>
@ -370,7 +448,7 @@
<para>Shorewall seems to be predominantly used in small, i386-based <para>Shorewall seems to be predominantly used in small, i386-based
environments such as home LANs and small businesses. It seems to be environments such as home LANs and small businesses. It seems to be
frequently combined with a number of other basic functions, such as DNS, frequently combined with a number of other basic functions, such as DNS,
DHCP, NTP, VPN. Integration with (or perhaps providing a plugin module DHCP, NTP, VPN. Integration with (or perhaps providing a plug-in module
for) a dedicated gateway distribution such as ipcop, Smoothwall, or Clark for) a dedicated gateway distribution such as ipcop, Smoothwall, or Clark
Connect might be a good way to serve the needs of our users.</para> Connect might be a good way to serve the needs of our users.</para>
</section> </section>
@ -378,100 +456,19 @@
<section> <section>
<title>Possible implications for other free software projects</title> <title>Possible implications for other free software projects</title>
<para></para> <itemizedlist>
</section> <listitem>
<para>The essence of free software is software by the people, for the
people. Knowing who the people are and what their needs are is
critical to this process.</para>
</listitem>
<section> <listitem>
<title>Lessons learned about surveys</title> <para>If at all possible, build statistics gathering into your
application, and find a way to encourage people to use it. This
<para></para> concrete data will help confirm the results of any surveys you might
conduct.</para>
<section> </listitem>
<title>What i did right</title> </itemizedlist>
<para></para>
<itemizedlist>
<listitem>
<para>Treat surveys like releasing free software:</para>
<itemizedlist>
<listitem>
<para>release early and often</para>
</listitem>
<listitem>
<para>test on a small group before you go public</para>
</listitem>
<listitem>
<para>make branches (copies) when you release alpha and beta
versions</para>
</listitem>
<listitem>
<para>merge the changes from branches (lessons you learned in
those versions) into the main trunk</para>
</listitem>
</itemizedlist>
</listitem>
</itemizedlist>
</section>
<section>
<title>What i did wrong</title>
<para></para>
<itemizedlist>
<listitem>
<para>Start small and work towards what you want to know.</para>
<para>I tried to do everything in one survey, and ended up confusing
some people. For example, despite the fact that the survey's start
page clearly says "Please answer the questions for only ONE SYSTEM
running Shorewall", i received multiple comments saying that they
couldn't answer accurately because they ran more than Shorewall
system.</para>
<para>It would have been better to have two surveys: one about the
people who use Shorewall, and another about the systems they run it
on. Better still would be for Shorewall to automatically collect
appropriate information about systems and request permission to send
it to a central location for statistical analysis. How to do this
and maintain users' privacy and obtain their permission efficiently
is not an easy problem with a product like Shorewall, which doesn't
actually stay running on user systems, and doesn't present a user
interface per-se.</para>
</listitem>
<listitem>
<para>Be prepared beforehand.</para>
<para>Within hours of the survey's release, 50% of the results were
in. Within 3 days, it hit the Zoomerang basic survey limit of 100
responses. I had not planned for such an enthusiastic response, and
also was too busy to download all of the results before the survey's
time limit expired. Fortunately, i was able to obtain funding to
allow a Zoomerang "pro" subscription to be purchased and thus
provide advanced analysis, and complete downloads of the
results.</para>
</listitem>
</itemizedlist>
<para></para>
</section>
</section>
<section>
<title></title>
<para></para>
</section>
<section>
<title></title>
<para></para>
</section> </section>
</article> </article>