[Cadre-politics] first summer projects
Dan MacNeil
dan at thecsl.org
Thu Jun 1 01:59:25 EDT 2006
CONTROL PANEL (Manny)
SERVICE MONITORING (Matt & Rob)
LDAP (John)
DOWNTIME DATABASE (Kamala)
SPAM FILTERING (John & TBA)
CUSTOMER SURVEY (TBA)
LOWELL DEEDS / UTEC (dan)
BACKUP (john)
SLICE (john)
VIRTUAL SERVERS (TBA)
CONTROL PANEL (Manny)
Right now, Manny is confirming that no existing web-based control panel
meets our needs. He's taking the existing short, easy-to-understand
entry [1] at Wikipedia and fleshing it out.
[1] http://en.wikipedia.org/wiki/Control_Panel_%28Web_Hosting%29
Apart from serving us and the users of Wikipedia, this will look good in
his portfolio. The next step would be either to start installing one or
to start writing our own.
SERVICE MONITORING (Matt & Rob)
Mon is a program that runs on one server and periodically checks whether
email, web, database, DNS, LDAP, etc. are running. If they aren't, it
takes some action: it sends an email, text message, or IM, trips a
relay, or whatever.
In their first day, Rob & Matt got Mon [2] installed on a workstation
and monitoring another workstation. Next steps are listing the hosts &
services we wish to monitor, defining the alert and re-alert levels we
need, installing it on dedicated boxes and moving it offsite to
downtown and CA. From past analysis of downtime, this should reduce the
time it takes for us to respond to issues.
[2] http://www.kernel.org/software/mon/
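One way to think about "alert and re-alert levels" is as two numbers per
service: how many failed checks in a row before the first alert, and how
long to wait before nagging again. The Python sketch below only
illustrates that idea. It is not Mon's configuration format, and the
names AlertPolicy, ServiceState, alert_after and realert_every are made
up for the example.

```python
from dataclasses import dataclass


@dataclass
class AlertPolicy:
    """Hypothetical per-service policy; not part of Mon itself."""
    alert_after: int      # consecutive failures before the first alert
    realert_every: float  # seconds between repeat alerts while still down


class ServiceState:
    def __init__(self, policy):
        self.policy = policy
        self.failures = 0
        self.last_alert = None  # time of the last alert sent, or None

    def record(self, ok, now):
        """Record one check result; return True if an alert should go out."""
        if ok:
            # Service recovered: reset so the next outage alerts afresh.
            self.failures = 0
            self.last_alert = None
            return False
        self.failures += 1
        if self.failures < self.policy.alert_after:
            return False
        if (self.last_alert is None
                or now - self.last_alert >= self.policy.realert_every):
            self.last_alert = now
            return True
        return False
```

With alert_after=2 and realert_every=3600, a single blip stays quiet, the
second failed check in a row alerts, and we get reminded once an hour
until the service comes back.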
The time consuming part will be adding config files to CVS, which
requires learning CVS and creating an installation checklist.
One thing to note is that out of the box Mon won't catch some failures
in our email system. It just probes to see if the SMTP server is
answering on the SMTP (Simple Mail Transfer Protocol) port. It doesn't
have a way to know that the virus scanning queue behind the mail queue
is stalled.
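To make the limitation concrete, here is roughly what a port-level probe
amounts to. This is a minimal Python sketch, not Mon's actual monitor
script; the function names and the default timeout are assumptions for
illustration.

```python
import socket


def greeting_ok(banner):
    """An SMTP server that is ready greets with a 220 reply code."""
    return banner.startswith(b"220")


def smtp_port_answers(host, port=25, timeout=10.0):
    """Return True if something on host:port sends an SMTP 220 greeting."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as conn:
            banner = conn.recv(512)
    except OSError:
        # Refused, unreachable, or timed out: treat as "not answering".
        return False
    return greeting_ok(banner)
```

A server whose virus scanning queue is stalled will still send its 220
greeting, so a probe like this reports "up". Catching that failure would
need a check that looks at the queue itself or sends a test message all
the way through.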
LDAP (John)
Our downtime actually isn't that big, but a big chunk of our recent
downtime is due to the LDAP server halting. John's working on creating
a backup server. Eventually we may distribute caching LDAP servers in
the way we now distribute caching DNS servers.
DOWNTIME DATABASE (Kamala)
Kamala's working on the database to automate the info now at
http://downtime.thecsl.org . Odds are good that Matt & Rob will write a
custom alert to insert Mon alerts into this database.
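A custom alert of that kind could be quite small. The Python sketch
below assumes SQLite and a made-up "downtime" table; the real schema is
Kamala's to define, and the function names, column names and the host
name in the usage example are all hypothetical.

```python
import sqlite3


def open_db(path=":memory:"):
    """Open the database and create the (hypothetical) table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS downtime (
               id      INTEGER PRIMARY KEY,
               host    TEXT NOT NULL,
               service TEXT NOT NULL,
               started TEXT NOT NULL,   -- ISO 8601 timestamp
               summary TEXT
           )"""
    )
    return conn


def record_alert(conn, host, service, started, summary=""):
    """Insert one alert as a downtime row and return its row id."""
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO downtime (host, service, started, summary) "
            "VALUES (?, ?, ?, ?)",
            (host, service, started, summary),
        )
    return cur.lastrowid
```

For example, record_alert(conn, "ldap1", "ldap", "2006-06-01T01:59:25",
"no answer") would log one outage. Using placeholders rather than
pasting strings into the SQL keeps odd characters in an alert summary
from breaking the insert.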
SPAM FILTERING (unassigned but probably going John's way)
This is really starting to get out of hand. We need to update our filter
rules and create a process for keeping them updated. This could be
complicated or it could be simple. Since the CS dept's anti-spam
appliance with a $20,000 per year maintenance contract is starting to
fail as well, I'm guessing "complicated".
We also need to start doing SPF [3] for at least the listservs. SPF is a
way to say that certain servers (lists.thecsl.org) are authorized to
send email for our domain and others (boris.stick-it-you.ru) are not.
[3] http://openspf.org/
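In practice an SPF policy is a DNS TXT record on the sending domain,
something like "v=spf1 ip4:192.0.2.25 -all" (the address is a
documentation placeholder, not our real one). The Python sketch below
shows how a receiver might evaluate such a record. It is deliberately
incomplete: it only understands the ip4:, ip6: and "all" mechanisms,
and the function name spf_result is made up. Real SPF checking also
handles a, mx, include and friends, which need DNS lookups.

```python
import ipaddress


def spf_result(record, sender_ip):
    """Evaluate only the ip4:/ip6: and 'all' mechanisms of an SPF record.

    Returns "pass", "fail", "softfail", "neutral", or "none".
    """
    terms = record.split()
    if not terms or terms[0].lower() != "v=spf1":
        return "none"  # not an SPF record at all
    ip = ipaddress.ip_address(sender_ip)
    qualifiers = {"+": "pass", "-": "fail", "~": "softfail", "?": "neutral"}
    for term in terms[1:]:
        qualifier = "+"  # the default qualifier when none is written
        if term[0] in qualifiers:
            qualifier, term = term[0], term[1:]
        name, _, value = term.partition(":")
        name = name.lower()
        if name in ("ip4", "ip6"):
            if ip in ipaddress.ip_network(value, strict=False):
                return qualifiers[qualifier]
        elif name == "all":
            return qualifiers[qualifier]
        # Other mechanisms (a, mx, include, ...) need DNS; skipped here.
    return "neutral"
```

With the example record, mail from 192.0.2.25 gets "pass" and mail from
anywhere else gets "fail", which is the signal the big providers are
starting to score on.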
SPF only indirectly reduces our spam. We'll get just a little less
bounce back from messages that have us forged as senders. However, if we
want our mail delivered, we need to start with SPF. AOL, Comcast, and
Verizon are starting to score non-SPF mail as "SPAM-LIKE".
The final anti-spam task is to make things modular enough that we can
just drop another identical spam filtering box in the lineup to handle
increased load.
CUSTOMER SURVEY (next free person)
People have already been pretty vocal about some stuff, like SPAM and
downtime. We don't need a survey to know about this stuff.
However, many important people (that's no lady, that's my wife) have
said that we need to ask our customers what they want. These important
people are right. Future projects should be strongly influenced by
customer desire.
Josh Murry's (thank you) last draft is attached. I can imagine moving a
big chunk of it to a format like:
You have 100 points to spend:
____ service improvement 01
(needs 10 pts for 50% success chance
100 pts for 100% success chance)
____ service improvement 02
____ service improvement 03
...asking people to allocate these points among the options would be a
good idea.
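Tallying a survey in that format is simple arithmetic. A minimal Python
sketch, assuming each response comes back as a mapping from option name
to points; the function names are made up, and dropping ballots that
don't add up to 100 is just one possible rule.

```python
def validate_ballot(ballot, budget=100):
    """True if no option has negative points and the total equals the budget."""
    if any(points < 0 for points in ballot.values()):
        return False
    return sum(ballot.values()) == budget


def tally(ballots, budget=100):
    """Sum points per option across all valid ballots, highest total first."""
    totals = {}
    for ballot in ballots:
        if not validate_ballot(ballot, budget):
            continue  # skip ballots that over- or under-spend
        for option, points in ballot.items():
            totals[option] = totals.get(option, 0) + points
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)
```

The sorted totals give a rough priority list that also shows how
strongly people feel, which a plain yes/no checklist would not.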
LOWELL DEEDS / UTEC (dan)
July is the end of the fiscal year for lowelldeeds.com, and they don't
like to rack up expenses that confuse the bean counters then. I'll spend
that time on UTEC. (yeah, you've heard that before)
BACKUP (john)
To reach the next plateau (3 copies of 30 days' worth of backups for
everything, 1 copy 1 mile away), John & I pretty much just need to move
the offsite-backup server offsite.
Other plateaus like quick bare metal restores and removable media to
thwart evil insiders are for the future.
SLICE (john)
I think there is more work to do with this, but to my shame I've not
talked to John & Linda enough to know what.
VIRTUAL SERVERS (TBA)
Virtual servers [] let you run several operating systems on one physical
machine. Since we got a $3,000 grant to buy a server to do this, we should.
[] http://www.xensource.com/solutions/
...if nothing else we get the quick bare metal restores for free.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: CSL Survey[2].doc
Type: application/msword
Size: 24576 bytes
Desc: not available
Url : http://lists.thecsl.org/pipermail/cadre-politics/attachments/20060601/0ec81dab/CSLSurvey2-0001.doc