Sipping: Systemically Incremental Programming
Overview
The process of splitting up a project into SIPs is just the first part of the process. Once the individual SIPs have been identified, the next step is deciding the order in which to implement the SIPs.
This is a critical part of Sipping. Usually, the first SIP to implement is the one that most quickly hooks up the desired functionality into a system that can be used to test it. Once this high level SIP has been implemented, more detailed SIPs can be implemented to further test and extend the functionality original developed.
Let's take a sample project. We are going to write a file search tool that will:
- Look through a directory (and optionally sub directories) for files with specific names (including wild cards)
- Be able to filter for specific file types (extensions)
- Be able to look in specific file types for a text
- Be able to sort the results according to one or more sorting algorithms
The FileSearcher class
This problem can be broken down into a variety of SIPs. In thinking about the problem, I find it helpful to first decide the overall system architecture. In this case, I think that I would like to design a class called FileSearcher. This class will collect all of the related configuration data such as RootDirectory, SortOrder, ValidExtensions, TextExtensions, etc into one place.
During this early thought process, I make some high level design decisions. These are by no means final decisions. Quite to the contrary, they are simply goals or objectives that will help to guide the design process. At this point, all I can do is to image a very fully functional class called FileSearcher that will provide (centrally) all of the functionality that we're looking to incorporate into our model. As we add new features, this class will provide the framework that all of our new functionality will be added to.
With this high level design in mind, the first SIP that I will design (and code by the way) is the underlying framework that all of the other features will be built on. In other words, in the first SIP I almost always want to have a functional prototype of the final design. It usually won't do most of what I actually want - but it will do a basic implementation of the final design - and hopefully will work in a way that is as close to the final design as possible.
For the first SIP, I usually skip to the last step, and design the framework of the final class that I want. In this case, here's what my first sip looks like...
SIP 1: Create a basic FileSearcher class
The objective of this step is to create a functional enough FileSearcher class that it can actually be used. I will create a simple UI to test my class. My plan with SIP 1 is to create the following functionality:
FileSearcher:
- Create the FileSearcher class
- Add a String RootDirectory { get; set; } property that will define the root of the search area
- Define an event called FileFound that will be fired each time a matching file is found
- Define a void Search() method that will start a search of the RootDirectory and fire the FileFound event for each file found.
That's it for the FileSearcher class. It will (as you can see) still not filter files, search sub-directories, sort or really do any of the other features that we might want to implement. But it will do something, and will let us design a UI to test our class.
Testing UI:
- A windows form with a ListBox, Button and FileSearcher property.
- When the button is clicked, it will clear the ListBox's Items, and call FileSearcher.Search().
- In the FileFound event handler, the file found will simply be added to the ListBox.
Again - that's it. Once implemented, the application can be run. Since the RootDirectory property is set (by default) to 'C:\', what should show up in the ListBox when the Search button is clicked, is a list of files in the root of your C Drive. Here's the code:
FileSearcher.cs
1public class FileSearcher
2{
3 public FileSearcher()
4 {
5 this.RootDirectory = "C:\\";
6 }
7 /// <summary>
8 ///
9 /// </summary>
10 public event EventHandler FileFound;
11 protected virtual void OnFileFound(FileInfo fi)
12 {
13 if (this.FileFound != null) this.FileFound(fi, EventArgs.Empty);
14 }
15
16 /// <summary>
17 ///
18 /// </summary>
19 public String RootDirectory { get; set; }
20
21 /// <summary>
22 ///
23 /// </summary>
24 public void Search()
25 {
26
27 this.SearchDirectory(this.RootDirectory);
28 }
29
30 /// <summary>
31 ///
32 /// </summary>
33 /// <param name="directory"></param>
34 private void SearchDirectory(string directory)
35 {
36
37 foreach (String file in Directory.GetFiles(directory))
38 {
39 this.OnFileFound(new FileInfo(file));
40 }
41 }
42}
At this point, the class is very simple. In fact, by design, it is as simple as I can possibly make it while still providing enough functionality to actually make it a useful enough class to be testable.
By keeping it so simple and focused, my goal is to make it the best possible class that I can at this point. All of the directory names should make sense, work together, be cohesive, and of course - functional. The most important decision being made at this point is, what's the best overall architecture. Questions that could be asked at this point include:
- Do I want to pass parameters into the Search() method, or as implemented here, have all of the configuration options (like the RootDirectory property) as class level properties?
- Do I want to have a SearchResultEventArgs class, and make the FileFound event an EventHandler<SearchResultEventArgs> instead of a simple EventHandler event? This is definitely something that will be examined in future SIPs, and there is a high probability that this will be enhanced later. For now, a simple FileInfo as the sender is totally sufficient for our purposes.
And the consuming form Form1.cs:
1public partial class Form1 : Form
2{
3 public Form1()
4 {
5 InitializeComponent();
6
7
8 this.FileSearcher.FileFound += new EventHandler(FileSearcher_FileFound);
9 }
10
11 /// <summary>
12 ///
13 /// </summary>
14 void FileSearcher_FileFound(object sender, EventArgs e)
15 {
16 searchResults.Items.Add(sender);
17 }
18
19 /// <summary>
20 ///
21 /// </summary>
22 private FileSearcher FileSearcher = new FileSearcher();
23
24 /// <summary>
25 ///
26 /// </summary>
27 private void searchButton_Click(object sender, EventArgs e)
28 {
29 this.searchResults.Items.Clear();
30 this.FileSearcher.Search();
31 }
32}
All the Form1 does is call FileSearcher.Search(), and then add the files found to the listbox. At this point, you can't configure the RootDirectory, and no other search features are implemented. The goal of the first SIP is almost always to put enough of a framework in place, that the rest of the features desired can be easily tested. The order of the rest of the SIPs becomes less important, but I will lay out the rest of the SIPs as I would be inclined to implement them.
- SIP 2: Recursion
- SIP 3: File Pattern Matching
- SIP 4: Valid Extension Restriction
- SIP 5: Text File Searching
- SIP 6: Sorting
With this list in front of us, I'd like to point something out. For simply listing files, using a simple EventHandler is totally sufficient. This is why, for the first pass, using a simple EventHandler made more sense. When we add recursion, this EventHandler will still work. Even after SIPs 3 & 4, when we've added the ability to search for files with specific name and extension filtering, it should still work.
Once we get to SIP 5 however, and are searching through text files for specific text, we might need a SearchResultEventArgs that can provide more information than a simple FileInfo can convey. Specifically, I can picture wanting to return details about what matched, a list of matching lines, portions of the matching text, etc.
Preemptive Optimization
It is very important to point out that while we could preemptively put this structure into place now, at this point we don't really know enough to actually do that effectively. To some extent, the Sipping method is specifically designed to prevent this kind of work from happening "before it's time". Instead, that change should be made systemically, as an incremental change to the FileSearcher class, when we get to that SIP!
There's no need to rush - we'll get there soon enough. Doing it now would simply confuse the process, make the current task more complicated, and at this point, really not add anything to the equation (other than confusion/complexity).
Putting this structure in place now (when it is not yet needed) would be overkill. If all we ever implemented was this first FileSearcher that could simply search 1 directory of files, with no filtering or sorting of any kind - the EventHandler delegate is totally adequate. Leave it at that for now, and trust that it will be easy enough to add (as it's own SIP) down the road.
Ready for the next SIP? Let's incrementally add the ability to search sub directories.
SIP 2: Sub-Directories
I would now like to add the ability to search through sub-directories. I don't necessarily want to always search recursively, but the class should have the ability to do so. At this point, because I don't want to have to search through my entire hard drive every time I click search, I'll also update the testing UI configure a smaller RootDirectory, and turn ON the RecursiveSearch property.
FileSearch.cs
1public class FileSearcher
2{
3...
4 /// <summary>
5 ///
6 /// </summary>
7 public Boolean RecursiveSearch { get; set; }
8
9 /// <summary>
10 ///
11 /// </summary>
12 /// <param name="directory"></param>
13 private void SearchDirectory(string directory)
14 {
15
16 if (this.RecursiveSearch)
17 {
18 foreach (String subDirectory in Directory.GetDirectories(directory))
19 {
20 this.SearchDirectory(subDirectory);
21 }
22 }
23
24
25 foreach (String file in Directory.GetFiles(directory))
26 {
27 this.OnFileFound(new FileInfo(file));
28
29 }
30 }
31}
This adds to the file searcher class a Boolean RecursiveSearch property that, when true, will search through Subdirectories of the RootDirectory in addition to what it did before. This adds ~5 new lines of code, and a huge amount of new functional.
Now we test - and ensure that we don't just get the RootDirectory files, but also all subfolders and files. We might also at this point test, that if we set the RecursiveSearch to false, that again, we only get the RootDirectory files.
Our class is starting to take shape. Let's take another SIP.
SIP 3: File pattern matching
Lets add the ability to filter the exact files that we're searching for. Given the infrastructure that we've already put in place, this should be another minor tweak (Systemic Incremental Programming change) to the system.
1public class FileSearcher
2{
3...
4 /// <summary>
5 ///
6 /// </summary>
7 public String SearchPattern { get; set; }
8
9 /// <summary>
10 ///
11 /// </summary>
12 /// <param name="searchPattern"></param>
13 public void Search(String searchPattern)
14 {
15 this.SearchPattern = searchPattern;
16 this.Search();
17 }
18...
19 public void Search()
20 {
21
22 if (String.IsNullOrEmpty(this.SearchPattern)) this.SearchPattern = "*.*";
23
24
25 this.SearchDirectory(this.RootDirectory);
26 }
27...
28 private void SearchDirectory(string directory)
29 {
30...
31
32 foreach (String file in Directory.GetFiles(directory, this.SearchPattern))
33...
34 }
35}
You may notice that when adding the "SearchPattern" property, I also chose to add an overloaded version of Search that could update this search pattern for us. This is an improvement that was "easy" to see, because adding support for search patterns was all I was thinking about that this phase (SIP).
Our system can now search for files matching a specific search pattern and can search recursively. It still can't look for specific extensions (except as provided by the search pattern), search for text inside of files or sort the results. Those will come. Let's add Extension filtering next.
SIP 4: Valid Extension Filtering
For this feature, I'm picturing a separate filter entirely from the SearchPattern. Basically, I'm picturing a List<String> ValidExtensions { get; } property can that be used to set valid extensions for the overall search. In looking at our current system, the most logical place for this seems to be in the OnFileFound method. In other words, right before the event gets fired, make sure that the file being returned matches our list of valid extensions (if any are present).
Here again, we should design a simple, powerful, well tested feature, because this is only thing that we're thinking about in SIP 4. Everything else already works. It should also be really easy to tweak out testing UI to validate that what we put in place does in fact, systemically modify our class, exactly as expected.
FileSearcher.cs:
1public class FileSearcher
2{
3 /// <summary>
4 ///
5 /// </summary>
6 public event EventHandler FileFound;
7 protected virtual void OnFileFound(FileInfo fi)
8 {
9...
10 if (this.FileFound != null)
11 {
12
13 if (this.ValidExtensions.Count > 0)
14 {
15
16 if (this.ValidExtensions.Count(ext =>
17 ext.ToLower() == fi.Extension.ToLower()) == 0) return;
18 }
19 this.FileFound(fi, EventArgs.Empty);
20 }
21 }
22
23 /// <summary>
24 ///
25 /// </summary>
26 public List<String> ValidExtensions { get; private set; }
27...
28}
This SIP adds a new List<String> ValidExtensions { get; } property that can be used to filter the final search results to a specific list of valid extensions. If any ValidExtensions are specified, then it now counts how many of those extensions match the current file, and if 0 match, it returns before firing the FileFound event.
Simple, minor, incremental change to the system, which can now be fully tested against our previous code to verify that it works, just as expected.
I think we're ready for SIP 5 - text file searching.
SIP 5: Text Searching
An interesting thing happened when implementing this SIP which is A) Not uncommon when Sipping and B) Yet another reason that sipping proves to be so effective. Specifically, while originally, this feature seemed superficially at least to be a relatively easy thing to implement, a variety of complications showed themselves, which have caused me to add 4 more SIPs to our plan.
Specifically, when first approaching this SIP I quickly realized that to begin with, I was going to leave the existing EventHandler in place, and push the more sophisticated EventHandler<SearchResultEventArgs> implementation to a later SIP. This decision was based on the fact that this SIP was already complicated enough, without adding the complexity of restructuring how the event actually gets fired.
Also, when I first tested the SIP, it included a variety of files that were not text files. When they weren't text files, I was not searching them, but I was including them in the result. This was obviously a mistake (bug), but because this was the only feature that I was focusing on, I realized that this could (in certain situations at least) actually be the desired behavior. In other words, I am going to add a SIP at the end of my list that will add a new property called ExcludeNonTextFilesFromSearch (which will default to true). If it was set to true, all matching Non-text files would be included, but only text files that matched would be included.
Another "bug" that was exposed through testing was that I had a TextExtension that was not in the list of ValidExtensions. Again, as a later SIP I could add a feature that said "bool AssumeTextExtensionsAreValid"
In summary, completing this SIP in the simplest way that I could, created 3 new SIPs to add to our list
New SIPs:
- SIP 7: SearchResultEventArgs
- SIP 8: Add bool AssumeTextExtensionsAreValid property
- SIP 9: Add bool ExcludeNonTextFilesFromSearch property.
I choose not to implement these now to keep my changes small, targeted and incremental. I will come back to these features at a later time though.
1public class FileSearcher
2{
3...
4 protected virtual void OnFileFound(FileInfo fi)
5 {
6 if (this.FileFound != null)
7 {
8...
9
10 if (!String.IsNullOrEmpty(this.TextFileSearchString))
11 {
12
13 if (this.TextExtensions.Count(ext =>
14 ext.ToLower() == fi.Extension.ToLower()) > 0)
15 {
16
17 String searchString = String.Empty;
18 String fileContents = String.Empty;
19
20
21 if (this.CaseInsensitiveTextFileSearch)
22 {
23 searchString = this.TextFileSearchString.ToLower();
24 fileContents = File.ReadAllText(fi.FullName).ToLower();
25 }
26 else
27 {
28 searchString = this.TextFileSearchString;
29 fileContents = File.ReadAllText(fi.FullName);
30 }
31
32
33 if (!fileContents.Contains(searchString)) return;
34 }
35
36 else return;
37 }
38
39
40 this.FileFound(fi, EventArgs.Empty);
41 }
42 }
43...
44 /// <summary>
45 ///
46 /// </summary>
47 public List<String> TextExtensions { get; private set; }
48
49 /// <summary>
50 ///
51 /// </summary>
52 public String TextFileSearchString { get; set; }
53
54 /// <summary>
55 ///
56 /// </summary>
57 public Boolean CaseInsensitiveTextFileSearch { get; set; }
58...
59}
Conclusion
I am not going to put all of the SIPs into this article, but you can see how the problem evolves slowly, SIP by SIP. When thinking about the order of your SIPs, I would suggest the following guidelines:
- The most important SIP is the 1st one. The objective of the first SIP should be to put the basic framework in place so that the rest of the SIPs can be tested, one by one, as they are developed.
- The order of the rest of the SIPs is substantially less important. That being said, I tend to implement the simplest ones first, so that by the time I'm getting to the more complicated SIPs, the system has already been well tested, and logical inconsistencies in the design have been worked out under simpler scenarios. In this way, when I get to the more complicated SIPs (like text searching in this case), the rest of the system was already in place, and I could really focus on just the text searching, which was already complicated enough.
- Look for opportunities to split your SIPs. Often, items that appear to be one SIP when starting a project, turn out to be 3, 4 or even more SIPs when you actually dig into the code. This is what Sipping is designed to handle. When you're not sipping, as a step gets more complicated, traditional approaches usually just try to juggle more and more balls, and often we end up with Spaghetti. The whole point of sipping is that we have a system that is specifically designed to allow us to say "This is getting complicated, let me push these other 3 'features' to a later SIP." They are almost always good ideas, but getting distracted with new ideas is usually what gets developers into trouble. If it's really a good idea, it should fit beautifully into the system if/when we choose to tackle that new feature - as it's own SIP!
Hopefully this has helped to further explain how Sipping works, and how the SIP implementation order can be used to keep your project On-track, On-budget, and defect free!
The Final Code:
1public class FileSearcher
2{
3 public FileSearcher()
4 {
5 this.RootDirectory = "C:\\";
6 this.RecursiveSearch = true;
7 this.ValidExtensions = new List<String>();
8 this.TextExtensions = new List<String>(new String[] { ".txt" });
9 }
10
11 /// <summary>
12 ///
13 /// </summary>
14 public event EventHandler FileFound;
15 protected virtual void OnFileFound(FileInfo fi)
16 {
17 if (this.FileFound != null)
18 {
19
20 if (this.ValidExtensions.Count > 0)
21 {
22
23 if (this.ValidExtensions.Count(ext =>
24 ext.ToLower() == fi.Extension.ToLower()) == 0) return;
25 }
26
27
28 if (!String.IsNullOrEmpty(this.TextFileSearchString))
29 {
30
31 if (this.TextExtensions.Count(ext =>
32 ext.ToLower() == fi.Extension.ToLower()) > 0)
33 {
34
35 String searchString = String.Empty;
36 String fileContents = String.Empty;
37
38
39 if (this.CaseInsensitiveTextFileSearch)
40 {
41 searchString = this.TextFileSearchString.ToLower();
42 fileContents = File.ReadAllText(fi.FullName).ToLower();
43 }
44 else
45 {
46 searchString = this.TextFileSearchString;
47 fileContents = File.ReadAllText(fi.FullName);
48 }
49
50
51 if (!fileContents.Contains(searchString)) return;
52 }
53
54 else return;
55 }
56
57
58 this.FileFound(fi, EventArgs.Empty);
59 }
60 }
61
62 /// <summary>
63 ///
64 /// </summary>
65 public String RootDirectory { get; set; }
66
67 /// <summary>
68 ///
69 /// </summary>
70 public String SearchPattern { get; set; }
71
72 /// <summary>
73 ///
74 /// </summary>
75 public List<String> ValidExtensions { get; private set; }
76
77 /// <summary>
78 ///
79 /// </summary>
80 public List<String> TextExtensions { get; private set; }
81
82 /// <summary>
83 ///
84 /// </summary>
85 public String TextFileSearchString { get; set; }
86
87 /// <summary>
88 ///
89 /// </summary>
90 public Boolean CaseInsensitiveTextFileSearch { get; set; }
91
92 /// <summary>
93 ///
94 /// </summary>
95 public Boolean RecursiveSearch { get; set; }
96
97 /// <summary>
98 ///
99 /// </summary>
100 /// <param name="searchPattern"></param>
101 public void Search(String searchPattern)
102 {
103 this.SearchPattern = searchPattern;
104 this.Search();
105 }
106
107 /// <summary>
108 ///
109 /// </summary>
110 public void Search()
111 {
112
113 if (String.IsNullOrEmpty(this.SearchPattern)) this.SearchPattern = "*.*";
114
115
116 this.SearchDirectory(this.RootDirectory);
117 }
118
119 /// <summary>
120 ///
121 /// </summary>
122 /// <param name="directory"></param>
123 private void SearchDirectory(String directory)
124 {
125
126 if (this.RecursiveSearch)
127 {
128 foreach (String subDirectory in Directory.GetDirectories(directory))
129 {
130 this.SearchDirectory(subDirectory);
131 }
132 }
133
134
135 foreach (String file in Directory.GetFiles(directory, this.SearchPattern))
136 {
137 this.OnFileFound(new FileInfo(file));
139 }
140 }
141}