{"id":292281,"date":"2019-07-10T12:30:59","date_gmt":"2019-07-10T19:30:59","guid":{"rendered":"https:\/\/css-tricks.com\/?p=292281"},"modified":"2019-07-10T12:30:59","modified_gmt":"2019-07-10T19:30:59","slug":"types-or-tests-why-not-both","status":"publish","type":"post","link":"https:\/\/css-tricks.com\/types-or-tests-why-not-both\/","title":{"rendered":"Types or Tests: Why Not Both?"},"content":{"rendered":"<p>Every now and then, a debate flares up about the value of typed JavaScript. &#8220;Just write more tests!&#8221; yell some opponents. &#8220;Replace unit tests with types!&#8221; scream others. Both are right in some ways, and wrong in others. Twitter affords little room for nuance. But in the space of this article we can try to lay out a reasoned argument for how both can and should coexist.<\/p>\n<p><!--more--><\/p>\n<h3>Correctness: what we all really want<\/h3>\n<p>It\u2019s best to start at the end. What we really want out of all this meta-engineering at the end is <strong>correctness<\/strong>. I don\u2019t mean the <a href=\"https:\/\/en.wikipedia.org\/wiki\/Correctness_(computer_science)\" rel=\"noopener\">strict theoretical computer science definition<\/a> of it, but a more general adherence of program behavior to its specification: We have an idea of how our program ought to work in our heads, and the process of <em>programming<\/em> organizes bits and bytes to make that idea into reality. Because we aren\u2019t always precise about what we want, and because we\u2019d like to have confidence that our program didn\u2019t break when we made a change, we write types and tests on top of the raw code we already have to write just to make things work in the first place.<\/p>\n<p>So, if we accept that correctness is what we want, and types and tests are just automated ways to get there, it would be great to have a visual model of how types and tests help us achieve correctness, and therefore understand where they overlap and where they complement each other.<\/p>\n<h3>A visual model of program correctness<\/h3>\n<p>If we imagine the entire infinite Turing-complete possible space of everything programs can ever possibly do \u2014 <strong>inclusive of failures<\/strong> \u2014 as a vast gray expanse, then what we want our program to do, our specification, is a very, very, very small subset of that possible space (the green diamond below, exaggerated in size for sake of showing something):<\/p>\n<figure id=\"post-292282\" class=\"align-left media-292282\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560785031506_image.png?ssl=1\" alt=\"A large gray box with a green diamond in the bottom-right hand corner that is labeled correct.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>Our job in programming is to wrangle our program as close to the specification as possible (knowing, of course, we are imperfect, and our spec is constantly in motion, e.g. due to human error, new features or under-specified behavior; so we never quite manage to achieve exact overlap):<\/p>\n<figure id=\"post-292283\" class=\"align-left media-292283\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560785063324_image.png?ssl=1\" alt=\"The same gray box and green diamond shown earlier, but with a green border around the diamond that is slightly off center to indicate room for error.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>Note, again, that the boundaries of our program\u2019s behavior <strong>also include planned and unplanned errors<\/strong> for the purposes of our discussion here. Our meaning of &#8220;correctness&#8221; includes planned errors, but does not include unplanned errors.<\/p>\n<h3>Tests and Correctness<\/h3>\n<p>We write tests to ensure that our program fits our expectations, but have a number of choices of things to test:<\/p>\n<figure id=\"post-292284\" class=\"align-left media-292284\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560785292565_image.png?ssl=1\" alt=\"A series of red, purple and orange dots have been added to the diagram to represent different possible tests.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>The ideal tests are the orange dots in the diagram \u2014 they accurately test that our program does overlap the spec. In this visualization, we don\u2019t really distinguish between types of tests, but you might imagine unit tests as really <em>small<\/em> dots, while integration\/end-to-end tests are <em>large<\/em> dots. Either way, they are dots, because no one test fully describes every path through a program. (In fact, you can have 100% code coverage and <strong>still<\/strong> not test every path because of the combinatorial explosion!)<\/p>\n<p>The blue dot in this diagram is a bad test. Sure, it tests that our program works, but it doesn\u2019t actually pin it to the underlying spec (what we really want out of our program, at the end of the day). The moment we fix our program to align closer to spec, this test breaks, giving us a false positive.<\/p>\n<p>The purple dot is a valuable test because it tests how we think our program should work and identifies an area where our program currently doesn\u2019t. Leading with purple tests and fixing the program implementation accordingly is also known as <strong>Test-Driven Development<\/strong>.<\/p>\n<p>The red test in this diagram is a <em>rare<\/em> test. Instead of normal (orange) tests that test &#8220;happy paths&#8221; (including planned error states), this is a test that expects and verifies that &#8220;<em>un<\/em>happy paths&#8221; fail. If this test &#8220;passes&#8221; where it should &#8220;fail,&#8221; that is a huge early warning sign that something went wrong \u2014 but it is basically impossible to write enough tests to cover the vast expanse of possible unhappy paths that exist outside of the green spec area. People rarely find value testing that things that shouldn&#8217;t work don&#8217;t work, so they don\u2019t do it; but it can still be a helpful early warning sign when things go wrong.<\/p>\n<h3>Types and Correctness<\/h3>\n<p>Where tests are single points on the possibility space of what our program can do, types represent categories carving entire sections from the total possible space. We can visualize them as rectangles:<\/p>\n<figure id=\"post-292285\" class=\"align-left media-292285\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560946803236_image.png?ssl=1\" alt=\"A purple box has been drawn around the green bordered diamond in the chart to represent the boundary for different types of tests for the program.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>We pick a rectangle to contrast the diamond representing the program, because no type system alone can fully describe our program behavior using types alone. (To pick a trivial example of this, an <code>id<\/code> that should always be a positive integer is a <code>number<\/code> type, but the <code>number<\/code> type also accepts fractions and negative numbers. There is no way to restrict a <code>number<\/code> type to a specific range, beyond a very simple union of number literals.) <\/p>\n<figure id=\"post-292286\" class=\"align-left media-292286\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560948576082_image.png?ssl=1\" alt=\"Several more different colored borders are added to the diamond in the chart to represent different tests. Any tests outside of the purple box that was drawn earlier are considered invalid.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>Types serve as a constraint on where our program can go as you code. If our program starts to exceed the specified boundaries of your program\u2019s types, our type-checker (like TypeScript or Flow) will simply refuse to let us compile our program. This is nice, because in a dynamic language like JavaScript, it is very easy to accidentally create a crashing program that certainly wasn\u2019t something you intended. The simplest value add is automated null checking. If <code>foo<\/code> has no method called <code>bar<\/code>, then calling <code>foo.bar()<\/code> will cause the all-too-familiar <code>undefined is not a function<\/code> runtime exception. If <code>foo<\/code> were typed at all, this could have been caught by the type-checker <em>while writing<\/em>, with specific attribution to the problematic line of code (with autocomplete as a concomitant benefit). This is something tests simply cannot do.<\/p>\n<p>We might want to write strict types for our program as though we are trying to write the smallest possible rectangle that still fits our spec. However, this has a learning curve, because taking full advantage of type systems involves learning a whole new syntax and grammar of operators and generic type logic needed to model the full dynamic range of JavaScript. <a href=\"https:\/\/github.com\/microsoft\/TypeScript-New-Handbook\" rel=\"noopener\">Handbooks<\/a> and <a href=\"https:\/\/github.com\/typescript-cheatsheets\/react-typescript-cheatsheet\/\" rel=\"noopener\">Cheatsheets<\/a> help lower this learning curve, and more investment is needed here. <\/p>\n<p>Fortunately, this adoption\/learning curve doesn\u2019t have to stop us. Since type-checking is an opt-in process with Flow and configurable strictness with TypeScript (with the ability to selectively <code>ignore<\/code>  troublesome lines of code), we have our pick from a spectrum of type safety. We can even model this, too:<\/p>\n<figure id=\"post-292287\" class=\"align-left media-292287\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560949771576_image.png?ssl=1\" alt=\"Larger green and red box borders have been drawn around the tests. With the purple box, these represent types of tests.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>Larger rectangles, like the big red one in the chart above, represent a very permissive adoption of a type system on your codebase \u2014 for example, allowing <code>implicitAny<\/code> and fully relying on type inference to merely restrict our program from the worst of our coding.<\/p>\n<p>Moderate strictness (like the medium-size green rectangle) could represent a more faithful typing, but with plenty of escape hatches, like using explicit instances of <code>any<\/code> all over the codebase and manual type assertions. Still, the possible surface area of valid programs that don\u2019t match our spec is massively reduced even with this light typing work.<\/p>\n<p>Maximum strictness, like the purple rectangle, keeps things so tight to our spec that it sometimes finds parts of your program that don\u2019t fit (and these are often unplanned errors in your program behavior). Finding bugs in an existing program like this is a very common story from teams converting vanilla JavaScript codebases. However, getting maximum type safety out of our type-checker likely involves taking advantage of generic types and special operators designed to refine and narrow the possible space of types for each variable and function.<\/p>\n<p>Notice that we don\u2019t technically have to write our program first before writing the types. After all, we just want our types to closely model our spec, so really we can write our types first and then backfill the implementation later. In theory, this would be <strong>Type-Driven Development<\/strong>; in practice, few people actually develop this way since types intimately permeate and interleave with our actual program code.<\/p>\n<h3>Putting them together<\/h3>\n<p>What we are eventually building up to is an intuitive visualization of how both types and tests complement each other in guaranteeing our program\u2019s <strong>correctness<\/strong>.<\/p>\n<figure id=\"post-292288\" class=\"align-left media-292288\"><img decoding=\"async\" src=\"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/s_75BE9AA9C5AF355491AA7B4874002257D56913221730C2A0F068D5CA95AE6B61_1560952737247_image.png?ssl=1\" alt=\"Back to the original diagram with a green diamond representing correctness, a green border that is slightly off center that represents parameters for correctness, an orange dot in each border of the green diamond border representing tests, and a purple box border around everything to represent the possible test types.\" data-recalc-dims=\"1\" \/><\/figure>\n<p>Our <strong>Tests<\/strong> assert that our program specifically performs as intended in select key paths (although there are certain other variations of tests as discussed above, the vast majority of tests do this). In the language of the visualization we have developed, they &#8220;pin&#8221; the dark green diamond of our program to the light green diamond of our spec. Any movement away by our program breaks these tests, which makes them squawk. This is excellent! Tests are also infinitely flexible and configurable for the most custom of use cases.<\/p>\n<p>Our <strong>Types<\/strong> assert that our program doesn\u2019t run away from us by disallowing possible failure modes beyond a boundary that we draw, hopefully as tightly as possible around our spec. In the language of our visualization, they &#8220;contain&#8221; the possible drift of our program away from our spec (as we are always imperfect, and every mistake we make adds additional failure behavior to our program). Types are also blunt, but powerful (because of type inference and editor tooling) tools that benefit from a strong community supplying types you don\u2019t have to write from scratch.<\/p>\n<p>In short:<\/p>\n<ul>\n<li>Tests are best at ensuring happy paths work.<\/li>\n<li>Types are best at preventing unhappy paths from existing.<\/li>\n<\/ul>\n<p>Use them together based on their strengths, for best results!<\/p>\n<hr>\n<p>If you\u2019d like to read more about how Types and Tests intersect, Gary Bernhardt\u2019s excellent talk on <a href=\"https:\/\/www.destroyallsoftware.com\/talks\/boundaries\" rel=\"noopener\">Boundaries<\/a> and Kent C. Dodds\u2019 <a href=\"https:\/\/testingjavascript.com\/\" rel=\"noopener\">Testing Trophy<\/a> were significant influences in my thinking for this article.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Every now and then, a debate flares up about the value of typed JavaScript. &#8220;Just write more tests!&#8221; yell some opponents. &#8220;Replace unit tests with types!&#8221; scream others. Both are right in some ways, and wrong in others. Twitter affords little room for nuance. But in the space of this article we can try to [&hellip;]<\/p>\n","protected":false},"author":257132,"featured_media":292360,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"sig_custom_text":"","sig_image_type":"featured-image","sig_custom_image":0,"sig_is_disabled":false,"inline_featured_image":false,"c2c_always_allow_admin_comments":false,"footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":[]},"categories":[4],"tags":[2228,529],"jetpack_publicize_connections":[],"acf":[],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/types-tests.png?fit=1200%2C600&ssl=1","jetpack-related-posts":[{"id":341342,"url":"https:\/\/css-tricks.com\/front-end-testing-is-for-everyone\/","url_meta":{"origin":292281,"position":0},"title":"Front-End Testing is For Everyone","date":"June 1, 2021","format":false,"excerpt":"Testing is one of those things that you either get super excited about or kinda close your eyes and walk away. Whichever camp you fall into, I\u2019m here to tell you that front-end testing is for everyone. In fact, there are many types of tests and perhaps that is where\u2026","rel":"","context":"In &quot;Article&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2021\/05\/24gR1Ckg.png?fit=1200%2C900&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":316110,"url":"https:\/\/css-tricks.com\/links-on-performance-ii\/","url_meta":{"origin":292281,"position":1},"title":"Links on Performance II","date":"July 2, 2020","format":false,"excerpt":"Just had a couple of good performance links burning a hole in my pocket, so blogging them like a good little blogger. Web Performance Recipes With Puppeteer Puppeteer is an Node library for spinning up a copy of Chrome \"headlessly\" (i.e. no UI) and controlling it. People use it for\u2026","rel":"","context":"In &quot;Article&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/08\/website-lightning.png?fit=1200%2C600&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":307242,"url":"https:\/\/css-tricks.com\/react-integration-testing-greater-coverage-fewer-tests\/","url_meta":{"origin":292281,"position":2},"title":"React Integration Testing: Greater Coverage, Fewer Tests","date":"May 1, 2020","format":false,"excerpt":"Integration tests are a natural fit for interactive websites, like ones you might build with React. They validate how a user interacts with your app without the overhead of end-to-end testing.\u00a0 This article follows an exercise that starts with a simple website, validates behavior with unit and integration tests, and\u2026","rel":"","context":"In &quot;Article&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2020\/04\/end-to-end-testing.png?fit=1200%2C600&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":301886,"url":"https:\/\/css-tricks.com\/netlify-high-fives\/","url_meta":{"origin":292281,"position":3},"title":"Netlify High-Fives","date":"January 14, 2020","format":false,"excerpt":"We've got Netlify as a sponsor around here again this year, which is just fantastic. Big fan. Our own Sarah Drasner is Head of DX (Developer Experience) over there, if you hadn't heard. And if you haven't heard of Netlify, well, you're in for a treat. It's a web host,\u2026","rel":"","context":"In &quot;Article&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/10\/netlify-tiers.png?fit=1200%2C720&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":353842,"url":"https:\/\/css-tricks.com\/testing-vue-components-with-cypress\/","url_meta":{"origin":292281,"position":4},"title":"Testing Vue Components With Cypress","date":"October 27, 2021","format":false,"excerpt":"Cypress is an automated test runner for browser-based applications and pages. I\u2019ve used it for years to write end-to-end tests for web projects, and was happy to see recently that individual component testing had come to Cypress. I work on a large enterprise Vue application, and we already use Cypress\u2026","rel":"","context":"In &quot;Article&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2021\/10\/vue-cypress-testing-assembly-line.jpg?fit=1200%2C600&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":298727,"url":"https:\/\/css-tricks.com\/how-we-perform-frontend-testing-on-stackpaths-customer-portal\/","url_meta":{"origin":292281,"position":5},"title":"How We Perform Frontend Testing on StackPath\u2019s Customer Portal","date":"November 15, 2019","format":false,"excerpt":"Nice post from Thomas Ladd about how their front-end team does testing. The list feels like a nice place to be: TypeScript - A language, but you're essentially getting various testing for free (passing the right arguments and types of variables) Jest - Unit tests. JavaScript functions are doing the\u2026","rel":"","context":"In &quot;Link&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/11\/end-to-end-squiggle.png?fit=1200%2C600&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]}],"featured_media_src_url":"https:\/\/i0.wp.com\/css-tricks.com\/wp-content\/uploads\/2019\/07\/types-tests.png?fit=1024%2C512&ssl=1","_links":{"self":[{"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/posts\/292281"}],"collection":[{"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/users\/257132"}],"replies":[{"embeddable":true,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/comments?post=292281"}],"version-history":[{"count":4,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/posts\/292281\/revisions"}],"predecessor-version":[{"id":292316,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/posts\/292281\/revisions\/292316"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/media\/292360"}],"wp:attachment":[{"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/media?parent=292281"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/categories?post=292281"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/css-tricks.com\/wp-json\/wp\/v2\/tags?post=292281"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}