Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: expanded results, listed errors in EAS

...

Code Block
{
    "spatialReference": {
        "wkid": 4326
    }, 
    "candidates": [
        {
            "address": "115  MAIN ST", 
            "location": {
                "y": 37.79152826999239, 
                "x": -122.39400753158013
            }, 
            "score": 100, 
            "attributes": {
                "base_address_num": 115, 
                "base_address_suffix": null, 
                "street_name": "MAIN", 
                "street_type": "ST", 
                "zipcode": "94105", 
                "parcels": [
                    {
                        "address_base_flg": true, 
                        "blk_lot": "3717012", 
                        "unit_num": null
                    }, 
                    {
                        "address_base_flg": true, 
                        "blk_lot": "3717013", 
                        "unit_num": null
                    }
                ], 
                "units": []
            }
        }
    ]
}
Scoring

If everything matches perfectly, you should get a score of 100. If you misspell the street name by 1 letter you get a score of 98.

Results and Performance

Using this example file from Dept. of Public Health, we saw a rate of about 1000 addresses per minute with barely any load on the servers. The results on this same dataset are summarized in this table.

scorepercent
880.02
900.10
946.28
962.89
9890.29
Scoring

If everything matches perfectly, you should get a score of 100. If you misspell the streetname by 1 letter you get a score of 98.

The example file contains 4,725 rows with 908 unique addresses. 896 of these addresses (98.7%) were resolved. Where multiple addresses are returned, a reasonably good result can be expected from accepting the match with the highest score. Of the 896 addresses that were resolved, 863 (96%) returned a single result. 33 (3%) returned two addresses, and 2 (0.02%) returned three addresses. The result with the highest score was the correct candidate in all cases.

Of the remaining 12 addresses that were not resolved:

Two addresses failed due to irretrievable data-entry errors:

625 \ St
1344  St

Seven addresses were not in the enterprise address database:

509 MINNA St
24 WENTWORTH St
6 St. Louis Alley
15 ROMOLO Pl
3101 MISSION St
752 SHOTWELL St
2725 VAN NESS Ave

Five listed extra information or multiple street numbers, and were resolved manually:

2358 2362 15th St2362 15th St
2855 2857 2859 BUSH St2855 Bush St
3354 3358 cesar chavez St3354 Cesar Chavez St
654 GRANT Ave  2654 Grant Ave
3148 CESAR CHAVEZ BLDG#14 St3148 Cesar Chavez

 

Client

Here is a working command line client.

...

The server code is here and here.

The tests are over here.

Invalid Address List

The following apparent errors in the EAS were discovered in the course of debugging and examining the results of this geocoder:

AddressError Description
15 ROMOLO PlNot in EAS, although the parcel is mapped
3148 CESAR CHAVEZ BLDG#14 StMany addresses for one parcel
752 SHOTWELL StNot in EAS, although the parcel is mapped
670 NatomaLinked to two APNs, both of which are retired
1443 Clayton StLinked to many other addresses, probably erroneously
1411 Mason StLinked to an adjacent parcel, probably erroneously

Attachments